Slashdot Mirror


Bulk Data Storage For The Common Man?

Vigyaan writes "Lately, I have been looking into different bulk data storage options available to a common man. My work depends on generating, storing and analyzing a large amount of data -- averaging about 1 TB per month. I would like to have a storage system which is automated, fast, reliable and most importantly does not cost the price of an eye. Right now, I have a 4 node Linux cluster with 10 large hard disks (total capacity 1.6 TB); data storage roughly costs about $0.60/GB (excluding the cost of PC hardware). But long term storage is painful -- DVDs cost about $0.10-$0.15/GB but takes too much human time and leaving data on hard disks makes me nervous because of possible failures. RAID is a possibility, but it increases the cost significantly. I was wondering, if Slashdot readers have any recommendations for a cheap automated way to store and retrieve data."

483 comments

  1. Finally a use for my 1GB Gmail invites... by anakin357 · · Score: 4, Funny

    I'll send you a couple.

    --
    http://www.fsckin.com/
    1. Re:Finally a use for my 1GB Gmail invites... by kyknos.org · · Score: 1

      send it to me please :)

      --

      SHE does throw dice.
    2. Re:Finally a use for my 1GB Gmail invites... by Usquebaugh · · Score: 0, Offtopic

      You'll need more than a couple, 1TB=1,000GB

    3. Re:Finally a use for my 1GB Gmail invites... by G-funk · · Score: 0, Offtopic

      I know somebody who'd appreciate them ;-)

      *whistles innocently*

      --
      Send lawyers, guns, and money!
    4. Re:Finally a use for my 1GB Gmail invites... by macdaddy · · Score: 1, Offtopic

      Oh please............. Pretty please.... With sugar and 1GB RAM sticks on top?

    5. Re:Finally a use for my 1GB Gmail invites... by mabinogi · · Score: 1

      hmmm...the GMail thing must be wearing off, you've only got 4 responses wanting your invites.

      --
      Advanced users are users too!
    6. Re:Finally a use for my 1GB Gmail invites... by FuzzzyLogik · · Score: 0, Offtopic

      if you send a message and 1gig of ddr 400 i'll send ya a gmail invite :-P seriously, email me.. unlogikal@yahoo.com

    7. Re:Finally a use for my 1GB Gmail invites... by herrvinny · · Score: 1

      I could use an invite, auhthesis (the at sign) YAHOO (dot as always) com

    8. Re:Finally a use for my 1GB Gmail invites... by Anonymous Coward · · Score: 2, Funny

      That's auhthesis@yahoo.com ? Just wanted to make sure I got it right. auhthesis@yahoo.com . I'm guessing you'd want to have auhthesis@gmail.com as your e-mail address? Invitation is on its way!

    9. Re:Finally a use for my 1GB Gmail invites... by EvilTwinSkippy · · Score: 4, Funny

      Nah, just tarball your backup into 1 or 2 GB file sizes, name it "PR0N XXX TEEN SEX DONKEY LOVE - MILITANT ISLAMIC BUKAKKE KITTEN.MPG.AVI.WMV" and share is on Gnutella.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    10. Re:Finally a use for my 1GB Gmail invites... by anakin357 · · Score: 3, Informative

      An invitation has been emailed to your friend.

      Yee. Sent someone else who replied a invite too.

      Mod this up and I'll send you one too. :P

      --
      http://www.fsckin.com/
    11. Re:Finally a use for my 1GB Gmail invites... by macdaddy · · Score: 1

      LOL. I don't know that it's really worth it. I've got 480GB of space on my own mail server to play with (I could never delete mail again!). I'd be curious to play with gmail but it's not necessary for my email survival. I'll keep you in mind if I come across some extra pc3200 though! :-)

    12. Re:Finally a use for my 1GB Gmail invites... by FuzzzyLogik · · Score: 1

      ok, off the invite goes to another person who asked. i wasn't real serious about the ram :-P lol

    13. Re:Finally a use for my 1GB Gmail invites... by e133tc1pher · · Score: 2, Interesting

      Actualy you may have something here... Lets say you want to back up your precious config files in your /etc/ directory, just take a real porn picture and use a stenography program to hide your config file in it. Hell, if its a real good picture, you can probly get away with it bieng a couple megs. Just share them on your favorite file sharing service, if they support chat hype them up. I wouldn't use this as even a backup to your real backup, but if all hell breaks loose, you know where to go : )

    14. Re:Finally a use for my 1GB Gmail invites... by macdaddy · · Score: 1

      Hehe, at least someone gets it. Have a good one.

    15. Re:Finally a use for my 1GB Gmail invites... by Spackler · · Score: 0, Offtopic

      But if I mod it, I can't post to it (you won't know who I am)

      But if I post to it, I can't mod it (you won't be modded up)

      Infinite loop. Thank goodness the gmail account you will invite me to has enough space to handle infinity.

    16. Re:Finally a use for my 1GB Gmail invites... by febuiles · · Score: 0

      It wouldn't hurt to get one at nospampleasethanks-febuiles@inbox.lv...:D

    17. Re:Finally a use for my 1GB Gmail invites... by complete+loony · · Score: 0, Offtopic
      Gee, they're not worth that much... anymore.

      There's still heaps on ebay going fairly cheap these days. But I'm a cheap lazy bastard, so if anyone is feeling charitable...

      --
      09F91102 no, 455FE104 nope, F190A1E8 uh-uh, 7A5F8A09 that's not it, C87294CE no. Ah! 452F6E403CDF10714E41DFAA257D313F.
    18. Re:Finally a use for my 1GB Gmail invites... by Captain+Splendid · · Score: 1
      That's actually a method I use...sorta.

      Let's say I downloaded a really good file from wherever, and it's the best rendition of Long Tall Sally/animal porn clip/linux distro I've ever seen. I'll make an effort to spread that file around. Then, when I accidently delete it, I know I have a fairly good chance to recover it if I want.

      Like you said, not to be used with critical data, but it does seem that the internet will never let anything die.

      --
      Linux, you magnificent bastard, I read the fucking manual!
    19. Re:Finally a use for my 1GB Gmail invites... by superpulpsicle · · Score: 2, Informative

      People are so hyped up about Gmail. Did you know that www.hotmail.com is NOT effectively backed up as of 2002 (cough, inside source, hmm).

      Knowing how long hotmail and M$ has been around, and still failed to backup hotmail with their infinite windows license. What makes you think your 1 Gig will be backed up by Google.

    20. Re:Finally a use for my 1GB Gmail invites... by commodoresloat · · Score: 0, Offtopic
      OK I modded you up; send me one.

      Of course, now that I posted, the mod has been revoked. But send me one anyway!

    21. Re:Finally a use for my 1GB Gmail invites... by Suidae · · Score: 0, Offtopic

      If you insert the files into freenet, you don't have to worry about mislabeling them. Just put the keys to the file into a job that frequently requests the files from various entrypoints into the network to be sure they stick around.

    22. Re:Finally a use for my 1GB Gmail invites... by gcaseye6677 · · Score: 1, Offtopic

      In other words, a Beowolf cluster of GMail accounts.

    23. Re:Finally a use for my 1GB Gmail invites... by vaibhavkhattri · · Score: 0, Offtopic

      Send me one please :-)

      at vaibhav_khattri@yahoo.com

    24. Re:Finally a use for my 1GB Gmail invites... by __aafkqj3628 · · Score: 2, Informative

      (cough, inside source, hmm)

      Inside source? Just call them up and ask! It's not hidden knowledge.

    25. Re:Finally a use for my 1GB Gmail invites... by richie2000 · · Score: 1
      the internet will never let anything die.
      Guybrush Threepwood.

      Exactly. :-D

      --
      Money for nothing, pix for free
    26. Re:Finally a use for my 1GB Gmail invites... by Anonymous Coward · · Score: 0, Offtopic

      You should add a .zip at the end, so that more will download it to the bitter end!

    27. Re:Finally a use for my 1GB Gmail invites... by ColaMan · · Score: 1

      Ironic that you use a yahoo! address to offer a gmail invite.

      --

      You are in a twisty maze of processor lines, all alike.
      There is a lot of hype here.
    28. Re:Finally a use for my 1GB Gmail invites... by mrchaotica · · Score: 1

      Well, you don't have to have a backup of your backup, and the OP was talking about using Gmail as a backup.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    29. Re:Finally a use for my 1GB Gmail invites... by Anonymous Coward · · Score: 0

      http://www.gmail-is-too-creepy.com/gprofile.html

    30. Re:Finally a use for my 1GB Gmail invites... by EvilTwinSkippy · · Score: 2, Interesting
      Yes you do have to backup your backup. This is why I abhor backing up to disk. (Or at least a single disk.) The advantage of tapes, for all their warts, is that you have several copies of them going back through time. When a user shows up at my desk looking for a file that MAY have been on the array a week ago, something that get's mirror (and only mirrored) once a night isn't going to cover it.

      You always need at least 3 generations of backup. The Current backup, the "father", and the "Grandfather." These are complete backups, not incremental. And you need them in case you run into a media error. In our case we keep the last week of tapes, a weekly backup from the last 30 days, a monthly backup from the last year, and a yearly backup starting at the dawn of time.

      If the data isn't worth backing up properly, you might as well not bother backing it up at all.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    31. Re:Finally a use for my 1GB Gmail invites... by MysteryMilk · · Score: 0, Offtopic

      Got any Gmail invites left? Please? :)

      --
      ~~ Scott If animals aren't supposed to be good for you, why are they made of meat?
    32. Re:Finally a use for my 1GB Gmail invites... by red+floyd · · Score: 0, Offtopic

      &ltAOL>
      ME TOO!
      </AOL>

      --
      The only reason we have the rights we have is that people just like us died to gain those rights. -- Cheerio Boy
    33. Re:Finally a use for my 1GB Gmail invites... by mrchaotica · · Score: 1

      I just meant you can make all your backups from the originals; you don't need to make backups from your backups.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    34. Re:Finally a use for my 1GB Gmail invites... by Hortensia+Patel · · Score: 1

      Well, as others have pointed out, I can't. But I'll promise to mod up the next post of yours I see in another thread.

      Please? Pretty please? Dammit, Excellent karma has to be good for something. (Well, it was excellent before these Offtopic mods, anyhow.)

      </grovel>
  2. I have one by Anonymous Coward · · Score: 0, Funny

    Floppy disks.

    1. Re:I have one by Anonymous Coward · · Score: 1, Funny

      Punchcards

    2. Re:I have one by Anonymous Coward · · Score: 0

      Binders full of zeros and ones.

    3. Re:I have one by Chess_the_cat · · Score: 2, Funny

      Just do what I do: memorize it all. No need for binders, cards, floppies, HD, DVD.

      --
      Support the First Amendment. Read at -1
    4. Re:I have one by Anonymous Coward · · Score: 0

      Clay tablets

    5. Re:I have one by Anonymous Coward · · Score: 0

      i personally keep a basement full of asian teenagers and make Them memorize the binary code

      but i was thinking about going with compression, having them memorize it in hex instead

  3. Give Up Now by Anonymous Coward · · Score: 0

    "..hard disks makes me nervous because of possible failures." everything can possibly fail. I do not know the failure rate of DVDs, but I suspect it is very low and comparable to other mediums. Anyone have a figure on this?

    1. Re:Give Up Now by Zone-MR · · Score: 5, Informative

      No figures, but I think the opposite. I've had several DVD-R disks which I've written backups to only to discover that they are unreadable a year later. My personal experience has been that HD's are unreliable, but less unreliable than writable DVDs.

      Of course higher quality media might be better, but then you can no longer quote the $0.10/GB figure.

    2. Re:Give Up Now by ePhil_One · · Score: 4, Insightful

      Now call me crazy, but have folks completely forgotten the age old solution, TAPE? A SDLT tape goes for about $50 and holds about 320GB, LTO holds even more, and I believe Quantum has just released the latest generation of SDLT. While its not "cheap" an autoloader can be had for about $15,000 that can backup many TB hands off. Might be a bit much initially, but it the best solution long term

      --
      You are in a maze of twisted little posts, all alike.
    3. Re:Give Up Now by tchuladdiass · · Score: 4, Informative

      Come on, this is Slashdot. A tape changer doesn't have to cost that much money if it's make of lego (shamelessly pulled from an earlier slashdot story which I can't find at the moment).

    4. Re:Give Up Now by kentsin · · Score: 1

      BUT Tapes outdate very soon

    5. Re:Give Up Now by Anonymous Coward · · Score: 0
      The problem is most of us have 8mm tapes sitting around from a previous time we did a backup of something important to tape, only to find that the tape-drive-vendor's long dead, and the tape device is long dead too.


      Better to keep the data "live" so as your systems get upgraded, your backups get updated/refreshed as well.

    6. Re:Give Up Now by Anonymous Coward · · Score: 0

      Not if you used Sony. AIT has been around quite a while, and Sony's releasing AIT-4 soon, plus cheaper versions of AIT, AIT-2, and AIT-3 drives. Any later generation can handle the earlier generations.

    7. Re:Give Up Now by putaro · · Score: 1

      Who do you think made all the 8mm tape drives? Those were all Sony internals.

    8. Re:Give Up Now by soward · · Score: 1

      That's 320G compressed, 160 raw, and I've not been able to find the tapes for $60 each. Not to mention that a single drive costs $2500+. Biggest problem is speed, ~6-8M/sec on a good day. $1000 gets you a laCie external 1TB firewire 800 drive which can move data at ~45M/sec. Guess it depends on how much one needs to keep, for how long, and what the backup window is.

      --
      John Soward...University of Kentucky
    9. Re:Give Up Now by ePhil_One · · Score: 1

      Most tapes drives remain backwards compatible, so those new SDLT 320 drives will read the old DLT 10/20 format that is about 8 years old at least. By sticking with an accepted corporate format you're pretty safe that something will be able to read the data in the future. If your looking to 50 year recoverability, do what the pro's do and lock a few new computers away, etc.

      --
      You are in a maze of twisted little posts, all alike.
    10. Re:Give Up Now by ePhil_One · · Score: 1
      If you couldn't find the tapes for $50 you haven't looked in a while, or didn't look very well. Insight and Dell both had them for under $50 each, I imagine I could have gone to a local vendor as well since in the past I've had good luck with getting my local vendor to match prices on items such as Bulk tape orders, not to mention any of the other large online vendors such CDW.

      Heck, Pricewatch even lists them.

      --
      You are in a maze of twisted little posts, all alike.
    11. Re:Give Up Now by Nutria · · Score: 1

      those new SDLT 320 drives will read the old DLT 10/20 format that is about 8 years old at least.

      There is no such thing as DLT 10/20. 10GB/20GB would have been the TZ87 format, which went on DLT3 tapes.

      SDLT will read, but not write, DLT4 tapes, which came out about 10 years ago. AFAICR, the oldest format for DLT4 tapes is 20GB native (40GB compressed).

      --
      "I don't know, therefore Aliens" Wafflebox1
    12. Re:Give Up Now by Nutria · · Score: 1

      Biggest problem is speed, ~6-8M/sec on a good day. You haven't optimized for speed, then. Are you overloading the SCSI bus?

      --
      "I don't know, therefore Aliens" Wafflebox1
    13. Re:Give Up Now by ReaperEB-Moo · · Score: 1

      what type of DVD writer do you have?? I know a few people that have pioneer DVD writers, and have ran into issues where they've done backups to the DVD-r and then not been able to read the backups back later. I guess there's a small piece of software that you have to get, that doesnt come bundled with the writer. Not sure what that is, if i find out I can let you know.

    14. Re:Give Up Now by WuphonsReach · · Score: 2, Interesting

      The problem is most of us have 8mm tapes sitting around from a previous time we did a backup of something important to tape, only to find that the tape-drive-vendor's long dead, and the tape device is long dead too.

      Sounds trite, but EBay to the rescue.

      I started with a single Exabyte 8mm backup drive and picked up 2 more on EBay for around $150. (This was a few years ago even and the original drive had been given to me with 50-60 used/new tapes.) Now that I have a DVD-burner, those drives don't do me much good (too small of capacity, and way too slow compared to DVD).

      Rule of thumb for corporate / business use is that you always buy 2 or 3 of any mission-critical backup hardware. That way, if one of the units breaks, you still have the other to rely on while the first is either fixed or replaced.

      Having identical backup hardware and software at another location is also a great idea.

      --
      Wolde you bothe eate your cake, and have your cake?
    15. Re:Give Up Now by WuphonsReach · · Score: 2, Informative

      First rule of archiving data on optical media:

      It will get scratched and damaged.

      Which means that unless you're adding recovery data (using QuickPar) or burning 2 copies, you will lose at least some data on the media within a few years. (Cheap media sometimes only lasts a few months if not stored in dark and climate controller conditions.)

      QuickPar is nice because you can pick how much redundancy you want on the disc. I find that 5-10% is plenty for most uses and guards against all but catestrophic damage to the disc.

      (The guideline for redundancy is based on how often you check the media vs how fast the media degrades or is damaged. If the media degrades at a rate of 1% per month and you only verify the disc annually, you'll want at least 12% redundancy but more like 18% redundancy.)

      --
      Wolde you bothe eate your cake, and have your cake?
  4. Hard disks by ConsumedByTV · · Score: 5, Informative

    You're always going to get a better rate with Hard drives but you're going to be prone to failure.

    If you buy them in bulk you can save.

    Burning DVDs is going to take you forever and drive you nuts.

    Find a hotswappable set of drives and use that for your offline backups. Use a raid for your current backups.

    --


    "Not my manner of thinking but the manner of thinking of others has been the source of my unhappiness." - M
    1. Re:Hard disks by Rik+van+Riel · · Score: 1

      My personal option for smaller data sets would be hard disks too. Specifically, hard disks in external fast-usb or firewire enclosures, so they really become hot pluggable.

      A friend tells me that 20GB tapes are cheap. With a simple tape changing robot that might be manageable, just put in a new tape container every day...

    2. Re:Hard disks by momogasuki · · Score: 0

      My company uses hard drives for backups. We make backups on redundant hard drives, some of which are located off site.

      Hard drives are prone to failure, but so are DVDs or any other storage medium. We chose hard drives because they are cheap, fast, and automated backups are easy.

    3. Re:Hard disks by littlerubberfeet · · Score: 4, Informative

      hard disks are good.

      If you want one of those nifty things with robotic arms and whatnot, plan on spending upwards of $3500. The AIT Automated Tape Library goes for that much and holds only 15 tapes. Plan on spending tens of thousands for something like Ampex's DIS 914 for 30 Terabytes.

      Your friend is right: tapes or cheap. The equipment needed to support them is expensive, slow and error prone. It gets cost effective once you have enough money for a new Porsche though...

      --
      Sig (appended to the end of comments you post, 120 chars)
    4. Re:Hard disks by Anonymous Coward · · Score: 0

      How can that be modded informative is a total mystery to me. Redundant at best would do the trick.

      You are basically re-writing the original story with your own words. How is that supposed to help that poor guy?

    5. Re:Hard disks by ConsumedByTV · · Score: 2, Interesting

      I personally use a firewire enclosure, it's fast, it's hotpluggable and it's easy to swap the internal disk.

      --


      "Not my manner of thinking but the manner of thinking of others has been the source of my unhappiness." - M
    6. Re:Hard disks by ConsumedByTV · · Score: 1

      Not really, I am agreeing that harddrives are the right way to go.

      --


      "Not my manner of thinking but the manner of thinking of others has been the source of my unhappiness." - M
    7. Re:Hard disks by DetrimentalFiend · · Score: 5, Interesting

      We're dealing with storage issues right now at work, and what we're doing is buying a server with 8x250 GB SATA drives. We then run the drives in raid 5, so we have 1.75TB of storage space (unformatted). Including computer costs, it's running us about $2.50 per GB, but it's a very beefy 3u server. For backup, we're currently backing up to tape. That costs us under $0.50 per GB with ultrium tapes. For some of our data, we've been backing up to DVD's, but we've pretty much given up on that. In the long run, it's not worth it.

    8. Re:Hard disks by silas_moeckel · · Score: 2, Informative

      If your going to just plug in backup and swap try the USB 2.0 to IDE backup boxes pretty much its a power brick and an US to IDE chipset in a plastic case with a 40 pin IDE connector on it. You plug in the drive and your good to go. No cases or hot swap caddies to deal with. And 5400 RPM drives dont get hot to the touch sitting on the desk. It's not pretty but if your just running backup keep on buying $100 IDE disks (generaly best cost per GB)

      --
      No sir I dont like it.
    9. Re:Hard disks by eric76 · · Score: 3, Informative

      Tapes can be pretty dependable, but you need a better quality tape system than that typically sold for PC backups. The 20 GB tapes are just not that dependable.

      If I had the money, at a minimum I'd get a tape drive that could handle the 200 GB (uncompressed) tapes. Something like IBM's LTO Gen-2 Tape Library. That should run a bit less than $6,000.

      For that matter, if I won the lottery, my first purchase would probably be a top of the line tape backup system instead of a the usual new car.

      Since I can't afford it, I use DVDs and CDs for backups. They are a pain in the neck and are not that dependable, but I keep backups up to a year on DVD+RW so if one fails, hopefully the others will have the data.

      Instead of writing directly to the DVD writer, I write the backups to disk and then copy the backup sets to the DVDs.

      I also keep a complete current backup of nearly everything important on a seperate computer.

    10. Re:Hard disks by Anonymous Coward · · Score: 0

      I'm in a graphic design house with loads of volume. We have 3TB of 'active' data at any given time. We have a decent hardware RAID 5 device (Ultratrak from Promise) that uses low-cost IDE drives. (250MB each)

      Old data is burned to DVD once a week or so. It's a big task but we have no alternative with our volume.

      The part that will interest you is our backup system. We salvaged a cheap old Celeron and whacked in several IDE expansion cards, with a load of IDE drives setup as software RAID.

      We then wrote a daemon to scan the main storage device and duplicate the files on the massive backup device. Files deleted on the main storage are kept on the backup drive for 14 days before being deleted there - this ensures accidentally deleted files are not lost.

      Because we only copy new / changed files this only takes up 4 hours each night. Making the initial mirror takes a couple of weeks though. I believe in RAID at all times, even if only software RAID like in our backup system where uptime is not critical to operations.

    11. Re:Hard disks by Naffer · · Score: 2, Funny

      I think you're absolutly right.
      Tapes or cheap.

    12. Re:Hard disks by ericdano · · Score: 1
      Thats very interesting. So, the Celeron has roughly the same amount of storage as the main RAID?


      And it's a software RAID? Wouldn't it be better to get a hardware controller to do that?

      --
      It's either on the beat or off the beat, it's that easy.
      I moderate therefore I rule!
      --
    13. Re:Hard disks by eric76 · · Score: 2, Informative

      Use RAID to increase your on-line availability.

      RAID does not a backup system make. You still need backups.

      For increased on-line availability, how about a good distribued file system with several servers? And, of course, back everything up anyway.

    14. Re:Hard disks by Anonymous Coward · · Score: 0

      If needs that much data and it's critical for his work, then there's just no way around it for storage and safety. He needs RAID arrays in that cluster of his.

    15. Re:Hard disks by Bilestoad · · Score: 1

      Get one from wiebetech.com and you don't even have to swap the disk - it's a unit that plugs right into the back of a bare IDE drive.

    16. Re:Hard disks by RMH101 · · Score: 1

      they're not that cheap, and they're no good for quick retrieval. in the UK, 40/80DLT tapes go for around 60UKP each, as opposed to a 120GB HDD for the same money. apples and oranges, of course, but it's a myth tapes are cheap.
      i'd say your cheapest option is a *lot* of hard disks - either in a large tower with a lot of IDE cards or as a SAN. big storage done properly is *expensive* though.

    17. Re:Hard disks by AsbestosRush · · Score: 1

      Easier, probably. Cheaper... probably not.

      --
      EveryDNS. Use it. It works.
      AC's need not reply
    18. Re:Hard disks by WuphonsReach · · Score: 1

      When buying exteranl USB/firewire enclosures for archiving / off-site backup, it's important to pay attention to whether the box uses an internal power supply or an external power brick.

      Brick - Enclosure is smaller and easier to carry, but you have to keep track of the power brick as well or it's useless. If you go this route for long-term archival, make sure you own at least 3 identical units with interchangeable power bricks and store at least one power brick offsite along with the back up drive.

      Built-in - Larger enclosure (although the CA-405U2 is small enough). Big advantage is that since it only requires a standard PC power cord, you now have one less thing to keep track of.

      I'll second the opinion that it's best to use 5400rpm drives for external enclosures. While the CA-405U2 enclosure has a fan, some USB enclosures don't and will cook most 7200rpm drives unless you work in a fully air-conditioned office. A 5400rpm drive is a lot more forgiving.

      --
      Wolde you bothe eate your cake, and have your cake?
    19. Re:Hard disks by WuphonsReach · · Score: 1

      DVD is fine for long-term archival, especially if you add parity data to the disc. (I usually add 5-10% recovery data to allow for discs getting scratched or otherwise damaged.) I find CD/DVD (and back when floppies were realistic) to be too much of a hassle, which means that backups don't always get done.

      For day-to-day backups I use 250-300GB 5400rpm drives in removable caddies (StarTech.com's DRW115 series). Base unit is $60 or you can buy the components separately: extra bays are $20-$30, extra caddies are $50-60. By rotating 3 or 4 of those drives, I have the convenience of tape without the cost of tape.

      Combine that with software like Second Copy 2000 (sits in the system tray, plays nicely with the system, does backups in the background, doesn't use a proprietary storage format) and you have something that simply works. No muss, no fuss, and no procrastination.

      --
      Wolde you bothe eate your cake, and have your cake?
    20. Re:Hard disks by ecloud · · Score: 1
      ...Ampex's DIS 914 for 30 Terabytes...enough money for a new Porsche...

      Yeah maybe that model number is no coincidence. (Well they don't make new 914's anymore I guess. Maybe the newest "budget" model will be called the Tapester.)

    21. Re:Hard disks by ericdano · · Score: 1

      But would it be safer? I think so. I really wouldn't trust software RAID at all........

      --
      It's either on the beat or off the beat, it's that easy.
      I moderate therefore I rule!
      --
    22. Re:Hard disks by Anonymous Coward · · Score: 0

      And it's a software RAID? Wouldn't it be better to get a hardware controller to do that?

      There are so many factors involved in this. For hardware RAID, if the controller dies you need to replace the controller. For software RAID, any pc will do. In terms of speed, the differences are how much cpu time software takes (which is moot if the computer does nothing more than the software disk) the bus speeds involved and perhaps interrupts, and the type of RAID being used (RAID5 is more compute intensive than the rest). These days software RAID recovery generally takes more work (e.g. type in commands to recover) whereas hardware raid is plug and forget.

      Whether one is better for your application depends on these factors and more.

      P.S. - for disk storage i prefer to use rsync and hardlinks. Write up a script to rsync every X time then hardlink to that data. When you rsync next, the link is broken for the changed files, so you have a copy of old stuff. add some rotation into the mix and you have your copies.

    23. Re:Hard disks by ConsumedByTV · · Score: 1

      The cost of those enclosures is out of fucking hand.

      The ones that plug into the back of the disk are even worse.

      $400 to not bother opening the case of a removable hard disk container is nuts.

      I mean, it's a slick design and it's nice to have all of those ports, but that's crazy for that price.

      For under $200 I just got a 250 gig drive (7200rpm) and it does firewire 400/usb1.1/2.0

      And that was for the *expensive* enclosure!

      --


      "Not my manner of thinking but the manner of thinking of others has been the source of my unhappiness." - M
  5. 1024 GMail Accounts by OrangeCarrot · · Score: 0, Redundant

    Hook yourself up with as many gmail accounts as you can. Email yourself 1GB chunks of data whenever you need to back up your stuff.

    1. Re:1024 GMail Accounts by Chess_the_cat · · Score: 2, Informative

      Too bad the maximum attachment size is 10MB.

      --
      Support the First Amendment. Read at -1
    2. Re:1024 GMail Accounts by HungWeiLo · · Score: 1

      RAR. 10MB chunks. Email attachment script. Bwahahaha.

      --
      There are a huge number of yeast infections in this county. Probably because we're downriver from the bread factory.
    3. Re:1024 GMail Accounts by Myen · · Score: 1

      Plus the encoding overhead (unless you whip up your own MTA to send unencoded 8bit data - in which case you pretty much just hope it gets to the other end intact).

      Can you even fit a 800MB CD image in a GMail account?

    4. Re:1024 GMail Accounts by Chrispy1000000+the+2 · · Score: 0

      File fragments, friend, file fragments.

      --
      Sig
    5. Re:1024 GMail Accounts by scubacuda · · Score: 1
      Hook me up with one of those e-mail attachment scripts.

  6. Waiting for ... by Entropy · · Score: 3, Interesting

    Blu ray based dvd burners.

    Those will be sweet =)

    --
    The sea changes color, but the sea does not change.
    1. Re:Waiting for ... by ericdano · · Score: 1

      Yeah. What, 20 gigs a disc? That might be a good solution. I have a 700Gig RAID that I kind of want to backup.......regular DVDs are not going to cut it (@ 200 discs.....not!)

      --
      It's either on the beat or off the beat, it's that easy.
      I moderate therefore I rule!
      --
    2. Re:Waiting for ... by broller · · Score: 1

      I thought those were 50gb, but I'm also too lazy to go look it up from the earlier slashdot article. :)

      "20 gigs a disc? That might be a good solution. I have a 700Gig RAID "

      You consider 35 discs instead of 200 a good solution? Even if I'm right and it's 50gb, that's still 14 discs... not a good solution IMO. A great solution would be 1-5 per backup. By the time we get those discs we'll be talking at least 7TB+, so... :/

    3. Re:Waiting for ... by ericdano · · Score: 1
      Depends on how much the data changes. If the data doesn't change much, then 35 discs is not a bad way to backup......

      If data changes more, then........it won't be a good solution.

      --
      It's either on the beat or off the beat, it's that easy.
      I moderate therefore I rule!
      --
  7. good luck by Madcapjack · · Score: 4, Funny

    PRINTSCREEN should do the trick.

    1. Re:good luck by mlk · · Score: 2, Funny

      please dont joke about that, not too long ago I received an email, please help me open this word document. Complete with a BMP, natrally I assumed this was the error message. Nope, it was a "print screen" of its Icon. And yes, while I had said picture up, the user did try double clicking.

      Poor luser.

      --
      Wow, I should not post when knackered.
    2. Re:good luck by Anonymous Coward · · Score: 0

      That isn't even close to being understandable.

    3. Re:good luck by schtum · · Score: 3, Funny

      Translation:

      Please don't joke about that. Not too long ago, I received an email asking for help opening a Word document. Attached was a bitmap image which I naturally assumed was an error message. Instead, it was a screen-capture of the document's icon! The user was double-clicking on the image!

      So I shot her.

  8. !RAID by therandthem · · Score: 1, Informative

    On the subject of RAID please remember, if it's spinning, it ain't a backup!

    1. Re:!RAID by Anonymous Coward · · Score: 0

      Why not ?

      I have a number of drives that are older than many CDs I have burned and that then failed.

    2. Re:!RAID by ecalkin · · Score: 4, Insightful

      because it protects against device failure, not *user* error. if you delete a file from a raid array, it's gone. that's part of what offline is all about.

      eric

    3. Re:!RAID by slashjames · · Score: 1

      I'll second that. Be aware that RAID does have a downside. All the disks are usually) hooked up to the same power supply. If the power supply goes nuts, it can fry more than 1 disk (which is all that RAID 5 gives you for redundancy). A better option with a large number of drives (> 4) is RAID 10 (not 0+1 or 1+0). Then more drives can fail simultaneously and you can probably get back some/all of the data.

    4. Re:!RAID by devilspgd · · Score: 1

      Drives don't need to be in a live RAID configuration, it could be an occasional mirror.

      --
      Give a man a fish, he'll eat for a day, but teach a man to phish...
    5. Re:!RAID by Anonymous Coward · · Score: 0

      Yes, but it deletes all your files so reliably and quickly!

    6. Re:!RAID by MasTRE · · Score: 2, Insightful

      > because it protects against device failure, not *user* error. if you delete a file from a raid array, it's gone. that's part of what offline is all about.

      You can add to that getting hacked. They can't hack your off-line data.

      --
      Must-not-watch TV!
    7. Re:!RAID by Glonoinha · · Score: 1

      Bingo.
      On my home network I run two 80G IDE drives in Glo-RAID configuration.
      Two different computers, one drive in each. One is shared to everybody, the other isn't shared except during synch time.
      Every once in a while when everything is cool I simply copy all the files from the working drive to the other drive.

      Protects vs. virus, end user deleting / overwriting files, 100% hardware redundancy (regular RAID generally doesn't protect against a smoked CPU, power supply, motherboard, bad RAM, bad IDE cable, etc.), and what have you.

      The only problem with applying it to the OP's problem is being able to do monthly archives of a full T of data. I guess you could simply buy big drives and how swap trays - hard drives are pretty long lasting in cold storage unless you drop them.

      --
      Glonoinha the MebiByte Slayer
    8. Re:!RAID by femtoguy · · Score: 1

      I faced this set of questions in our department and had to find not only a solution, but a way to discuss the problem with technical neophytes. In the end I identified backups into disaster recovery backups (a drive or a server blows up) and archival backups (you wish you had an older copy of a file you have edited). For initial disaster recovery, we have a RAID 5+0 set-up with redundent servers, so a lot of hardware has to fail before we are in trouble. For archiving, we do a nightly copy onto a firewire drive (250 GB), with 4 drives in rotation, and one stored off-site. If somebody needs a deleted file from the last week we have it. For longer term, we are looking into a system which writes all files modified in the last 24 hours to a CD or DVD, and then writes the contents of the disk in an index folder. It's no good for disaster recovery, but if somebody needs a copy of a file from last thursday, the index tell which DVD to look at and then you pull the disk and restore the file. Permanent cheap storage, and fast to access. Of course we don't have the code finished yet, but . . .

    9. Re:!RAID by Nutria · · Score: 1

      All the disks are usually) hooked up to the same power supply.

      Nowhere I've ever worked.

      Dual-redundant power supplies and SCSI controllers are the norm in an enterprise that cares about it's data.

      --
      "I don't know, therefore Aliens" Wafflebox1
  9. Wirewire drives? by NanoGator · · Score: 5, Interesting

    For long term storage, how do you feel about firewire drives? Maybe not as cheap as you'd like, but you can get them in >160 gig flavors, plus you can hook them up to just about anything. Once you do the backup, which'd be a simple copy and paste, you can just unplug the drive and store it in a safe or something.

    Again, I'm not sure if that's as cheap as you'd want, but that's a solution I came up with for a similar problem. My company's going to be 3D rendering some stuff that could end up eating 50 megabytes a frame. (Extra data is stored for future refinement... I can go into detail if I've piqued anybody's curiosity.) We can't afford to lose this data, so the Firewire drive approach is what we're considering right now.

    --
    "Derp de derp."
    1. Re:Wirewire drives? by Rik+van+Riel · · Score: 3, Informative
      For long term storage, how do you feel about firewire drives? Maybe not as cheap as you'd like,

      Oh, but they are cheap. Just buy a large IDE disk and a $30 firewire/fast-usb enclosure.

      I'm just not sure about the "long term", though. I have no idea what the shelf life of a hard disk is.

    2. Re:Wirewire drives? by littlerubberfeet · · Score: 5, Informative

      Lemme address the firewire thing: I work in a sound studio, and we generate about 5-8 gigs of data a month, mostly music for TV. This isn't a huge amount, but we rely on multiple sets of Firewire drives for backup and then internal hard drives for current projects. This means we have all 400 or so projects at our fingertips. Given how fast we do things, this is important.

      Lacie makes their 1 terabyte firewire (943 gigabyte formatted) drive. I we get them for $1,080 a drive (Macmall matched Provantage's price). This is more then the article author spends now per gig, but these drives have done quite well in the studio. You can find cheaper firewire though.

      We are at the point where hard drives give the best bang for the buck. The only fault of firewire is that my bosses have burned several bridges. ground yourself before unplugging the drives. The bridges were cheap though. In any case, hard drives are probably the most failsafe and cost effective solution, with firewire being the easiest interface to use those drives with.

      --
      Sig (appended to the end of comments you post, 120 chars)
    3. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      So you have data that you can't afford to lose and you're going to store it on a single disk bought from Target? Sounds like a pretty questionable solution.

    4. Re:Wirewire drives? by NanoGator · · Score: 1

      "So you have data that you can't afford to lose and you're going to store it on a single disk bought from Target?"

      As opposed to a bunch of recordable DVDs also bought from Target?

      So where's your solution, smart ass?

      --
      "Derp de derp."
    5. Re:Wirewire drives? by line.at.infinity · · Score: 1

      I wouldn't get those sold as firewire drives -- firewire enclosure plus internal hdd would be better cost-wise. Also USB 2.0 is faster than Firewire 400. From Wikipedia:

      FireWire 400 -- 400 Mbit/s
      USB 2.0 -- 480 Mbit/s
      FireWire 800 -- 800 Mbit/s

      One enclosure and several internal drives can be a cheaper solution, however plugging in the HDD into the enclosure can be cumbersome and time consuming.

    6. Re:Wirewire drives? by Anonymous Coward · · Score: 1, Interesting

      But the reality is due to the way the firewire protocol works vs. USB 2.0 firewire tends to still outperform USB 2.0 by a bit, though not as extreme as it used to be (at one time it was 50% or more). Now its not a whole lot faster. But the thing is that firewire is more suited to these kinds of transfers - large sustained transfers. USB 2.0 works well for many many smaller transfers at the same time.

    7. Re:Wirewire drives? by Anonymous Coward · · Score: 2, Insightful

      USB2 is *not* faster then Firewire 400
      USB2 is *bustable* up to 480mb/s transfer
      Firewire can *sustain* 400mb/s transfer
      In almost all cases, you'll find Firewire much faster.

    8. Re:Wirewire drives? by SlamMan · · Score: 4, Informative

      USB makes the computer actually do work, while firewire ports handle it themselves. For a normal user, not much of an issue, but over a couple drives, you'd notice.

      --
      Mod point free since 2001
    9. Re:Wirewire drives? by espiritu · · Score: 1

      I was checking out a trade rag, and saw an ad for http://www.strongholdstorage.ca...currently it redirects to the parent company page href http://www.corenetworksolutions.ca, so i called them, and was told it is a new service that should be up and running by the end of the week Hope this helps... Cheers!

    10. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      Only $30? You're damn lucky.

      Here in New Zealand, the only place stocking firewire enclosures has them for $180NZ (that's $90 American or thereabouts).

      And I don't have a credit card to order offshore myself.

      Ah, to be in a civilised country where the tech prices are competitive...

    11. Re:Wirewire drives? by eric76 · · Score: 1

      I've considered getting some of those hard drive trays that let you treat a regular internal hard drive as a removable drive.

      One tray for each hard drive used for copies of the backup files and keep the two latest hard drives in a bank deposit box 35 miles away.

    12. Re:Wirewire drives? by Glonoinha · · Score: 1

      Very few IDE drives will sustain 60MB/s across the entire drive. In fact I'm not aware of any IDE drives that will write the final 25% of capacity anywhere near 50MB/s, much less dream of 60MB/s.

      If my desktops had firewire ports, however, I would give external firewire drives serious consideration though.

      --
      Glonoinha the MebiByte Slayer
    13. Re:Wirewire drives? by Trejkaz · · Score: 1

      Probably a bit longer than the life of one of my DVDs which corrupted itself in under two weeks, having been used twice (once was the burn) and not scratched. :-/

      --
      Karma: It's all a bunch of tree-huggin' hippy crap!
    14. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      I've currently got 14 maxtor 300 gig firewire/usb2 drives that i've been trying to use for backups. I've had horrible luck with them. I've been unable to get them to work in a daisy chained configuration (even though this is well within spec for firewire). I've been told by a maxtor tech to use only three per firewire bus, but I have even had problems with a single disk per bus. I am using them with xfs, and they stay mounted all the time. what happens is a disk becomes unavailable occasionally causing data loss when the filesystem can't write out it's metadata (i think, anyway). i think the problem is that the connections between the drives are not very good (we're talking about >14 here, so just one will screw you up for all the disks past that on the chain). i'm looking at buying a bunch of usb hubs and going that route, but i'm not convinced it'll be any better. having lost a bunch of backups, i wouldn't suggest firewire drives en masse to anyone serious about not losing their stuff. mebbe the usb will work better, but i think the best possible thing i could do now is void the warranty on the drives by putting them into a proper ide raid. i think you're better off looking at something like: www.powerfile.com, or build a decent hardware raid yourself www.supermicro.com www.chenbro.com promise makes an ide raid card as does 3ware. bite the bullet and spend the money. this stuff you can't do halfway.

    15. Re:Wirewire drives? by gerardrj · · Score: 0

      there are no external firewire drives. There are (to my knowledge) no Firewire drives at all.
      People take standard ATA/IDE drives and use an ATA/Firewire bridge to connect them up externally and bypass the extremely limited cable length of ATA.

      --
      Article X: The powers not delegated... by the Constitution...are reserved...to the people
    16. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      crazy! That LaCie crap is proprietary pseudo-RAID in expensive enclosures! Why not use a genuine RAID external enclosure from Granite Digital or similar and save money at the same time?

    17. Re:Wirewire drives? by insert+3+letters · · Score: 2, Informative

      Exactly I've run the exact same drive (WD 120 SE) in a usb2 enclosure and a firewire. On benchmarks, the firewire was generally about 30% faster. More reliable connection too.

    18. Re:Wirewire drives? by NanoGator · · Score: 2, Funny

      "here are no external firewire drives. There are (to my knowledge) no Firewire drives at all.
      People take standard ATA/IDE drives and use an ATA/Firewire bridge to connect them up externally and bypass the extremely limited cable length of ATA."


      Well that totally blows my point out of the water!

      --
      "Derp de derp."
    19. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      "I've currently got 14 maxtor 300 gig firewire/usb2 drives..."

      WTF - did you hijack a goddamned Maxtor semi-truck or what???

    20. Re:Wirewire drives? by ecloud · · Score: 1

      But you have to plug them in. Can you power them off via software? You can do that with hdparm for IDE drives. So I figure just have a dedicated backup appliance... a PC with as many drives as you can find controllers to support. (You could add a PCI card if the built-in controllers are not sufficient.) It can be very low-end, just so it has ethernet. Have it boot Linux from a CF card. Leave it powered on all the time, but power off the drives when they are not in use; one would get powered up each day to do one backup generation, on a different drive each day. Or, if the machine supports wake-on-LAN the whole thing could be powered off except for a once-a-day backup stint (but, it might as well run a distributed computing project of some sort in its spare time).

      Only trouble is a severe power surge or lightning strike could take out the whole thing, because stuff is still electrically connected. (But you could run it on DC power! I'm building a DC power system for just this reason... solar panels and a big bank of batteries. 12V is fine for EPIAs, but I found some 24V ATX supplies on Ebay so will use those for a couple systems.) But it's automatable, whereas plugging in the FireWire drives manually when you need to do backups is likely to be forgotten sometimes.

    21. Re:Wirewire drives? by Leareth · · Score: 1

      However, remember not all Firewire enclosures are created equal.

      We have a variety of Firewire & Firewire + USB enclosures, and the range in speed between 2k and 11k a second in transfer rates.

      The 2k ones were cheap $30 enclosures, which of the 30 we bought 12 of them have been returned or replaced. Even then, the enclosures work intermittently, dropping connections and corrupting data. It's got to the point were we don't even use them; they are sitting in a box in storage because they are so failure prone. (And my boss won't throw anything away.)

      The 11k ones (Which are LaCie 120GB jobies) have worked flawlessly, and are used for baking up our main data server.

      --
      *A)bort, R)etry, I)nfluence with large hammer.*
    22. Re:Wirewire drives? by Carnivore · · Score: 1

      I have real-world data to back that claim up. I have a Maxtor one-touch USB2/Firewire drive. Transferring a 20 gig file made from /dev/urandom went at ~13 MB/s on USB2, and ~18 MB/s on firewire. The target drive in both cases was IDE.

      I was a bit surprised, because USB2 claims a higher bitrate.

    23. Re:Wirewire drives? by Anonymous Coward · · Score: 0

      StarTech DRW115 series.

      All metal, all the time.

      Roughly $60 per bay+caddy, extra bays are $20, extra caddies are $50.

      I've used plastic versions (different manuf) in the past and was rather dissapointed with it. So I ordered the DRW115 series from TheNerds.net last week and am planning on getting rid of using external USB drives for backups.

    24. Re:Wirewire drives? by Mr.+Jackson · · Score: 1

      "Lacie makes their 1 terabyte firewire (943 gigabyte formatted) drive."

      We have used a lot of the LaCie FireWire drives (500 and 250 GB flavors) at our shop, and have observed a disturbingly high failure rate. Come in one day, plug it in, and 500 GB of data are gone. LaCie does not acknowledge the problem. I would avoid LaCie and go with simple IDEs in external enclosures.

    25. Re:Wirewire drives? by cdrudge · · Score: 1
      The 11k ones (Which are LaCie 120GB jobies) have worked flawlessly, and are used for baking up our main data server.
      Maybe you should examine your cooling solution if your external drives are serving as ovens.
    26. Re:Wirewire drives? by cdrudge · · Score: 1

      This goes along with what I've heard between the two different formats.

      Does anyone know of any real-world data benchmarks using real-world drives, preferably with a variety of enclosures with different chipsets?

      Beleiving manufacture data for hard drives and chipset speeds always are taken with a large grain of salt.

    27. Re:Wirewire drives? by littlerubberfeet · · Score: 1

      We had a series of firewire bridge failures with Maxtor and Western Digital drives (about 10 failures). The actual drives were still perfectly good. Is it just the bridges on the LaCie drives? or is it the actual internal IDE/ATA drive?

      Also, does LaCie make its own drives, or are they Seagates or something with different stickers? I worry, because my boss just bought one of those terabyte drives...

      --
      Sig (appended to the end of comments you post, 120 chars)
  10. Personally I prefer something in a blonde by kfg · · Score: 4, Insightful

    I was wondering, if Slashdot readers have any recommendations for a cheap automated way to store and retrieve data."

    Although the good ones don't come cheap. I guess this another case of "pick any two."

    KFG

    1. Re:Personally I prefer something in a blonde by bheerssen · · Score: 2, Informative

      mods, this is not off-topic.

      KFG meant to say "You can have fast, good, or cheap. Pick two."

      It's an old software design maxim that applies suprisingly well to this subject.

      --
      (Score: -1, Stupid)
    2. Re:Personally I prefer something in a blonde by mcrbids · · Score: 2, Interesting

      KFG meant to say "You can have fast, good, or cheap. Pick two."

      It's an old software design maxim that applies suprisingly well to this subject.


      ...and to many things, particularly if you replace "fast" with "convenient". Just for kicks, think about it.

      Food? Check. Clothing? Check. Beer? Check. Housing construction? Check.

      Pretty much anything that involves the exchange of money for goods and services follows this maxim.

      --
      I have no problem with your religion until you decide it's reason to deprive others of the truth.
    3. Re:Personally I prefer something in a blonde by Kiryat+Malachi · · Score: 1

      No it isn't.

      It's a very, very old engineering saying. It was around before programmable computers were. I have never, ever met an engineer who doesn't know it.

      It applies to just about everything.

      Especially girls.

      --

      ---
      Mod me down, you fucking twits. Go ahead. I dare you.
      (I read with sigs off.)
    4. Re:Personally I prefer something in a blonde by bheerssen · · Score: 1

      Touché

      --
      (Score: -1, Stupid)
  11. 1TB a month?!? by stinkydog · · Score: 5, Funny

    Short of launching his own space probe, the only way for this guy to consume a TB a month of storage is a serious porn habit. Just post your 'content' on Edonkey and it will be available when you 'need' it. You likely only watch them once anyway.

    SD

    --
    âoeWho knew something as harmless as willful ignorance could end up having real consequences?â
    1. Re:1TB a month?!? by sandwichmaker · · Score: 1, Interesting

      You have obviously never heard of fMRI studies, have you?

    2. Re:1TB a month?!? by grasshoppa · · Score: 5, Funny

      Using this method, I have achieved my life long dream of tapeless ( well, everything-less ) backups.

      I simply make a tar.bz2 file with all my important files, filter it through gpg, then post it on edonkey, usually titled, "Olsen twins getting it on", and then usually the date.

      Viola, instant backup that is available to me whereever I may go.

      --
      Mod me down with all of your hatred and your journey towards the dark side will be complete!
    3. Re:1TB a month?!? by Anonymous Coward · · Score: 0

      Sorry, but a 1tb porn habit is almost impossible.

      Using my comcast 3.5mbps connection to its fullest 24x7 and subscribing to all well-reviewed porn sites then downloading all the content I can, I've only managed to acquire 1.5 terrabytes of porn in six months at a rate of about 250gb/mo tops.

      I store my porn on 200gb IDE hard drives which I stick in mylar bags and then into a mylar lined rack of plastic shelving.

    4. Re:1TB a month?!? by gl4ss · · Score: 5, Funny

      ** You have obviously never heard of fMRI studies, have you?**

      oh shit! I totally missed the part of the history where FMRI scanners came commonplace for men.

      oh wait the whole ask slashdot blurb is twisted, the headline implies asking for datastorage possibilities for the common man - yet one of the first things mentioned that he needs it for his special job that generates tb's of data per month. by that definition he is not a common man, except that he hopes to have a miracle solution - that is quite common.

      still, a common man would choose whatever possibility gave the cheapest price per gb(probably harddrive). with dvd-r's he would end up burning multiple dvd-r's per day and it's kind of implied that the data would need to be retrievable so he would have to burn the same disc multiple times, even then it wouldn't be a sure thing.

      his needs are quite bigh though still, big enough to warrant for professional help since his likely going to be spending quite a bit of money on the thing.

      --
      world was created 5 seconds before this post as it is.
    5. Re:1TB a month?!? by Zone-MR · · Score: 5, Funny

      Oh, so that explains why that "Olsen Twins Getting it on - 12 Mar 2003.avi" file I downloaded last week contained a zipped tar archive full of boring spreadsheets and a lot of donkey porn.

    6. Re:1TB a month?!? by ruckc · · Score: 1

      Why not check out the his website: Vigyaan I can see that type of work requiring large amounts of data.

    7. Re:1TB a month?!? by Anonymous Coward · · Score: 0
      Oh, so that explains why that "Olsen Twins Getting it on - 12 Mar 2003.avi" file I downloaded last week contained a zipped tar archive full of boring spreadsheets and a lot of donkey porn.

      I seemed to remember that they were pretty young, and sure enough, google told me this:
      Mary-Kate Olsen and Ashley Fuller Olsen Date of Birth: June 13, 1986
      Now, you might have been lucky, they turned legal for porn about 20 days ago. But you cite the name of the file as "Olsen Twins Getting it on - 12 Mar 2003.avi", and on that date they would have been 16. Busted!
    8. Re:1TB a month?!? by antic · · Score: 1


      By "common man", I think he meant "cheap". ;)

      --
      'Thats they exact same thing a banana wrench monkey.'
    9. Re:1TB a month?!? by EvilTwinSkippy · · Score: 3, Funny
      Well if you are the one behind all the "Islamic Militant Bukakke Kitten" porn, you are one sick bastard.

      And you really need to fire your accountant. Your Caymon Island bank account was overdrawn twice in a month.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    10. Re:1TB a month?!? by magefile · · Score: 1

      I think he misspelled "on the cheap".

    11. Re:1TB a month?!? by Igmuth · · Score: 1

      Though any organization that can afford a fMRI unit, has quite a different definition of cheap them most people/companies.

    12. Re:1TB a month?!? by Zone-MR · · Score: 1

      lmfao. Perhaps you are not familiar with this concept.

    13. Re:1TB a month?!? by Anonymous Coward · · Score: 0

      Nah I generate about 1GB a day of just raw images with my digital camera. Start editing them and it is no big deal to have 3-4G a day. Now granted I only shoot 8 days a month, but it is possible to generate that level of data.

      How much would a Tivo generate a day if one records 3-4 shows a day?

  12. Cheap solution by codeguru73 · · Score: 4, Insightful

    Buy some inexpensive IDE drives with high storage capacity and use a software raid solution. What kind of budget do you have anyway?

    1. Re:Cheap solution by Wudbaer · · Score: 3, Insightful

      Repeat after me:

      RAID is not backup !
      RAID is not backup !
      RAID is not backup !
      [..]

    2. Re:Cheap solution by EvilTwinSkippy · · Score: 2, Interesting
      RAID is not backup!

      Indeed, I don't believe in any backup that doesn't have multiple copies that can be stored offsite. Fire really doesn't care what was on your hard drive, nor do thieves, or axe-wielding maniacs.

      And anyone who has been in IT long enough can tell you one of the above stories first hand.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    3. Re:Cheap solution by noname3 · · Score: 1

      Fire? Yeah, I didn't have long to wait for that one. I was sixteen when my family lost our backups. When dad cut costs by closing his office and moving his computers home, he put them all into our storage room. Right next to the backups.

      When the fire hit we lost two AS/400s, a couple dozen PCs, and about 30 backup tapes. No one was hurt, but did we ever have problems with the insurance company.

      Remember when Homer tried to claim he had a Picasso and a collection of antique cars in his basement? We got the same reaction from State Farm.

    4. Re:Cheap solution by Anonymous Coward · · Score: 0

      raid might not
      be backup but
      you could use
      raid for backup.

    5. Re:Cheap solution by IronicCheese · · Score: 1

      Fire, theives and axe-wielding maniacs? And you want to move your TAPES off site? Heck, I'd think about moving MYSELF outta there. ;)

    6. Re:Cheap solution by Anonymous Coward · · Score: 0

      "really doesn't care what was on your hard drive, nor do thieves, or axe-wielding maniacs" Lizzie Borden took an axe, And plunged it deep into the VAX; Don't you envy people who Do all the things YOU wanted to do? Had to be quoted. Gotta love that old fortune program.

  13. Rev Drives by jtwronski · · Score: 1, Informative

    Iomega has a somewhat new backup solution out called a rev drive. Its quite a bit like a hard drive, but removable. Mine holds 90GB compressed, and the transfer speed isn't all that bad. I haven't had the opportunity to test it on Linux, So I can't say it will work with your setup. the drive is under 300USD, and about 50USD per cartridge.

  14. Bulk storage? by mikers · · Score: 3, Funny

    I got a couple of drawers of old floppy disks. $10 takes 'em. Plenty of bulk.

    The Sony "lifetime" warranty may still be good on them too!

    1. Re:Bulk storage? by Anonymous Coward · · Score: 0

      I saw an ad in the paper by a guy selling 300,000 (!) 720KB floppies. That makes up for the first 230GB. Just look for similar ads and you're off with "a solution that will work on old computers, too." :)

      Make a little soldering and you have TWICE that amount of space!

  15. age old problem... by Lumpy · · Score: 5, Insightful

    Ahh the large amount of data that has X value versus a storage solution...

    If your data is worth $20,000.00 then a $2000.00 solution is dirt cheap.

    what is your data worth? that is where you need to start and then look at the 10-30% of the data's value to start looking at how must to spend on it's storage.

    If 1 month's data was lost forever, how much money would it cost the company? that is your actual $ amount that you should be shopping at.

    and that is how I got the company to buy a $20,000.00 1000 tape DLT jukebox.

    my data is worth over $100,000 a month and is much lower than yours is size.

    That is where you need to start. Justify your storage costs by figureing out what it is worth to begin with.

    --
    Do not look at laser with remaining good eye.
    1. Re:age old problem... by D-Cypell · · Score: 4, Funny

      my data is worth over $100,000 a month

      This 'data' doesnt happen to be a large collection of email addresses does it?

    2. Re:age old problem... by Servo · · Score: 3, Informative

      First of all, don't bother with DLT. It is slow, and increasingly more unreliable as DLT is phased out of production and replacement parts are actually refurbs.

      --
      A slip of the foot you may soon recover, but a slip of the tongue you may never get over. -Benjamin Franklin
    3. Re:age old problem... by Anonymous Coward · · Score: 0

      you obviousally dont know anything about DLT storage.

      it is the DEFACTO stroage for corperations everywhere. it is the ONLY solution for high capacity storage (100Gb and higher) is lightning fast with modern drives (SDLT) and is insanely reliable compared to all other long trm storage systems that are current.

      Please tell me ONE system that has the capacity and huge reliability that DLT does. I bet $1000.00 you cant.

      Lumpy is right, DLT is the direction to look. IT's what corperations and Colleges trust.

    4. Re:age old problem... by Jeff+DeMaagd · · Score: 1

      What does bug me is when people complain about the pricing of some of these things. TB RAIDS are not toys. Even then, I didn't think it was that expensive. "The common man" currently does not need TB storage, so therefore, economies of scale haven't taken hold yet.

      If it is too expensive to properly store this stuff, then maybe you need to review your pricing to make sure you aren't currently shortchanging yourself in terms of price of services rendered and the cost to do so. Or find some more efficient way to store that data.

    5. Re:age old problem... by Anonymous Coward · · Score: 0

      Ooh, the AC putting up a $1,000 bet. Balls of steel, my man, balls of steel.

    6. Re:age old problem... by TinyManCan · · Score: 2, Informative

      LTO 2 drives are the current trend in large enterprise storage. LTO is the new hotness, DLT is old and busted.

    7. Re:age old problem... by Anonymous Coward · · Score: 0

      Please tell me ONE system that has the capacity and huge reliability that DLT does. I bet $1000.00 you cant.

      EMC^2. Pick a storage array.

    8. Re:age old problem... by Anonymous Coward · · Score: 0

      VXA have a nice solution, although i don't know how it compares with other tape systems. Their ads look impressive.

    9. Re:age old problem... by Anonymous Coward · · Score: 1, Informative

      LTO1: 100GB/200GB (native/compressed)
      LTO2: 200GB/400GB
      SAIT1: 500GB/1300GB

    10. Re:age old problem... by Anonymous Coward · · Score: 0

      It's what incompetent IT seat warmers in "corperations" and Colleges trust. It's what people who aren't aware of the alternatives pay for with someone else's money. It's what you get when you want to justify overtime spent reading magazines while you switch tapes occasionally.

      Learn to spell, moron - and save your advice for when (or if) you graduate high school and do this professionally.

    11. Re:age old problem... by azer_s · · Score: 1

      Check out lto tape drives

      they are fast
      hold about 400Gb each tape

      and you can get an auto loader for the tape drive that allows for multiple tapes

      we use this at work (engineering company)

    12. Re:age old problem... by afidel · · Score: 1

      SDLT600 is quite nice with good transfer rates and acceptable $/GB. It has the advantage of being able to read your old DLT-IV tapes if you have them. There are of course some organizations that used LTO-1 and for them an LTO-2 solution would likewise make sense. Basically it comes down to legacy support and vendor support, $/GB is pretty comparable for most tape solutions of the same generation since they are based on the same material science breakthroughs.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    13. Re:age old problem... by EvilTwinSkippy · · Score: 1
      It's what incompetent IT seat warmers in "corperations" and Colleges trust. It's what people who aren't aware of the alternatives pay for with someone else's money. It's what you get when you want to justify overtime spent reading magazines while you switch tapes occasionally.

      Learn to spell, moron - and save your advice for when (or if) you graduate high school and do this professionally.

      Well it's what IT seat warmers who actually have had to do backups for 3 years in a row trust. Let's face it, when I bought my drives DLT was the best. Everything else here was either in development, or just plain didn't exist.

      And you never trust backups to something new.

      That said, with this year's equipment replacement we are going with AIT. But having been through several data "oopsies", DLT has proven to be a reliable backup medium.

      By the way, spelling is not measure of intelligence, nor education. Herman Melville was a terrible speller, yet that didn't keep him from a career in writing (books like Moby Dick). Spelling is generally the job of anonymous anal-retentive editors who are fogotten 10 minutes after something is published.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    14. Re:age old problem... by EvilTwinSkippy · · Score: 1
      Heck, our Terabyte array holds data that far exceeds the $10k we paid for it. Thus a $12k robotic tape backup system is cheap at twice the price.

      Considering we are talking about storing the collective work of 200 people who range in salary from $30,000 to $500,000, $22K to prevent the loss of several years worth of work is an ROI in about a week.

      I'm just glad the $22K buys 3 terabytes of RAID and backup. When I did this three years ago $22K bought 100GB of tape and backup.

      --
      "Learning is not compulsory... neither is survival."
      --Dr.W.Edwards Deming
    15. Re:age old problem... by egarland · · Score: 1

      what is your data worth? that is where you need to start and then look at the 10-30% of the data's value to start looking at how must to spend on it's storage.

      This is bogus. Storage companies use this to convince people to spend too much money. To illustrate the point: if your data is worth about $20K (however you measure that) and there is a storage solution that costs $15k that is good and one that costs $2K that is better in every way for what you need to do, you should go with the $2k. Companies often use the "You get what you pay for" argument to convince people to buy expensive storage solutions. Sometimes, what you pay for ends up being an inferior poorly designed system created with expensive proprietary components.

      I'm not suggesting people buy el-cheapo rickety storage solutions, just be smart about your purchase and realize there are companies out their who will charge you big bucks for a mediocre solution when it's not necessarily appropriate. Look at what your needs are and chose from solutions that will meet them.

      I'm a big fan of IDE RAID. A standard tactic storage companies used when they had no products that could compete against it is "You get what you pay for." Of course, that's not true and the smart customers knew it which is why those same storage companies have since introduced products based on IDE RAID.

      --
      set softtabstop=4 shiftwidth=4 expandtab nocp worlddomination
    16. Re:age old problem... by Anonymous Coward · · Score: 0

      More than likely, the parent was speaking about SDLT.... at least I would hope!

    17. Re:age old problem... by Carnivore · · Score: 1

      Yup they're fast. I have an LTO-1, which holds 100GB native.

      One caveat, though. I found this one out recently. If a tape ever breaks in the drive, HP (I'm not sure about other manufacturers) says that they can't garauntee that the drive will continue to work. I think that they were bullshitting me, but they said that if the drive had been under warranty, they would have replaced the drive outright, no chance of fixing it. They didn't offer an option for me to pay for a repair. As far as I can tell, there's no way for the drive to have been damaged by the broken tape.

      We ended up buying a whole new drive @ $2k, which sucks.

      I'm drooling over the new AIT drives, but I need generation 3 right now. Damn astronomers with their terabytes of data.

  16. What if... by Anonymous Coward · · Score: 0

    You build a tower of dual-layer DVD burners. Maybe 8. That'd give you 8x9, 72 gigs per loading session. Although, aren't those 50 gig discs supposed to be out soon? 20 of those would be a TB.

  17. Compression and Redundancy by CyanDisaster · · Score: 1

    I'm not sure if you've already thought of this, but would it be possible to remove any redundant data, then compress what is left, or would the data need to be left intact and as-is, so that it can be accessed without any fuss should the need arise?

    Hope be with ye,
    Cyan

    1. Re:Compression and Redundancy by Anonymous Coward · · Score: 0

      Yes, good point. I would suggest Michael Schindler, www.compressconsult.com--he wrote SZIP. The savings in reduced storage requirements will surely outweigh any consulting fees. From his site:

      What we can do for you

      Since you came here you have a desire to make your data smaller. Reasons could be cut of storage or data transfer costs, limitation of bandwidth or storage capacity. We can help you by choosing a suitable compression for your needs, or, if there is none, develop one. We will also assist you in evaluating other's compression proposals and inform you of potential problems.

      We can also point out to you places where we think a lossy compression is possible. We have had a case where a technically speaking lossy compression is still regarded lossless by the client - the compression could not distinguish 1,2,3 from 3,1,2 but the client did not care for the sequence anyway. The difference in compressed size was 2:1 there.

      Why consult an expert?

      Compare design of a data compression with building a car: If you just need a low-end solution you can do it yourself. If you need an average solution you go ahead and buy a car; you would use a standard library in compression. Even there an expert could help, or you may end up with a two-seater sportscar to transport a family. If you have unusual needs (a "car" to transport spacecrafts, a racing car,...) you go for an expert. You should do the same in compression.

    2. Re:Compression and Redundancy by Anonymous Coward · · Score: 0

      Nice ad.

    3. Re:Compression and Redundancy by Anonymous Coward · · Score: 0

      Nah, I don't actually know him personally even, although I did write to him once about porting SZIP to another platform. He was quite helpful.

  18. Well... by stonecypher · · Score: 3, Interesting

    Depending on your budget, the appropriate thing to do may be to get an automated DVD burning system to do scheduled incremental backups in duplicate. We used to do that with CDs at an ISP I used to work at. It's unfortunately difficult to search for while not getting people pirating movies, but this is the first thing I found on Google; doubtless there's better out there.

    --
    StoneCypher is Full of BS
    1. Re:Well... by jcourcha · · Score: 1

      http://www.discmakers.com/duplicators/products/aut omated.asp We have a couple of these at work.. useful for making 100 copies overnight.. and could very well be scripted to do backups. The 4 DVD drive model will run you about $6,500 while the 1 DVD drive will be closer to $3,700. Mac/PC compatible :)

    2. Re:Well... by stonecypher · · Score: 1

      pity you posted that two days late. that deserves to be modeed up, and i can't use my mod points here.

      --
      StoneCypher is Full of BS
  19. Tape? by darkjedi521 · · Score: 2, Interesting

    I haven't seen anyone mention magnetic tape yet. I'm sure it has its drawbacks too, but considering its still widely used for backup purposes in a commericial environment, it can't be too bad. Especially depending on how much a cartridge can hold. Its not the cheapest, but it might be something to look into.

    1. Re:Tape? by dnahelix · · Score: 1

      This company came up after a quick search.
      I think everyone imagines a room full of those spinning reel-to-reel things when you mention tape. It's archaic, but viable today!

      --
      Slashdot Eds Link Anonymous Posts With Logged Posts
      They Are Vermin Feeding On Each Other's Feces.
      I Hate \.
    2. Re:Tape? by darkjedi521 · · Score: 1

      I've still got a pair of working 9 track open reel drives. Need to find boot media so I can make the attached VAXen functional again. I don't suppose anyone would know where I could images of RX-50 boot media for a the console of a VAX 8530 on the web? I have the VMS install media, I just need to reload the OS on the console first.

    3. Re:Tape? by Anonymous Coward · · Score: 0

      Amen. For large scale backups like this, I always recommend tape drives.

      Plus there's always the side benefit of your data storage room looking like it came straight out of the 60's...

    4. Re:Tape? by Anonymous Coward · · Score: 0

      I'm not a DEC-head, but a good bet for finding out would be the Classic Computers mailing list.

  20. Buy an older tape drive by Apparition-X · · Score: 5, Informative

    Look for an LTO gen 1 or SDLT220/320 on ebay, with a SCSI connection (some of them are fibre, and I assume you don't want to go there!). Don't forget to pick up some tapes. In general, this sounds like it would work if you plan on doing this for a while, and can leverage the initial investment over months or years.

    Capacities are (for the cost of a sub $50 tape):
    - LTO1: 100 GB uncompressed
    - LTO2: 200 GB uncompressed
    - SDLT220: 110 GB uncompressed
    - SDLT320: 160 GB uncompressed

    If your data is particularly ammenable to compression (i.e. database data) you could easily get 3 or 4 to 1 compression with these drives without sinking your CPU utilization.

    1. Re:Buy an older tape drive by TubeSteak · · Score: 1
      So first I did a little math:
      1.6TB * $0.60/GB = $960
      So if I can do the basics, that's $96 for each 160GB HD? I didn't realize HD's were getting that cheap. Anyways, at that price you're more than likely talking about PATA (Parallel ATA) drives & since $960 seems to be expensive for you, SATA & SCSI are out of the question.

      With HD space coming so cheap, I think a dedicated RAID box would work as a storage solution. DVD's are a pain, I'm assuming you want to store large volumes of data, so people will say get a tape backup. 1. They're expensive and 2. They're small. Even with a 200GB tape compressed to 400GB... You're still not anywhere near 1.6TB or even 800GB (since we don't know what type of data he might be compressing, I'll just go with an even 2:1).

      Oh yea, Buying PATA drives in bulk. If you do this, assume failures will happen in bulk and purchase extra's accordingly. 3Ware, according to their website (and as far as i know) make the only 8 & 12 port RAID cards around. Couple that with a cheap full tower (or a badass midtower) and throw in 12+ HDs.

      Since you considered DVDR, I'm assuming this doesn't need to be online 24/7. This is not a problem since it's storage, feel free to power down your raid box. This saves the headaches of hardware failure, power surges, accidentally hitting the delete key, etc.

      In deciding Tape vs RAID, the key factor should be: Do I want my data spanned across several tapes. If you can get 4:1 compression or better, by all means consider the tape drive, but with a big enough tower, you could get >3TB + redundancy + hotspares. My only RAID5 problem is that even with 1.6TB of data, you lose two drives and you're toast. {--that should be your second deciding factor. It's a scary thought.

      --
      [Fuck Beta]
      o0t!
    2. Re:Buy an older tape drive by Anonymous Coward · · Score: 0

      PATA prices: PriceScan.com

      120GB - $80 ($0.67/GB)
      160GB - $90 ($0.56/GB)
      200GB - $110 ($0.55/GB)
      250GB - $160 ($0.64/GB)
      300GB - $230 ($0.76/GB), 5400rpm only

      5400rpm 250GB drives are more like $190 ($0.75/GB)

      For a RAID array (assuming 8 disk controller like one of the Escalades):

      8 x $110 = $880
      controller card = $500
      MB / CPU / RAM = $400
      case = $200
      total cost around $2000
      net space 6 x 200GB (~1.1TB), $1.78/GB

      As you can see, it ain't the disks that are the problem in an ATA RAID, it's the controller card, case and components. Plus, I'm allowing for a hot-spare and RAID5. Probably can't save the cash on the case, but you might be able to ditch the controller card and save some cash on the MB/CPU/RAM. That cuts the cost to around $1300 for the hardware or $1.16/GB.

      Biggest SCSI drive is 146GB (call it 140GB for planning purposes) and those are $500 each. To build a 1.1TB array requires 10 discs (9 in a RAID5 plus 1 hotspare). SCSI controller card is at least $300, plus the $100 cables, plus the MB/CPU/RAM (figure $200 min) and the $200 case. So $600 for the box to hold the drives plus $5000 worth of SCSI drives works out to $5.00/GB.

      Maybe as much as $7.50/GB for SCSI compared to a realistic $1.75-$2.00/GB for IDE/SATA. That's not even considering the costs of backup drives and tape.

    3. Re:Buy an older tape drive by mrtroy · · Score: 1

      If your data is particularly ammenable to compression (i.e. database data) you could easily get 3 or 4 to 1 compression
      3 or 4 to 1 compression is highly unlikely

      Assume random data gets 2 to 1 using common algorithms like huffman or LZ (i dont know how to spell their names, so i wont try)

      Database data MIGHT be better...but not that much better.

      Also consider any files that can easily be compressed like pictures or music or movies will already be compressed.

      So im guessing 2 to 1 compression is the best he can hope for, also keeping in mind the length of time necessary to compress and decompress this data, it may simply not be worth it.

      Personally, I wouldnt.

      --
      [I can picture a world without war, without hate. I can picture us attacking that world, because they'd never expect it]
    4. Re:Buy an older tape drive by Apparition-X · · Score: 1

      All more or less true.

      However I can say from years of personal experience that database data (much of it) really does compress that well. Pictures don't compress worth a durn. Most file space stuff does about 1.5:1 to 2:1.

      And I have no idea what algorithm LTO1/2 and SDLT have built into the drives, but it does work *really* well. And because it is firmware, it is pretty darn fast. As are the drives. Assuming you can sustain the stream on the system side, we find it pretty common to exceed 10 MB/s (and yes I mean bytes!) per drive.

      The bottleneck really does become the ability to sustain fast transfers. For example, LTO2, 9980, 3592 are all rated at about 34 MB/sec, and then if you give them compressable data, can actually run at 65-70 MB/sec. (Bear in mind these drives cost $20-40k new). Now think about trying to get a dozen of them running effeciently at the same time! /rant off, I know you weren't talking about throughput. And your points about compression are dead on (except for the fact that it doesn't take any time, because it is done at the tape drive level).

  21. Oh, I see... by spinkham · · Score: 3, Funny

    You want it fast, cheap, reliable, easy, and now, eh? Good luck with that.... Sounds like a request from the PHB...

    --
    Blessed are the pessimists, for they have made backups.
  22. 1 TB/Month by nikonius · · Score: 2, Insightful

    It does not sound like your needs are anywhere near that of the 'common man'. You sound more like a power user to me. Somethimes you have to pay for heavy-duty storage as the cost of doing business.

    1. Re:1 TB/Month by nikonius · · Score: 1

      I have actually seen 200GB HDs for US$129.99 at discount wharehouses. Buy 6 and RAID5 it in software using linux and you would get a terabyte with some survivabilty.

    2. Re:1 TB/Month by Seumas · · Score: 1

      $130 for a 200GB drive? And those are at discount warehouses?

      Go to PriceWatch and you can get 200GB drives for $100 including shipping. I've got a bunch of them stacked up for archival of video and audio. My last two batches came from a place on their called US-Depot (or something like that) and I had no problems with them either time.

    3. Re:1 TB/Month by aardvarkjoe · · Score: 1
      For a one-time thing, this would work. However, it sounds to me like he's going to need to do this every month (although it's admittedly not terribly clear what he needs) -- I sure wouldn't want to be storing my data on a set of fifty discount drives in a semi-homebrew setup.

      If this guy is really storing 1TB of data every month, and is going to be doing it for any length of time, he'd be better off going for a commercial solution. No, it's not cheap, but it would be worth it in the long run.

      --

      How can we continue to believe in a just universe and freedom to eat crackers if we have no ale?
  23. Sounds like you need a WORM by Durindana · · Score: 1

    Aren't Write-Once Read-Many solutions ideal for this kind of thing?

    If you're not looking for permanent backups, the per-media cost may be prohibitive though.

  24. Cheap and Big by guamman · · Score: 2, Insightful

    Tape Drives - Probably the cheapest way to store large amounts of information. The only drawback is that they aren't fast. However, If your harddrives are large enough to hold the data you are currently working on and tapes are used exclusively for backup then a speed problem shouldn't be . . . a problem.

  25. 1 TB of data a Month? by callipygian-showsyst · · Score: 0, Redundant

    That's a LOT of pr0n

  26. Porn? by Anonymous Coward · · Score: 0

    How much porn can one store?

  27. eh by maxbang · · Score: 2, Interesting

    If you don't have the patience for DVD backups (neither do I), then you're pretty much stuck with RAID. So buck up, spend the extra cash, and setup a storage box or two on the network with one or two terabytes in each. I have a branch of my network setup on gigabit, one box has 250 GB of storage on RAID 1 across two 250 GB (this one's for video projects), the other has 160 GB in RAID 0 (my learning system). Works fine and easy as hell to setup. If I need to add storage I can either add some drives or just add another box. I've thought about using GFS, but I don't know enough about it to implement it, yet. Anyone here currently using GFS?

    --
    I also reply below your current threshold.
    1. Re:eh by timmarhy · · Score: 1

      peopel raid is not a backup, it's redunancy.

      --
      If you mod me down, I will become more powerful than you can imagine....
    2. Re:eh by Anonymous Coward · · Score: 0

      A RAID system will happily wipe out all your data on command (specifically, rm -rf /...). In fact, something like that happened at my college when one of the ancient RAID controllers foobar'd and took down the home directories for like a week while they kept restoring corrupted backups... grr.

      1 TB/month only translates to about 33-34 GB/day, which you could back up on to DVD manually if the data is really that important, or archive to tape or something.

  28. compression by Suppafly · · Score: 2, Informative

    First off, if you aren't already compressing that data, start. You may be able to cut the size down dramatically using compression.

    Then backup using tapes just like every other place that has to do backups. Generally do full backups once a week and incremental ones nightly or whatever is necessary based on the data you are working with.

    1. Re:compression by hackstraw · · Score: 1

      Compression only works on some (albeit, most) data. It doesn't get you a thing with most scientific numerical data.

  29. spongedrive is best by cubyrop · · Score: 5, Funny

    i am responsible for providing storage solutions for a mid-sized content creation company which, through version archiving, accumulates near 1-200 GB per day. they require access to their media backups on a rolling basis, so tapes are not an option.

    i have found that a Teutonium cluster of 6.5 TB Spongedrives (either Cray or SecreTech are fine) fits the bill nicely. housed in a 15-unit rack server, the amoeba-shaped drives utilize BioLas technology to store data on 6-dimensional Moebius Cilia for a slick seek time of 0.00 ms.

    a cluster costs about $45,000 USD but the price should come down in 2004 Q4 when SecreTech launches their new 40-platter blackholium SCSI's.

    --
    If I could make this sig kill you, I would.
    1. Re:spongedrive is best by Anonymous Coward · · Score: 0

      Any mod who rises to the bait should be shot.

  30. Drawbacks, what are you willing to put up with? by Anonymous Coward · · Score: 5, Informative

    All forms of media/backups have their own drawbacks... but some aren't as bad as others, and the others often are more accessable.

    Tape: Tapes break, they wear, they have dropouts, take a while to back everything up, can't always access files if you just want to restore something (Different methods vary, folks)... but ultimately, it's cheap when you use DAT because they're a common media. Swap the tapes twice as often (and throw old ones out) if you're paranoid about tape related failures.

    Hard Drive: Most common form of backup I see now, mainly for the 1:1 size factor. Yeah, drives fail, too. Sometimes you have a pretty good warning when this is going to happen, sometimes you don't. (My 13GB Maxtor and 40GB IBM Deathstar drives both went *pfft* on reboot.) Get enough of them at once, you could swap out the logic boards if one does fry out. Ultimately, RAID or just simple 1:1 mirroring is probably the most efficient and easy method. Accessing bits and pieces is also easiest under this method. I personally just use an external USB2 case with a 120GB drive in it. Everything I want to back up goes on that drive, and then eventually... DVDRs. I turn off the drive when I don't need it... hopefully prolonging the life of it when I need it most.

    DVDR: Not anymore. If we had these new-fangled DVDR discs (+ or -) say... when 2 to 6GB drives were common.... sure... But in addition to hard drives, recovering selective files is easy under this method too... Unless you use a backup program that crunches everything together on the disc in some spanning format. Burn times can be tedious... but it's not bad if you consider the overall amount of data you're putting on the disc. Cheaper than quality-brand name CDRs, though, in terms of price per mega/gigabyte. Only an idiot would trust $0.01-per-disc spindles for long-term backups. Even the longevity of DVDR has yet to be seen...

    CDR: I'm not going to bother.

    Network: Well, still relies on hard drives and other components... but good if you don't want to saddle one room with a ton of boxes. Simply for space and efficiency... external drive is probably better anyway.

    Old fashioned method: Print everything out and keep it in a filing cabinet somewhere. You could always OCRA the stuff later. ;-)

    1. Re:Drawbacks, what are you willing to put up with? by m_evanchik · · Score: 1

      The old-fashioned method should not be discounted.

      Paper has longer longevity than any optical or magnetic medium.

    2. Re:Drawbacks, what are you willing to put up with? by wik · · Score: 2, Insightful

      Pray tell, what are you going to do with just a network? Store bits on the wire and network card rx/tx buffers? There's gotta be something big on the other end of the cable, dude.

      --
      / \
      \ / ASCII ribbon campaign for peace
      x
      / \
    3. Re:Drawbacks, what are you willing to put up with? by mrchaotica · · Score: 1

      For the network option, remember that it's good for having remote backups too, incase your office burns down - but you have to backup to somewhere completely different, not just the next room.

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    4. Re:Drawbacks, what are you willing to put up with? by Gleef · · Score: 1

      Yes, paper shouldn't be discounted.

      However, backing up 1TB to paper is a little daunting/expensive/wasteful. Storing the resulting backup would take a pretty big building for just a single backup (if you need monthly backups, you're talking about an ever growing complex of buildings). I shudder to imagine the restore proceedure.

      --

      ----
      Open mind, insert foot.
    5. Re:Drawbacks, what are you willing to put up with? by Anonymous Coward · · Score: 0

      Pray tell, what are you going to do with just a network? Store bits on the wire and network card rx/tx buffers? There's gotta be something big on the other end of the cable, dude.

      Nah, just slip a wireless link in there and bounce the signal off of a Mars reflector. With a high enough bitrate you can fit enough on the line to store a few gigabits before the signal gets back and you have to send it up again.

      Just one big "ring" memory.

    6. Re:Drawbacks, what are you willing to put up with? by WuphonsReach · · Score: 1

      However, backing up 1TB to paper is a little daunting/expensive/wasteful.

      (Doing the math...)

      Assuming 200dpi for each bit gives 40,000 bits per square inch, or 3.2 million bits on an 8" x 10" area. Adding some redundancy to the data would reduce usable storage to around 0.381MB per page. Redundant coding would reduce that to 0.300MB per page. A stack of 500 pages holds 150MB. Add in some more redundancy and push that down to 103MB per ream of paper. A box of copier paper holds 1024MB.

      Thus, 1 TB could be backed up on to 1024 boxes of copy paper. Overally redundancy is around 86%. You could even print the source code required to read the data back in the margins or on a cover sheet.

      300dpi doubles your space meaning you only need 500 boxes. 600dpi means you only need around 60-70 boxes. Much past 600dpi and you'd better switch to microfiche.

      Heavy? You bet. Unreasonable? Definitely. Doing the data verification would be a real pain unless you have a super-efficient sheet-fed scanner.

      Personally, I picture some poor graduate student being tasked with the job in the year 3030.

      --
      Wolde you bothe eate your cake, and have your cake?
    7. Re:Drawbacks, what are you willing to put up with? by m_evanchik · · Score: 1

      I'll admit I was just thinking about regular text documents.

      On the other, thanks to your calculations, I am sure some poor grad student has already been assigned the task : )

    8. Re:Drawbacks, what are you willing to put up with? by Jeff+DeMaagd · · Score: 1

      Paper printed with readable characters may last the longest when treated well, but text-only prints net about 5k of text per page, assuming no excess white space. You'd need warehouses to store terrabytes.

  31. Idiot Moderator by Anonymous Coward · · Score: 0, Troll

    What kind of idiot moderator is handling this article/item - giving Scores of "1" to reasonable suggestions and "2" for stupid/funny ideas like someones posting about his drawer full of floppies that he's willing to sell???

    1. Re:Idiot Moderator by eric76 · · Score: 1

      You must be a stranger around here.

      Anyway, at least nobody has suggested paper tape.

  32. LTO Ultrium 2 Tape Drive by jeffgeno · · Score: 4, Informative

    The drive will run about $4000, but the tapes are only around $0.20/GB assuming a 1.5:1 compression ratio. And keeping that assumption, 1 TB of data should only take 3 200 GB native tapes per month, so swapping wouldn't be so bad with the single tape drive. An autoloading library would be significantly more expensive, but if you really need automation, that's the way to go.

  33. safest way by nazsco · · Score: 1, Funny

    echo 1tb.txt > /dev/lprn0

  34. The "Common Man" doesn't need bulk data storage... by syberanarchy · · Score: 2, Insightful
    The best idea I could give you is to just create a sister system, where you mirror all your data. Not cheap, but cheaper than getting a pro-grade solution.

    The reason you won't find such things on the cheap is because the average person with a PC doesn't even know what a GB is. He simply goes into the store, the sly salesman says "oh, what do you need it for," and then says "well 60-80 gb should be all you ever need."

    Now, contrast that to me - my friends shit when they hear I have a 250 gb drive and a 120 gb drive, as well as an extra 60 gb on a networked machine. They can't fathom ever needing that much space. I know that's probably a pittance by Slashdot standards, but it's true :(

  35. Theft / Physical Damage etc. by Anonymous Coward · · Score: 0

    The age old problem I've had with RAID is that:

    - If the machine gets stolen, there's no backup.
    - If the RAID controller shits out and takes a couple of drives with it (uncommon, but has happened), there's no backup.
    - If there is a physical disaster and the machines spontaneously combust then their is no backup.

    I don't know if there is a cheap solution for what you want...

    If anything I would say plugging in hop swap drives just to backup in to your machines, and then take them offsite when the backup is done, as well as RAID if you can afford it....

  36. Duplication Tower by Anonymous Coward · · Score: 0

    I believe that it would be worth the money to invest in a duplication tower at this point, the ones with the mechanized arms, preferably one you could hook up to your computer.

  37. Why not tape drives? by Silent8ob · · Score: 2, Insightful

    Look at what the rest of the corporate world uses for large scale storage management. It is still ruled by Tape drives.

    I don't know how much an eye goes for at the moment, but if you can spring for a Super DLT drive you'll get up 320GB (Compressed) for each tape.

    It all comes down to the Quality:Cost:Time triangle.

  38. Easy by Pedrito · · Score: 4, Funny

    I use bioneural gel packs at a cost of $0.04 per teraquad. What is this hard drive of which you speak?

  39. What you want, for price. by Crasoum · · Score: 1

    Well it isn't going to happen, you -HAVE- to drop change for what you want, as a back-up solution. There really isn't any way around that.
    There are many plausible suggestions though that won't break the bank totally. One of course is raid as has been mentioned and will be a few times I imagine. But you may also wish to look into hot swappable solutions.
    USB 1.1/2.0, Firewire and SATA are all relatively cheap storage solutions if you shop around (Pricewatch is a good place if you are willing.). You can convert IDE drives to USB with an IDE>USB box, and buy a few decent 200 gig hard drive for around $120~$150.
    Another could be buy a SATA card and some SATA drive and plug them into the front of your case, SATA 2.0 is hot swappable and the hard drive prices have come down into a decent range.

    Now another solution is buy used SCSI, and raid those together, reliable fast and not overly expensive if you don't want 15k RPM.

    Another idea is buy another box, place a few hard drives in it, and use that box as your back up, but it's a hassle more so then the rest, but as a plus you can place it somewhere else as an offsite backup and all you have to do is plug it in and your work is ready to go (from the place you most recently backed up.)

    With incremental back-ups it may not be too bad.

    Then again you are moving terabytes.

  40. One word... by Anonymous Coward · · Score: 0


    Paper.

  41. Of course... by etnoy · · Score: 0

    I was wondering, if Slashdot readers have any recommendations for a cheap automated way to store and retrieve data.

    Remember the data in your brain, that is much cheaper than buying disks IMO.

    --
    Quantum hacker.
  42. P2P Backup by billstr78 · · Score: 0, Redundant

    That's easy. Name your files "Hot Lesbian 4-some[0000-1024].mpg" and make it available to the P2P sharing community. Every horny male in the country will then help to download and distribute your data across the country and when you want file number 5 back, just search for Hot Lesbian 4-some0005.mpg and bingo, 10 available good copys ready for re-use. ..

  43. Tape by NitsujTPU · · Score: 1

    Buy a tape drive.

    If you have money burning a hole in your pocket, but a tape changer, so you don't have to change the tapes.

  44. How much is an eye? by AllNicksWereTaken · · Score: 1

    How much does an eye cost?

    1. Re:How much is an eye? by harkabeeparolyn · · Score: 1

      A person with two eyes would probably be willing to give up one eye for enough money to never have to work again. For the billions of people living on $2 a day, $10,000 is enough. But in that case you have to wonder if that eye has been getting enough vitamins over the years to keep it healthy. As with most things, you get what you pay for.

  45. Hijack Cassini by Anonymous Coward · · Score: 5, Funny

    ... and program it as a repeater.

    It's about 90 minutes away, so at 250 Kbps that's over one terabit in storage on the way out there, and another terabit on the way back.

    Worst-case access latency is about three hours, though. Maybe the hard disks are a better idea.

    If you send your probe^H^H^H^H^H repeater to Alpha Centauri, you'll get more than 20,000 times the storage capacity.

    1. Re:Hijack Cassini by mangastudent · · Score: 2, Interesting
      ... and program it as a repeater.

      It's about 90 minutes away, so at 250 Kbps that's over one terabit in storage on the way out there, and another terabit on the way back.

      You laugh, but before Project Whirlwind (which created the physical modern computer as we know it) settled for 16-64 bit CRTs (DRAMs, I suppose? :-) and then invented 3D core memory, they seriously considered leasing a micro-wave line to do exactly that.

      At the very beginning, people were really hard up for memory solutions; e.g. the first Univac model used mercury delay lines, a variant on this concept.

      (The out of print Project Whirlwind: The History of a Pioneer Computer by Kent C. Redmond is recommended if you're interested in this area of history.)

  46. i know you know the smart ass by nazsco · · Score: 1

    backups? I upload to an ftp server and let the world mirror it for me

  47. Use those HDDs! by jsm008us · · Score: 2, Interesting

    You can get cheap computers from the trash, donations, bulk, etc. You can use that cluster to mirror your data once or twice. I don't know what data you have, but if you have the same data on more than 1 different hard drive, you can be rest assured it will be fine. Or you can just print it all!

    The stockmarket is backed up to three (or more?) seperate locations. Look into NVRAM (e.g. flash media) or a cluster with all those hard drives linked together, with a constant backup. With the builtin IDE controller on most motherboards, you can hook up to 4 Hard Disk Drives. If you add SATA, RAID, SCSI, and IDE, you can have lots of hardrives on one machine!

    You could also rotate hard drives, so they arent constantly used (making the whole system last a LOT longer!) or replace the drives that are about to fail (which would be at least in 3 years!). Most Hard Drives could probably handle 5 to 10 years no prob (maybe even 20 if they are rotated!).

    It all depends on what you have and what you want to do!

    --

    mysql>SELECT * FROM users WHERE clue > 0
    0 Rows Returned
    1. Re:Use those HDDs! by Cecil · · Score: 4, Funny

      I'm just curious, do you have any idea how much data 1 Terabyte is? Are you suggesting that he PRINT it?

      Let's say for the sake of argument that all 256 bytes can be printed as a visibly distinguishable character, or that he's got 1TB of plaintext. Also assume you can fit 10,000 characters on a 8 1/2 by 11 page.

      You can fit 10^4 bytes per page, and you need to print 10^12 bytes (I know, it's actually 2^120, but that needlessly complicates the math, so shush)

      That means you will need 10^12 bytes / 10^4 bytes/page = 10^8 pages.

      One hundred million pages. Assuming he has a good laser printer with infinite toner, let's say he can print 60 ppm or one page per second. It would take one hundred million seconds to print the data, which is 1157 days, or a little over 3 years.

      Given that he generates 1TB per month, I think this backup plan would probably become the top agenda item of most of the anti-deforestation groups out there.

    2. Re:Use those HDDs! by eatjello · · Score: 1

      if it would take 37 months to print 1TB of data on one printer, and he generates 1TB of data a month, all he needs are 37 printers, and infinite paper to go with his infinite toner ;)

    3. Re:Use those HDDs! by deranged+unix+nut · · Score: 3, Interesting

      Depending on how you do it, you can get a lot more than the density that you assume. Check out www.paperdisk.com.

      That said, this method would still be more than twice as expensive as storing data on hard drives, would still require a million pages, but would take a little under 2 weeks to print.

      It still doesn't seem like a feasible option.

      The up-side is that, if stored properly, the data would likely be safe potentially for many hundreds of years.

    4. Re:Use those HDDs! by kava_kicks · · Score: 2, Informative

      That Paper Disk idea is pretty interesting. I read ages ago that a large Australian government agency (whose data storage goal was LONGEVITY rather than short-term backup) - might have been the Bureau of Statistics - chose to store/archive all of their data onto microfilm ...

      "Microfilm!?" I hear you say, "But this is 2004!". I was suprised too until I heard their reasoning: the only thing you need to read microfilm is a magnifying glass and a light source.

      And that, ladies and gentlemen, makes it virtually future-proof. Try finding a hard-drive for your old 5 1/4 C64 floppy disks (or better yet, those bloody tapes that took 30 minutes only to fail to load)

  48. Storage solution by Anonymous Coward · · Score: 0

    Compaq's StorageWorks SSL2020 AIT library is a single or dual drive library that offers 2 terabytes (2:1 compression) of storage in a 4U tabletop or rack configuration. Library modules can be stacked five high for up to 10 terabytes of storage within a 20U space. The SSL2020 is qualified with Windows NT and Windows 2000, NetWare, Tru64 UNIX and OpenVMS operating systems, as well as Compaq ProLiant and Alpha-Server product lines.

    http://www.aittape.com/

  49. Do what Google does by glinden · · Score: 5, Insightful

    Build yourself a cluster of cheap boxes with cheap IDE disks and replicate your data across them. Because the data is replicated across your cluster, no need for backups or RAID.

    1. Re:Do what Google does by swb · · Score: 3, Insightful

      It's a great idea, but one of the problems is what happens when your data goes bad before you realize it and it gets replicated. Then you want what you had yesterday, and that means tape.

      You can solve this by ensuring some kind of in-process backup (like a SQL maintenance schedule, where it replicates itself), but then you're loading your replication process with a bunch of data that doesn't really need to be online, it needs to be in a vault someplace.

      Besides, Sarbannes-Oxley and the IRS want you to keep backups 5+ years anyway, so this replication-only model is only good for data whose internal integrity isn't meaningful to anyone but the owner.

    2. Re:Do what Google does by Anonymous Coward · · Score: 0

      You could argue that Google's data isn't really that important. If they lose a few bits of a cached file somewhere, it's no big loss, as the Web's constantly being updated. In fact, due to this consideration, uptime is much more important than accuracy--the illusion that Google never breaks is more important than always returning the 'right' answer. Contrast that situation with a bank, where you need a 100% accurate and verifiable audit trail.

      I mention this because no backups is really a bad idea. A cluster of cheap boxes + cheap IDE disks is basically RAID (except you're buying cheap boxes instead of a RAID controller--generally more expensive and slower, BTW), and a RAID array will happily and reliably wipe out all your data on command. If the data's important, backups are still necessary.

    3. Re:Do what Google does by ednopantz · · Score: 1

      Because the data is replicated across your cluster, no need for backups or RAID.

      Um, until the house burns down or you fark your own data by mistake.

    4. Re:Do what Google does by glinden · · Score: 1

      You make a great point. Because of the data size (1T/month) and the description of his needs ("generating, storing and analyzing a large amount of data"), I assumed he was not working with financial data.

      The financial transactions needed for business accounting should be handled more carefully, probably both multiply replicated on-site (in a database cluster and with RAID drives) and regularly backed up to tape that is stored off-site.

  50. Vital information left out by dfghjk · · Score: 3, Insightful

    How many months at 1TB/month do you require access to online? After you are done with data can you discard it or do you need it archived? What is the cost of losing your data set at any given time? In what manner do you expect to access it (read/write mixture and sizes plus aggregate throughput and number of client connections). The answers to these questions could cause the cost of a solution to vary but a couple orders of magnitude.

  51. options options, what is your time and data worth? by segfaultcoredump · · Score: 5, Insightful
    Lets see.... hard Drives are running about $0.50 per GB, DVD's are running about $0.06 per GB (100 pack, "house brand", not something I'd put my data on but this is slashdot, and there are idiots out there who think that it is a good idea), and tapes are also running about $0.20 -> 0.50 per GB (for the DLT/AIT/LTO type, the ones that have enough capacity to not drive you nuts)

    So, you can put your data on 4-5 HD's, 10 tapes or 232 DVD's per month. The Cost of doing so will be about $500 per month for the tapes or HD's and $50 for the DVD's (assuming your time cost $0)

    At work, we had a need to keep a few TB of data online permanently, so we purchased a few NexSAN ATABeast's. At $50,000 for 10TB of usable storage ($5/GB), they may be a bit out of your price range. The advantage is that you can hold almost a years worth of data and it is protected by RAID5. It also makes management a lot easier, since it is very difficult to mount 42 300G drives in a single chassis (and it takes only 4U of rack space).

    On the low end, NexSAN has the ATABoy2 or ATABaby (2TB or 1TB) for the $8-$15K range. This will let you hold a months worth of data

    On the high end, You have EMC disk arrays (Think upwards or $20+/GB for the 'cheap' stuff from them.

    Overall, if you have 1TB per month, you need to either a) get a grant to fund your work, b) hire somebody to swap DVD's for you or b) seriously rethink your data generation.

    Any of the "cheap" storage methods have serious drawbacks, and the low cost ones are, well, not so low cost if $15,000 sounds like a lot of money to you.

    otherwise, good luck

  52. If its volume you want by TheUncleBob · · Score: 5, Informative

    If you are more interested in volume than speed, then the emphasis should be on the 'ID' part of RAID. Inexpensive Disks. If you used 160GB Drives, which appear to have the best bang for your buck at the moment, and put 6 (yes 6!) in a pc. Just use any old cheap pc (I use 200-400Mhz PII)

    Run the disks RAID 5 and you will get about 800GB of storage for $600 . Now get two cheap ata100 cards so you have a total of 6 channels, and mount each drive as a master on each channel. Build a 2gb root partition on the first disk (mirror it if you want) and then set the rest of the space up as a huge raid 5 array.

    Et Voila cheap, big server. To archive data, turn off pc, and throw into attic :-)

  53. my solution by stonetemple · · Score: 1

    - Several 80-gig drives
    - 1 removable IDE hard drive enclosure
    - 1 fireproof safe, preferably bolted to the ground or kept off-site (for the particularly paranoid)

    --
    --- Robert Strickland
  54. Bulk storage by Anonymous Coward · · Score: 0

    Compaq's StorageWorks SSL2020 AIT library is a single or dual drive library that offers 2 terabytes (2:1 compression) of storage in a 4U tabletop or rack configuration. Library modules can be stacked five high for up to 10 terabytes of storage within a 20U space. The SSL2020 is qualified with Windows NT and Windows 2000, NetWare, Tru64 UNIX and OpenVMS operating systems, as well as Compaq ProLiant and Alpha-Server product lines.

    http://www.aittape.com/hewlett-packard.html

  55. Network Appliance by Anonymous Coward · · Score: 1, Interesting

    No I don't work for them, but I work with their H/W & their support is second to none. You can get a recon'd R100 for reasonable money. New, they cost ~$100k for 12TB.

    They aren't File servers, as they aren't designed for lots of clients. But they are perfect for storing a 'live' backup of data ! They can the Technology Nearstore, its designed to sit between your File servers & your Tape backups

  56. CD Changer by andrebsd · · Score: 2, Interesting

    Well, I have a cd changer for computers made by NSM... It's scsi (comes with a 2x reader origionaly) so all you gotta do is find a scsi dvd burner (or a long enough ide cable and convert it, since the motors are all powered by a com port anyway) and replace thd drive, (or like in my case, a cd-rw - had the drive for a while, so at the time a dvd burner would have cost to much) then you have 100 dvd's you can burn data to automatically, and when those are full just swap them out for new ones.

    Now the problem is, you can only get 430gig's out of one changer using single layer dvd's... Double would bring you to 970gig's per changer.

    Assuming you can get the unit for 100 bucks or so, and the dvd drive costing 100 (69 bucks at frys).. Then you have a 200 dollar backup unit that can store 430gigs of information onto dvd's

  57. Doesn't your company back-up anything else? by Larry+Lightbulb · · Score: 1

    That amount of data has to be a company, not a common man.

    1. Re:Doesn't your company back-up anything else? by Anonymous Coward · · Score: 1, Insightful

      I know an amateur digital photographer who generates close to 1TB / month with a Canon 1D MarkII. Storage is a major problem.

    2. Re:Doesn't your company back-up anything else? by eatjello · · Score: 1

      an amateur with a 1DmkII? I envy him/her...

    3. Re:Doesn't your company back-up anything else? by Mr.+Roadkill · · Score: 1
      I know an amateur digital photographer who generates close to 1TB / month with a Canon 1D MarkII. Storage is a major problem.
      Has anyone compared the economics of film cameras - 35mm and/or medium format - with digital photography for different classes of users?

      Negative scanners are reasonably cheap (sub US$1000 for reasonably snazzy 35mm ones, I have no idea on medium format) and a collection of thumbnails with index details ought to be sufficient to pull a particular negative from the filing cabinet for re-scanning if a print or photochop is required.

      Of course, this loses several of the advantages of digital photography (instant availablity, instant delete on the shot where the fat guy walks into frame etc), and developing costs need to be taken into account too, but it might just be more affordable for the next few years.

      At what, 2.8 to 3 Mb per shot on best quality, that'd be around 350 photos per month. That's a little under 10 rolls of 36 exposure film if we limit ourselves to 35mm. You can buy a 20-pack of Fuji Professional 400asa 36 exposure for around US$75 on line.

      Of course, I'm not a photographer of any greater standing than "Get your fingers out of your mouth, Sweetheart - no, kitty doesn't like cheese in her fur", so my figures and assumptions could be way off. But on the surface, even taking into account developing costs, it would appear much cheaper at that sort of data volume to use film and a film scanner, and only keep or make digital copies of what you really need at any particular point in time. When an easy-to-use, reliable, removable terabyte hits the US$100-200 mark, it will be a better option. Until then, I think film probably wins out on price.

      Also, 35mm negatives have very well known archival properties and it won't be a nightmare in 35 years making a scan or print from them, but it could be hard getting an antiquated 800Gb hard disk repaired so you can suck data from it.

    4. Re:Doesn't your company back-up anything else? by SmartSsa · · Score: 1


      1 TB in pictures?

      1 TB.. 1024 GB.. 1048576 MB
      = 126334 pictures.

      That's (using the RAW format, assuming 8.3MB per picture, according to the spec sheet on canon's site) a _fuckload_ of pictures for an Amateur. Even a professional...

    5. Re:Doesn't your company back-up anything else? by slandis · · Score: 1

      Not only is this an outrageous claim, but as noted in another comment below this; That's (at the very LEAST), 12,000 pictures a month. TWELVE THOUSAND.

      Something seems awry here. Even if that were the amount of photos he were taking... the man must never sleep. That's a photo every 21(.2) seconds. All day. For 31 days.

      I think you're exaggerating, or also including storage space for editing images, final cuts, etc.

      --
      BAM!
    6. Re:Doesn't your company back-up anything else? by slandis · · Score: 1

      Okay, so I read wrong. 126 THOUSAND. Even more silly.

      --
      BAM!
    7. Re:Doesn't your company back-up anything else? by Anonymous Coward · · Score: 0
      I work for a small professional photographer and we generate roughly a terabyte every two weeks. .PSD files with a lot of post-processing are a hell of a lot larger than the original source, as are the .tif files we send to labs.


      I'm reading this story with a lot of interest because we still haven't found a good way to store all of this stuff, we're gambleing on redundant DVD-Rs currently.

    8. Re:Doesn't your company back-up anything else? by Anonymous Coward · · Score: 0

      I have a Olympus E10 and that generates about 20 MB per shot in raw mode , thats with the 4 megapixel CCD . i think the E20 / E100 has a 6 megapixel ccd so it would be around the 30-40 MB per shot mark. If this guys willing to spend the money for the near-pro quality cameras like an E10 its pretty likely he also takes enough shots a month to get 1TB of data.
      I know a guy who does wedding videos in his spare time , he switched to digital filming and editing last year and for a 30 - 40 min final (approx 1GB dvd) product he handles about 10 - 20 Gb of raw footage. He uses 2 IDE drives with a RAID card as his working area.

  58. All depends on retension... by HockeyPuck · · Score: 1

    What are you retension requirements for this data? 2 months? 1 year? Forever? How often do you need to access this backed up data?

    If you requirements are rare restores, then I'd go with tape. You can back up to a tape drive about as fast as a ATA disk and you can move the tapes offsite. Restores w/tape are a bit more painful, but then a tape isn't as delicate as a disk drive.

  59. 4*400Go Sata on Raid 5 by da5idnetlimit.com · · Score: 5, Informative

    depending on the value of your data, you should try having a nice 4*400Go SATA in raid 5 *2, possibly using a distributed file system for redundancy...

    Not the cheapest, but fast, simple and saves you the unholy pleasure of having 2-3 DLT boxes to archive/cycle each month...

    You already have a linux cluster, so implementing a distributed file system, or even simply a nightly incremental mirror to the target server if you can afford losing one day work/computation...

    It would help if you told us what sort of data you work with... from databases and to automated telescope tracking system, both need large amount of storage, but you won't need the same system array for each...

    I seem to remember a /. story on a rackable Petabyte storage system

    You don't need to go to the Petabyte capacuty but you will find some interesting comments on filesystems, disk virtualisation, 1U rack providers and so on....so a 1 Terabyte rack server is definetly possible...

    Good luck...

    --
    It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
  60. DLT is the way to go by pastpolls · · Score: 3, Informative

    I actually use a DLT with autoloader I got off ebay for under $200. I then bought a lot of used DLT tapes (100) and use them to backup my Video and DVD projects. It is great because when I fill my offline storage (about 1TB) I just fire up the backup software and get the old DLT going overnight. It is done by morning and the shelf life for those tapes is about 20 years.

  61. What? by Anonymous Coward · · Score: 0

    Why would you even think that?

    1. Re:What? by Anonymous Coward · · Score: 0

      Why would you even think that?

      Does the word "spam" ring a bell?

  62. Ultrium by 7vEn_T_7vEn · · Score: 3, Informative

    I'm not sure what your budget is but if your like me you want something that complies to standards so it will be around, is cheap and effective. For this I would have to recommend an Ultrium tape backup drive. The drive is standards based (google it) and the tapes are dirt cheap a 200/400 gb tape pulls up for $55. If you figure (hardware compression) 250gb of storage per tape then it will cost just $.22/gigabyte. The problem is that the drive itself is listing for about $2600, not exactly cheap but it's guaranteed to be backwards compatible with future lto standards and the media is as cheap as you could possible ask for. One more thought, look into an LTO Gen 1 solution (100/200) for a cheap drive, cost per gigabyte is roughly the same, it will just take more swapping.

  63. Consider Online Backup by jp10558 · · Score: 2, Interesting

    One company that provides massive online backup and storage at reasonable prices is Streamload. You might want to check them out.

    --
    Opera, Proxomitron-Grypen,GPG 0x0A1C6EE3
    1. Re:Consider Online Backup by Anonymous Coward · · Score: 0

      Reality check: it would still cost over $715 / month, and only works if a minimum 3.5Mbps can be sustained for 28 days (24 hrs / day). Somehow I don't think xfer over the net would keep up to this guy's needs.

      Maybe buying HDs at the price/capacity sweet spot du jour is a better choice.

  64. Only on slashdot by gexen · · Score: 5, Funny

    Only on Slashdot would they start talking about huge storage arrays and title it "for the common man"

  65. carousel for everyman by Doc+Ruby · · Score: 3, Informative

    A DVD-R jukebox can give you 200 DVDs at once. That's $3600 (drive/changer) + $268 (1000 DVD-Rs), for (1000*4.7GB) 4.7TB@$4000, or $1.18:GB. That's almost double your HD cost, but you'd need at least another host PC, and multiple controllers for the 16HD RAID, which is probably another $1000. And another $268 buys you another 4-5 months storage, so by next April you're down to $0.14:GB; in a year you're at $0.12:GB. A shelf of 200-disc "CD" books will hold your archives, 1 book per carousel for "fast" retrieval. Backup all your DVDs offsite at $0.27:GB. As DVD-R prices fall over time, you're probably looking at something like $0.05:GB, probably less than even plummeting HD prices. And the DVDs (especially with the cheap backups) are much more reliable, especially over 10 years, than the HDs. If you are looking at 10 year archive, at $80:month in DVDs, for 29% more money you can add a second host PC/changer set, left in their boxes, in case the original PC/changers fail.

    --

    --
    make install -not war

  66. Cheap hardrives with firewire enclosure by ippearx · · Score: 2, Informative
    http://www.wiebetech.com/products/ComboDock.html

    Makes an cheap, fast way to put lots of data onto lots of hard drives. Using one of these bad boys means no extra money is spent on drive enclosures, cases etc. You only buy raw standard hard drives. Excellent if it's only backup, and you do not need lots of access. This solution is not automated however.

    Hard drives are prone to failure. I was thinking of buying at least 2 drives of different brands to mirror, storing them in separate locations in sealed, air tight containers at just the right humidity/temperature. Also I think a disk check every 6 months or year would be necessary, and if any problems are found, replace the disk with another.

    One beauty with this method is you only need to pay for disk space as you need it, and hard drives may still get much bigger. I was going to buy drives at the lowest cost/megabyte which at the moment is 160GB drives.

    I would love to find more information on the physical storage of hard drives, especially how long they would be expected to last without use - months? years?

  67. Um, 1TB a month in IDE drives is cheap . . . by millisa · · Score: 2, Informative

    I would hope that if you are working with a TB of data, the value of that data is pretty high . . .

    Promise SX6000 = $255.95. (6) 200GB IDE drives in a Raid 5 = $624.95

    If you had a separate boot drive from the SX6000, you could just bring the system down for a couple hour maintenance once a month and slam all the drives out and put fresh ones in.

    Just keep buying new 200GB drives anymore and shelf the old ones (or if its *really* valuable and your home firesafe isn't enough, pay Iron mountain or someone to keep it).

    There aren't hidden labor costs outside of those two hours it takes to setup a new array every month (DVD's are about 60 bucks a month for a TB, with a hundred or so for a drive (which *will* need to be replaced occasionally if you are burning that much) but you'll spend hours and hours just dealing with the swap outs and breaking up your data . .. )

    If you don't have to keep the TB of data after a month or three, then your price gets even cheaper after you invest in your initial hard drive media sets . . . and you can put all the drives in hot swap chassis to further minimize your time dealing with the issue.

    Of course this is all moot if your 1TB of data isn't valuable enough to invest 600 a month in . . .

  68. harddrives by Anonymous Coward · · Score: 2, Insightful

    For cheapest backup possible, just use harddrives. Create a software raid5, backup to it, then powerdown and remove the drives to someplace safe. You'll also be able to recover the drives on any machine that can boot linux.

    Hotswap or removable drive cages can be pricy, and aren't designed for lots of swap-ins and outs, so I'd just buy new IDE or SATA cables every few backups. If you're using the same set of drives multiple times, then leave the cables connected as not to wear out the drive's pins.

    Eventually you'll wear out the ide connectors on the motherboard, so use one of those cheap ide adapter cards and replace as needed. Or use a cheap motherboard.

    It's too labor intensive to be in the same realm of solutions as a nightly tape backup, but not nearly so much as CD or DVD backups. It's easy enough to do once or twice a month.

    If you're cheap, you're not after disaster recovery, you want disaster mitigation.

  69. DVD Autoloader by SuperJason · · Score: 3, Insightful

    Explain this to me, I can buy a 200 disc cd changer for $100 bucks, but the same thing with a burner (cd/dvd) runs thousands of dollars. Isn't there any company out there that can do it cheaper?

    Heck, I remember a slashdot article about a guy who built one out of WOOD!

    This would be a great solution for short term recovery storage. Just keep a stack of CD's or DVD's ready, and it will load them in and burn them all automatically.

    On a site note, it would be great for converting a 400 disc cd collection into MP3's.

  70. RAID 5 by Alcimedes · · Score: 1

    It has a few bonuses going for it.

    First, you get the speed of RAID drives, and you only lose 1 drive of your entire set. (So the more drives you have, the better it gets)

    Of course, if you're going to go with RAID, you need to duplicate that data somewhere else, just to be safe.

    Along these lines I'd recommend another RAID 5 in (at minimum) another location. With RAID 5 it also covers the eventual loss of a drive from your array, and can rebuild it. Of course, you can now spend as much or as little you want on your RAID 5 setup, so the costs will vary widely.

    Then again, you often get what you pay for.

    1. Re:RAID 5 by Brian+Puccio · · Score: 1
      First, you get the speed of RAID drives, and you only lose 1 drive of your entire set. (So the more drives you have, the better it gets)
      The flip side of this is that the more drives you get, the higher the chances of one drive failing are. Sure one drive failing is fine, but if you had a 25 drive array or something, and 2 drives failed (or, rather, one failed, the hot spare started to rebuild, but before it could finish, another one failed), you're screwed.

      More drives means you "lose less space", but it also means you up the chances of the array tanking.

      (Oblagatory "RAID is not a backup either".)
  71. I have a cheap solution by SteamyMobile · · Score: 1, Offtopic

    Using this method you can store surprisingly large amounts of data very cheaply. It's also protected from MS Windows viruses, but perhaps not all viruses.

  72. Common man? by MavEtJu · · Score: 1

    Lately, I have been looking into different bulk data storage options available to a common man. My work depends on generating, storing and analyzing a large amount of data -- averaging about 1 TB per month.

    Wrong assumption, 1 TB per month is not a common man problem.

    --
    bash$ :(){ :|:&};:
  73. Hmmm ... data is what brings in your money by hattig · · Score: 2, Insightful

    So spend some time and money on making sure it is safe!

    Even if you had a Bluray DVD burner, that would be 20 discs you'd have to burn to backup 1TB. So that is out of the question.

    Really what I'd set up is:

    1) Local: 1TB of hard drive space on IDE RAID (mirrored). An 8-port SATA controller would do, with 8 250GB SATA drives.

    2) GigE ethernet to somewhere else (got a separate garage?), or something faster if affordable

    3) A file server there with the same config for "off-site" backup. Should your PC catch fire and melt, you'll still have your data. Yeah, backing up 1TB of data over GigE will take around 15000 seconds a go, or 4 hours or so. That's okay overnight, and better than swapping 50 BluDiscs or tapes and then carrying them out there.

  74. Exabyte by HonkyLips · · Score: 2, Informative

    Try Exabyte - for hardcore tape storage.

    http://www.exabyte.com/products/prodviews.cfm

    I think you can store about 1.6 TB on a single tape or similar, but check them out. Tape drives have come a long way from old SCSI DATs transferring 20meg a minute. And they're fully automated and although there's an outlay cost for the tape drive, over time the cost per gigabyte for storage will be lower than hard drives.
    If you have a security company do patrols of your office you can get them to take the tape offsite with them after nightly backups for added security... etc etc.

    --
    Putting syrup in coffee is some form of blasphemy.
  75. Me too... by gooman · · Score: 1

    mine say AOL on 'em.

    (Why is it I don't throw them away?)

    --
    "Kittens give Morbo gas!"
    1. Re:Me too... by SpaceLifeForm · · Score: 1

      Because they are re-usable?

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
  76. unstanding of raid by Down8 · · Score: 1

    RIAD is a possibility?

    What does that mean? How does RAID by itself help someone lookinf for an automated storage system? RAID is just a way to add redundancy, speed, or both. It isn't a magic bullet to increase capacities.

    -bZj

    --
    .sig
  77. It depends - snapshot or historical archive by clovis · · Score: 1

    Do you simply need a snapshot to restore the most recent point in history, or do you need the ability to restore some point in history?

    Consider this scenario where I work:
    People have documents for once-a-year reports. They need to make a new one for all the stuff that happened this year, and they need the final from the previous two years. However, a virus (or new-hire) went through and randomly corrupted and deleted several documents some months ago. A mirrored system can quickly give you the data exactly as it was yesterday, but that would still be bad data.
    If you need to restore/access things as they were 6 months ago, or a year ago, you gotta have tapes or DVD's. Affordable solution are too slow, that is it takes longer to do the backup than allowable downtime (suppose DVD's take 20 hours to backup 8 hours of work - it won't fit in a day)

    There's many options. One thing you can do is mirror your system onto cheap disks which gives you a quick snapshot. Then use tapes/DVD to do periodic full and incremental backups of the mirrored system.
    The you can use a cheap backup system and hope the tapes don't go bad. (They will).

  78. Printer & Scanner Backup System by Maljin+Jolt · · Score: 1

    Print your data with tiny bar code or ocr font. Don't forget to number pages.

    Several months later, you can try to sell some outdated backups to geeks as a wallpapers.

    --
    There you are, staring at me again.
  79. cheap drives can be cost-effective by pedantic+bore · · Score: 1
    For large quantities, tapes are probably the way to go, but the hassle of dealing with tapes is something to consider. So for small quantities (a TB a month or so) offline disks make sense. You can get a 1 TB self-contained firewire array from LaCie for ~$1,300 (and there are probably competitors in this space). These are nice because they allow you to store the data in a normal file system. Just remember to keep the drives in a safe place, and power them up every month or so to keep the moving parts happy.

    The real nuisance with backups is if you need to keep them for several generations of hardware/software. In this case you need to keep a copy of everything you need to read them. Firewire is probably good for another few years before they start to fade into obscurity (unless it suddenly blossoms into popularity, reversing the apparent trend) but my guess at this point is that your new PC ten years from now won't support them in any way. Jump backup technologies before they become obsolete, but don't jump too soon either -- wait until the technology is stable and has shown that it is likely to have a long life-span.

    --
    Am I part of the core demographic for Swedish Fish?
  80. Tiered Data Storage by Anonymous Coward · · Score: 0

    It all depends on what level of recoverability you want. Here is the enterprise view:

    1. Software: We use Tivoli (IBM) this software and its cost (Maintenance) is high because IBM guarantees recoverability of data it stores.
    2. Hardware: Tivoli caches backup data on disk(compressed) preferably RAID, then after a period of time or other criteria it moves it off to tape, using a tape library is preferred since it is automated. Two tapes are created so that one is shipped off-site. Off-site storage is required for recoverability.
    3. A second set of hardware and software is required if you want to insure that you can recover in a reasonable time period.

    Cost is directly proportional to how much of each component you wish to utilize, which then determines whether you can recover at all and in what time period.

    Cost is exponential for these components and is determined also by size of data that must be recovered, either a small amount quickly (seconds or minutes for small amounts of data to hours or days for large amounts of data).

    Remember, another high cost option is to mirror the data at two locations that can be accessed within seconds if one location fails.

    Cost is high if recoverability and recoverability time period is important. Cost is low if you do not need the data back quickly or do not care if all of it is available ever again.

    If this is a job critical function *Business depends on it* then the higher cost options are required. Sorry. If you want recoverability you pay more for each degree of recoverability...there are multiple layers to be considered.

    This is the art and theory of Contingency Planning for data centers. Oops, I am letting out the secrets of the professionals... wait I am one, I do this sort of planning for a living. Cost benefit analysis is a requirement in this sort of thing. Also, consider if there are other requirements for recoverability such as legal, corporate intellectual property, etc. These all factor into the cost one must be willing to pay for recoverability.

  81. Blue? by shadowkoder · · Score: 1

    As far as DVDs go, what about the new blue-ray dics coming out? I thought ~.5TB was possible per single sided, dual-layered disc.

  82. Re:options options, what is your time and data wor by Anonymous Coward · · Score: 1, Funny
    "you need to either a) get a grant to fund your work, b) hire somebody to swap DVD's for you or b) seriously rethink your data generation."

    I choose b ... HEY! That's a trick question!

    Meh.

  83. Large Storage System. by TechGladiator · · Score: 1
    Let me start with saying that I dont nearly have 1TB/Month but have the need to have over 2TB of data online because of different client's (OLD)Databases that need to be online for reporting.

    After looking around for a couple of months I bought myself a 12 Drive SATA hotswappable case from from Rackmountpro.com and bought 12 250GB SATA drives and a 3ware card. It works like a charm. Speed is great and I think the price of the 250GB drives are right (WD2500JD $165/Piece).

    For the longest time I was looking for a CD/DVD sort of Jukebox where I could have like 200-400CDs (Real CDs/DVDs, not ISOs) but I had no luck finding anything. Even if I had some kinda program telling the jukebox to change the CD/DVD would be great, but googled it for a while and came back empty.

    I am still looking for a jukebox solution for archivals puposes if anybody has any idea where I can get/make this.
  84. Hard copy by NIK282000 · · Score: 0
    Hard copy is safer then any other storage, make hex dump all your data and print it out. Even if you have a book with as many pages as windows has bugs, they will last decades if stored properly. Now retreiving that data and making it usable again is up to you.

    --
    Dear aunt, let's set so double the killer delete select all
    1. Re:Hard copy by eric76 · · Score: 1

      You could store the data quite reliably in much less space with a good quality laser printer.

      I've heard of someone who did it with very tiny forward and back slashes. The paper reportedly looked gray but a good laser scanner could pick it up without problem.

      What would be really interesting would be an informal contest to see who could store the most data in a single page of paper.

  85. The simple economics of your "work" by swb · · Score: 3, Insightful

    If your "work" (as in food, housing and income) requires this kind of storage, you should be charging the kind of money that can make the ecomomics of such data storage actually viable. I'm assuming that some of the really high-end storage devices from EMC, Hitachi, et al could handle your data generation/replication/backup needs effortlessly.

    If that's too expensive (and it usually is), you can kludge your own system using low-end stuff from Hpaq/IBM/Dell's x86-server-oriented product lines. LTO1 drives are pretty cheap and we've found them to be very reliable over the past 3+ years, as well as offering 100 gig native per tape.

    If even that's too expensive, then I seriously think you need to re-think the economics of your work situation. If your work doesn't cover your capital costs, you're not charging enough. If the work and data are business valuable enough, cutting your storage bill to the bone by building Linux clusters crammed with IDE HDDs is just a bad business decision.

    If this is just your hobby-type work, then you need a cheaper hobby, like heroin addiction or something affordable. Physical space and electricity aren't cheap enough in a metropolitan area to burn through 1TB of storage per month, let along reliable data storage.

    1. Re:The simple economics of your "work" by Anonymous Coward · · Score: 0

      Except hard drive sizes keep increasing. Those 100gb drives from a year ago are 200gb now. So if he's doing 1tb/mo, that's about 10x100gb drives. But then then 200gb's replace 100gb drives as the affordable price-break point and now he can store the second year's worth of data in the same physical space (10x200gb drives). If 400gb drives become as affordable as the 100gb were and 200gb are - in about another year - then he'll be able to store the third year's worth of data *plus* another forthcoming year's worth of data... all in the same physical space (10x400gb drives).

  86. A few options, but more info would help... by isaac · · Score: 1
    Your posting doesn't really give much in the way of requirements.

    Do you need for all the data to be online at once? If not, consider a small LTO tape autoloader - a 24-slot loader will run you about $13k when loaded with LTO Ultrium 2 media, but you get 4.6 TB of storage, assuming you fill 23 slots with tape (you should always have a cleaning cart in the library to run at regular intervals). If you're really cheap, you can write a set of perl or shell scripts to operate the library with mtx and mt, rather than buying expensive software.

    If you absolutely need all the data to be online forever and ever (amen), then you're going to need a fat wallet. An ever-growing homebuilt solution will rapidly become unmaintainable. Consider an off-the-shelf NAS or hardware raid solution. Apple's XServe RAID boxes are surprisingly cheap for fiber-attached hardware RAID - $11k gets you 2.5TB of RAID5 (and that's with 2 drives reserved for hot spares, you could squeeze 3TB out of it with no hot spares, but i don't recommend it.)

    If you don't think you'll need to expand forever, you can start looking at homebrew options. A 12-port 3WARE SATA controller with 12 250GB SATA disks should run you less than $3000 and give you 2.5TB of raid5 with one hot spare. (Of course you'll need a system with the case and power supply(supplies) to handle them. Next year, twice the storage might cost the same or less - 400GB SATA drives are already shipping, though still cost more per gig.

    Basically, you're not the common man. Like another poster said, you need to consider what the data are worth, and buy your storage accordingly.

    -Isaac

    --
    I am not a lawyer, and this is not legal advice. For Entertainment Purposes Only.
    1. Re:A few options, but more info would help... by Anonymous Coward · · Score: 0

      Look at Apple's Xsan (in beta ATM) for a scalable SAN filesystem with multiple XServe RAIDs

      If that's too costly, consider getting a number of custom build SATA RAID5 NAS servers.
      We have 6Tb and 4Tb boxes using Maxtor 250Gb drives which have proved very reliable.

      We also have a Qulastar LTO2 tape robot, which could be a good solution if it fits your requirements.

  87. Near-line storage by Theovon · · Score: 1

    I've been thinking about something similar to this for a while. At work, it would be kinda nice to have some network storage that is more reliable than it is fast. It would be there for files that we use but which we don't use frequently enough to warrant expensive disks.

    One idea I had was to scarf up a bunch of cheap 9-gig SCSI drives from one of the local computer fairs and RAID them together. But I'm not sure if that's a good idea.

  88. Bad idea by Anonymous Coward · · Score: 5, Informative

    Google stores data for fast access, not for reliable storage. They don't care if they lose a few hundred gigs when a handful of disks die, they'll just re-spider it in a few days when the Googlebot hits the sites which were lost. Their solution is NOT optimized for reliable storage and it's not suited in the slightest to this guy's problem.

    1. Re:Bad idea by Anonymous Coward · · Score: 4, Informative

      this is incorrect. GFS (Google File System) has many systems with the same data on each node. These nodes have 3 copies of each data slice. If one server fails then the other two mirrors re-copy the data.. If two fail then the server mirrors the data to ensure it is never lost.

      google does not want ANY data to be lost. The have many mirrors of all data.

  89. Part of "fast" is "random access." by dbirchall · · Score: 1

    And that's what tape drives really aren't. If you want the last file on the tape, you have to wait for the drive to seek past everything else, and that's a real pain.

  90. VHS Tapes by nurb432 · · Score: 1

    Once apon a time there were data adaptors that let you record data onto VHS tape.

    It was pretty much like a TAR file in that it was NOT addressable, but made for huge/cheap long term backup storage.

    Donno if they still exist or not, but it would still be a good product. If they dont exist any longer, perhaps it would be good to re-create the devices.

    --
    ---- Booth was a patriot ----
  91. can you reduce the accuracy, make it polynomial? by Anonymous Coward · · Score: 0

    can you do a curve fit to some time periods and then just store the coefficients? it will be pretty close.
    e.g. do a curve fit on each days data using e.g. LOWESS.

    it will be reasonably accurate, and the amount of data will be drastically reduced.

  92. Re:options options, what is your time and data wor by Anonymous Coward · · Score: 0

    i wonder if there's a way to do 'RAID' across DVD's so that you can use cheap ones and not worry so much.

  93. two words... by xirtam_work · · Score: 1

    Xserve RAID.

    Yep, you read that correctly. Apple's Xserve RAID product is cheaper bang for buck than virtually any other solution from any other manufactuer, plus it works with Solaris, Linux, Windows, Mac OS X and more.

    You can use regular ATA drives and administer it from an easy to use interface either on a Mac or a web interface.

    Also check out Xsan on Apple's website.

    enjoy!

    1. Re:two words... by op00to · · Score: 1

      What the hell is "bang for buck"? The ability to store more "!"'s per dollar than any other solution from any other "manufactuer"?

      Jeez, mac people really are dangerous.

  94. 1 TB USB/Firewire Drives by FsG · · Score: 1

    I've been using one of these 1 TB USB/firewire drives. It's a wonderful thing; entirely self-contained, with no cluster to manage or worry about. USB allows for 127 devices, so you should be able to acquire as much hard drive space as you need. They can be easily unplugged and stored, too.

    --
    I made a PHP/MySQL library that prevents SQL injection & makes coding easier!
  95. Have you tried Iomega Rev? by mac123 · · Score: 1

    Have you tried the Iomega Rev?

    35GB Native capacity
    Up to 90GB with compression
    Hard disk speeds
    ATAPI and USB interfaces
    Good stuff

    1. Re:Have you tried Iomega Rev? by Anonymous Coward · · Score: 0

      Click.. click.. clickety.. clickety.. Ah sh*$**%

  96. Tape is your only option by billybob · · Score: 1

    I havent seen any posts that took into account that (it sounds like anyways) he needs to store this data permanently. He has a 1.6TB cluster, but if he has 1TB of data per month, that's not doing him good.

    Normally I would recommend a RAID, but at that store capacity and taking into account you need to keep your data permanently, that's not going to work as you will continuously need to be adding more disks. That would be a minimum of 4 new hard drives per month (assuming they make 500GB ones, I think they do, if its less than change that to 6 HD's/month).

    Many posts have pointed out tape backup as one of several solutions, however with your requirements I dont see how you could do anything else, if your time is worth anything. The other option is basically DVD's but that'd be 200 DVD's a month you'd have to burn. Fun fun...

    Look to some of the other posts for good recommendations on particular tape drives to look for and their costs, there's plenty of them here. :)

    --
    Joseph?
  97. Just buy 2 large disks by FonkiE · · Score: 1

    I'm planning to use 2 large disks. e.g. 400GB. One in my desktop, the second one in my server.

    Both disks get 2 partitions á 200GB. The first partition may be divided into more parts, but the last one is 200GB.

    Then I'll use (intelligent == diff) rsync to copy the server partitons into directories of the last client partiton and vice versa.

    It's very unlikely that 2 harddisks in 2 differnt computers die the same day - even at a lightning event - but then again you should use cheap extra fuses for that. (I had lightning and high voltage damage, power supplies and mainboards died, but no harddisk ever.)

    If you need protection from fire, you have to place the second disk offsite, or use a firewire/usb 2.0 disk and store it somewhere else.

    You can even use this in additon to the client/server 'mirror' described above.

    I own a 12GB DDS3 streamer and worked with DLT, don't throw away the money. Harddisks are faster and the backups are easier to access. DVD's are too small...

    The rsync version has a very huge advantage: I can easily access each *file* directly.

    I'm using a such a backup with firewire disk and 'tar' for additional compression on another site. We had another 12GB streamer before (1000 EUR then, 550 EUR now, and 15 EUR per tape - now and then :) and it was always a mess until the streamer died after 3 years (weekly use).

    Cross-mirror Advantages:

    1) fast backups
    2) relatively cheap (2 x 360 EUR)
    3) quiet (I'll throw out *all* older/smaller hds)
    4) direkt fast access
    5) daily cronjob

    Cross-mirror Disadvantages:

    1) just *one* one day old snapshot
    2) no automatic raid mirror, where you don't have to do anything

    Comments on the disadvantages:

    ad 1) use the -b flag on backup, so nothing will be deleted, just moved out of the way and then use the -delete flag from time to time, or clean up by hand.

    ad 2) having the disks in the same computer is dangerous for backups, considering high voltage etc.

    You can even do that with more than two computers. Put a second 400GB disk into your server just for backups ;-) Then the layout becomes more centralized ...

  98. your government provides this as a free service by leehwtsohg · · Score: 1

    Send it to an e-mail account in europe, with a few interesting words added (I'm not sure that's even neccessary). Delete it at the destination with an automated script. Retrieve it a couple of years later using the freedom of information act.

  99. We know what you do for a living! by psyconaut · · Score: 1

    Data mining for al Qaeda, right?

    -psy ;-)

  100. Firewire hub with hardware RAID by swb · · Score: 2, Interesting

    I'd love to see a Firewire hub that could act as a hardware RAID controller. A program on the computer would enable management of the RAID controller, and once formatted, the logical volumes would be presented to the host computer as standard disk volumes, eliminating the need for any special drivers on the host computer, as well as enabling the entire array to be portable to other platforms.

    How expensive could something like this really be? $300-400 at most, I'd have to guess considering what most places are charging for SATA RAID cards.

    1. Re:Firewire hub with hardware RAID by Anonymous Coward · · Score: 0

      Ultimately, as with most hardware IDE and SCSI RAID systems, software RAID would probably be faster in the long run, and not nearly as expensive.

      Firewire with a hub and some software RAID sounds like a very nice thing, but you'd have to keep the unit as a whole all of the time, which kind'a destroys any benefit of firewire.

    2. Re:Firewire hub with hardware RAID by rhizome · · Score: 2, Informative

      > I'd love to see a Firewire hub that could act as a hardware RAID controller.

      Firewire drives can be daisychained, and in fact OS X allows you to set up software RAID on multiple firewire drives attached to the system. You can't move them to another system and get access, but that's about the only limitation that I've found and it's more than decent for local high-density storage..

      --
      When I was a kid, we only had one Darth.
  101. I'm not a "Common Man" then by macdaddy · · Score: 2, Informative
    I have a dual MP2400 with 4 x 120GB WD 1200JB drives. I have a single XP2800 machine with 4 x 120GB WD 1200JB drives, 2 x 200GB Maxtor 6Y200P0, and 2 x maxtor 7Y250P0 drives. I have a dual Xeon 2.8Ghz machine with 4 x 120GB Maxtor 6Y120M0 drives. That accounts for all my regularly used machines. I guess I'm not a common man. :-) Not to brag...

    I have to disagree with the sister system though. For most geeks like you and I a sister system would be fairly adequate. It would be better with an occasional off-site backup. However it really sounds like this guy's data is far too valuable to have only one copy of it and to have all copies be at one physical location. He really needs an off-site backup somewhere. Imagine for a moment if his home (I'm guessing he works from home, but this still applies to a real store-front business) was robbed. The crooks didn't know what they were taking. They saw two shiny computers in an office and figured they could hawk them on the street. There goes all his data, both copies. D'oh! So in short a sister system is a good idea but it probably won't do this guy much of any good. It would be a good local solution for a short term live mirror (ie, data is archived that night but the sister machine gives you a backup for that one day's work).

    1. Re:I'm not a "Common Man" then by macdaddy · · Score: 1

      Meat to say "have only one backup copy of it" instead of "have only one copy of it". My bad.

    2. Re:I'm not a "Common Man" then by big+ben+bullet · · Score: 1

      erh... what do you need those for???

      and why is your user name MACdaddy? Or did you forget to mention your dual-G5 system with 2x200Gb and your 17" PowerBook containing 120 Gb

    3. Re:I'm not a "Common Man" then by macdaddy · · Score: 1

      One is my desktop. One is my co-lo server. The other is my home server. I don't have a reasonably new Mac unfortunately. It's on my purchase list after a few more revisions of the G5. I have a PC on my desktop simply because of purchase cost at the time I needed a new machine. I may be the Macdaddy but all my servers run Linux. :-)

  102. Frys by mboverload · · Score: 1
    Frys has 250 GB drives today and tomorrow. I am picking up 4 to make a 1 terabyte (well its not 1024 GB but who gives a damn?) Linux server to host my bittorrent movies and games. I am using about 7 GB a day =)

    Bittorrent rules!

  103. I/O use with [S]ATA, ScSI, Firewire and USB2 by rsd · · Score: 1

    Can someone point me differences in use of HDD related to file/system I/O?

    I know that SCSI drivers are faster than IDE.

    But why does a IDE driver makes the CPU/system go down to its knee?

    IDE being slower than SCSI shouldn't hung the processor while waiting for the data.
    The process will be BLOCKED longer but why the CPU hanging?

    Can someone point to me a good explanation to what's going on?

    I have tested some SATA drives (Seagates and Maxtors) with Silicon Image and NVidia chipsets.
    In all cases I saw no difference to ATA drives.

    Also how is Firewire and USB2 compared to [S]ATA drives related to performance/IO?

    1. Re:I/O use with [S]ATA, ScSI, Firewire and USB2 by HellKnite · · Score: 1

      This is part of the advantage of SCSI. A lot of processing is moved away from the main CPU and is on the SCSI controller itself. This is why SCSI is so much more expensive than the ATA based standards - there is a lot more going on in a SCSI chain, requiring a lot more circuitry. Add to that the fact that most SCSI controllers go beyond basic SCSI connectivity and add some form of RAID, assuming they're designed for HD connectivity... SCSI also accomodates many other things that I've never seen on ATA - tape backup units, robots in tape libraries, and scanners are a few of the things that come to mind.

    2. Re:I/O use with [S]ATA, ScSI, Firewire and USB2 by rsd · · Score: 1

      But what kind of extra processing does the controller/disk do that aliviates so much the I/O?

      Does firewire/USB2 HDs have the same problem as ATA interface/drive?

  104. Mod Article (-1, Retard) by Anonymous Coward · · Score: 0

    My work depends on generating, storing and analyzing a large amount of data -- averaging about 1 TB per month.

    Then you should be *depending* on a quality storage solution (such as from Sun, SGI or EMC) rather than asking advice from a bunch of Linux using nerds whose place in the planet's pecking order is listed immediately under "Maggot".

  105. reasons why tape is increasingly unpopular by alizard · · Score: 1
    I've had 2 occasions to have to attempt to recover completely from tape using prosumer tape drives, a 400 meg QIC and later, a Sony Superstation.

    Basically, I got 95% of my data back and had to reinstall the OS to get the computer running the first time. I then wound up going home and getting my old computer back. I was going to Holland and figured just taking the tapes and the tape drive would work well enough. It didn't.

    The second time, most of the tape volume didn't show in the restore, and after frantic e-mail to customer service, I got an e-mail saying that there was a known issue with the recovery software, they no longer supported it, and the best I could do would be to go to the vendor they'd gotten the the software package from and download the demo. I think I only lost 3-4% of my data.

    I try not to make the same mistake 3x in a row. My next computer upgrade included a mobile rack and a hard drive, my day-to-day stuff I mirror. For long-term/archival stuff, I do a zip-compressed DVD backup set once a month or so.

    I actually have a couple of 8mm tape drives, either of which are big enough for this workstation. Never even bothered to plug them in.

    Certainly, lots of commercial shops use tape. That doesn't mean you should or they should.

    Though I'd be willing to take a look at the Ecris packet writing tecnology or the LTO standard stuff.

    Anybody know of a cheap automated DVD-R? I mean... put in a pile of blank DVD-Rs, get a pile of burned DVD-Rs out?

  106. Could you use hardware raid ? by Brane2 · · Score: 1

    Like, for example 3Ware's 8506-12 ?

    It can accomodate up to 12 SATA drives in to a few RAID groups, so you can have for example two RAID-5 groups with two spare drives.

    When backup is needed, you could unmount one group of drives, unplug it and plug in & mount another.

    For further convenience, you could have drives already boxed in gorup of five in handy enclosures, so all you would heve to do is just reconnect 10 cables once per month.

    To be safe(r), label the connectors and each drive in the group clearly (1-5 or A to E etc) to avoid
    mess when accessing data in the future...

    Hard drives are not so expensive, neither is a solid hardware RAID card, or a few extra (S)ATA cards if you can't afford hardware RAID.

    Just watch where your archive is and your data should be relatively safe...

    But in essence I guess it all depends on your real needs.
    Does someone's life depend on that data ?
    Is all that data equally valuable or is there some core portion that is better not to loose and other part that could eventually be lost ?

  107. Cheap harddrives would make a good backup system by slashname3 · · Score: 1

    I just saw an add for 250GB harddrives at Compusa for $129.99 a pop. Combine that with an inexpensive system (say for $400) and you could backup a terabyte of data each month for less than $1000 dollars a month. Just add another box each month and move the older one off site. Setup a an encrypted VPN to the offsite location and you have instant access to the backups.

    If you don't need instant access then you can run the data off to DVDs at some point and reuse the harddrives and system.

  108. What are your access needs? by davidwr · · Score: 1
    You'll have at least 3 "sets" of data:

    1) stuff you are likely to access in the next some-period-of-time

    2) stuff you might need to access in the next some-period-of-time

    3) stuff you are keeping around just in case of disaster, legal requirement, or silly request by the CEO.

    #1 is on live hard disks.

    #3 can be on any medium, and will probably be stored off-site. Not all organizations have a #3.

    #2 is the interesting question.

    You are using DVDs and want something that's less of a hassle w/o costing a ton of money.

    I've seen several specific recommendations. I'm going to give some general ones:

    live disks, but removable

    a server farm, presumably using less-than-the-fastest drives and equipment, removable drives

    fiberchannel or iSCSI are options

    a robotic tape library system

    a robotic dvd-burning library system

    IDE drives are running at under $500/TB on sale, so I'm sure you could get them at that price in bulk.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  109. tape drive? by kazem · · Score: 1

    I've never used them, but have you considered tape drives? HP makes good tape drives. They can store 3 GB on a single tape and store it at a rate of 80 GB/hour.

    It's good for archiving. So you can't exactly rapidly access the stuff again later. I assume if you wanted it back you'd rip the whole tape or large parts of it to your disk(s) and view it that way.

    Computers can be used for something other than porn or computer games. It's called work. Often of the scientific type.

  110. Re:options options, what is your time and data wor by swb · · Score: 1

    I always wondered about that way back in the days of multi-floppy spanned ZIP files that crapped out on a single disk -- why not parity info in the zip file so that you could lose a segment of the zip file (one or more floppies) so that a burned out floppy wouldn't cause a problem.

    Your suggestion would either imply writing DVDs such that the parity was part of the filesystem on the DVD itself, or containerizing the data (like a disk image) so that the "file" on the DVD had parity info in it.

    Either way, you'd need a big jukebox capable of mounting a set of DVDs at one time to accomodate the parity info. If you assume 14 DVDs readers, it's probably too small an amount of data to make it worthwhile, unless you had big HDD(s) in the jukebox and internal logic to rebuild the RAID set into a single logical file off of multiple DVDs.

  111. use VISUALLY LOSSLESS compression by Anonymous Coward · · Score: 1, Insightful

    I can't image his data sets to be non-visual data (unless he's CERN doing particle physics). In that case, why doesn't he use some compression that's tuned to the human visual system; what the compression algorithm throws out he wouldn't have seen anyway. There are a bunch of wavelet based algorithms (much better than blocky MPEG-2 which needs interframe compression anyway) which are extremely good, he should see 30:1 ratios without seeing any "loss" or artifacts.

    This is what the studios are moving to. A typical feature film scanned at 2K takes 1.5 terabytes uncompressed. These files then tie up multi-million dollar telecine machines, so they need to move them off fast. (Imagine backing up 1.5 TB onto DLT every day). With scanning going to 4K this problem will get even worse.

  112. Some things to consider by ameoba · · Score: 1

    Looking at the link attached to the poster's name, it appears he's doing some sort of bioinformatics work, as such he should probably have some grant money and/or VC funding supporting this work. As such, 'cheap' is relative; it's always easier to spend Other People's Money.

    Secondly, Is all the data -really- worth keeping around? For how long? I can believe a TB/mo of data but have a hard time imagining of it all being something you'd want to use 6mo down the road; only saving things that are important might make the job a lot easier.

    On another point - you can probably compress this data down significantly; multi-hundred-megabyte files consisting of nothing but 'A', 'T', 'G' and 'C' should be able to compress down by a factor of at least 4. Both Windows and Linux allow you to automatically compress data on filesystems which would make a large dent in your storage requirements.

    --
    my sig's at the bottom of the page.
  113. Do It Yourself CD Changer by Darth+Yoshi · · Score: 2, Interesting
    But long term storage is painful -- DVDs cost about $0.10-$0.15/GB but takes too much human time...

    Do It Yourself CD Changer

    --
    // TODO: fix sig
  114. at $1,200/TB by davidwr · · Score: 1
    That's a bit pricy. Yes, it's external, but unless you want to pay for convenience, you can have similar performance for less money elsewhere.

    Then again, w/in 18 months you should get twice the storage for the money.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  115. I’m a time traveler stuck here in 2004. by OneDeeTenTee · · Score: 0

    Hello,

    I'm a time traveler stuck here in 2004. Upon arriving here my dimensional warp generator stopped working. I trusted a company here by the name of LLC Lasers to repair my Generation 3 52 4350A watch unit, and they fled on me. I am going to need a new DWG unit, preferably the rechargeable AMD wrist watch model with the GRC79 induction motor, four I80200 warp stabilizers, 512GB of SRAM and the menu driven GUI with front panel XID display.

    --
    Stop the world; I need to get off.
  116. smaller than a DVD by davidwr · · Score: 1

    I remember those, I think they stored under 10GB, certainly under 20.

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
  117. Ahh it was a joke! by Anonymous Coward · · Score: 0

    I thought he was being serious.

  118. Ask the CEO of google by wheels4u · · Score: 0

    1 000 000 users x 1 024 megabytes = 1,024 petabytes

    --
    11 1101 1011111 0100 000 110 1011111 0101 10 01 1011111 101 1 011 1011111 0 1111 11 111 1011111 101
  119. Watch out for pineapples, though... by billstewart · · Score: 1
    Don't say I didn't warn you.

    ...... Spongedrive Squarepants....

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  120. I think it's 0.05TB, not 0.5TB by davidwr · · Score: 1

    The next-gen DVDs are all in the order of 15-50GB. Sure would be nice if it were 0.5TB though, wouldn't it :).

    --
    Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.
    1. Re:I think it's 0.05TB, not 0.5TB by shadowkoder · · Score: 1

      Awww ... It woulda been cool if it was/will, but that's my bad.

  121. Reconsider some of your statements by laing · · Score: 1

    If your data is valuable to you, you should seriously consider using RAID. The first drive failure you have (and you will have many) will cost you more in lost data, down-time, and labor than a 3ware RAID controller would have cost. I've used both their IDE and SATA controllers and they're great. I also went through 1.5 TB worth of 250GB SATA drives in about 18 months due to failures. I'm glad I selected the WD drives with their 3 year warranty.

    For off-line data storage, consider using large hard drives and a hot-swap bay. There's nothing cheaper, faster, or of higher density.

  122. Re:Ask the idiot who posted the parent by Lord+Bitman · · Score: 1

    Do you think google expects 1,000,000 users to use 1GB within a year? They offer that because they know most people arent going to use it, and that by the time that isnt true anymore there will be new technologies available.

    --
    -- 'The' Lord and Master Bitman On High, Master Of All
  123. You're NOT using RAID??!!!! by swordgeek · · Score: 2, Insightful

    Let me get this straight: You have a four-node cluster, you have 1.6TB of online storage, and you need some sort of permanence; and you're not using RAID of any form?

    This is utter insanity! Without RAID, your only hope of safety is in your backups--which you're only asking about now!

    RAID your data ASAP, and then start looking for backup systems. Take a look at some of the DLT4000 replacements.

    --

    "People who do stupid things with hazardous materials often die." -- Jim Davidson on alt.folklore.urban
  124. Another perspective by v1 · · Score: 2, Interesting


    Just tossing out another point of view - similar but different than some of the others previously discussed. First off, examine the data you are keeping - do you really need that much? Nowadays it's common to be able to acquire data faster than it can be processed, and if you never stop gathering data, well, you never will catch up, only fall farther and farther behind.

    If you DO need this data, and you are going to need it for awhile, (year or more) I'd recommend cheap HDs. They also have an advantage of being easily catalogged, and are untouchable when compared with access time of tapes. Don't go raid5 though, this is not "catastrophy-proof". (flood, fire, tornado, etc) For catastrophy protection, mirror your drives. When you have them loaded up with data, pull the FW cables and swap drives in the enclosures with fresh empty drives. Label them well, and then take each half of the mirror to DIFFERENT LOCATIONS. It's OK to keep one set on-site, but the other set must be somewhere else, preferably in another zip code. This will allow you near instant access to your data (since it's onsite), will protect your data from mechanical failure (through mirroring) and will protect you against catastrophy. (you WILL need to acquire new firewire boxes etc if your office gets leveled... don't forget this detail in general - the data is of no value if you lack the equipment (tape drives etc) to read it back in with) I know you can get compression and fit more on a tape etc by using archiving software, but it may be worth the extra cost to obey the KISS rule and just simply drag and drop the data to the formatted HDs. This will make data recovery MUCH SIMPLER, and if there are errors on the HD when you need to recover, this will insure you can actually recover most/all of the information. Archive streams and tapes are notorious for losing 100% of the data that follows a corruption point in the stream.

    Once you know you no longer need a specific set, drop it back into the pool of usable drives. Buy them by the case, it's much cheaper this way. It also is advisable to buy the same make/model every time you have to get more drives, even if there are newer, larger, cheaper models out, because having all the same drives means one less complication to worry about in times of crisis.

    --
    I work for the Department of Redundancy Department.
  125. you need by confused+one · · Score: 1
    based on what I saw of your website and the fact that you ARE dealing with 1TB a month, I'm going to assume you're doing serious professional work. Given that that is true, you need a professional solution to guard your data. Don't take any shortcuts, your data is your business, isn't it? Why would you risk your business on half-assed hack solutions.

    You NEED a serious raid array. Raid 5 or possibly even Raid 50. You NEED to implement snapshots on your dev space. You NEED a serious tape drive, probably a robotic system.

  126. 100+ Gb by nurb432 · · Score: 1

    This was my recollection, but its been years so that may not be right...

    But 10gb doesn't sound right to me....

    They were ungodly slow.. and of course no random access ability gave them a small market.. so they fizzled out..

    But a great idea, cheap archiving, back when a 20 mb QIC tape was priced beyond mortal men....

    --
    ---- Booth was a patriot ----
    1. Re:100+ Gb by Anonymous Coward · · Score: 0

      well back then 10GB felt like 10TB now adays

  127. Re:Bulk Storage For the Common Man by espiritu · · Score: 1
    I was checking out a trade rag, and saw an ad for http://www.strongholdstorage.ca...currently it redirects to the parent company page http://www.corenetworksolutions.ca, so i called them, and was told it is a new service that should be up and running by the end of the week

    Hope this helps...
    Cheers!

  128. Re:age old problem... VXA2 by Anonymous Coward · · Score: 1, Informative

    VXA2 has a much higher storage capacity at a better price point than DLT. For example, a 10 tape VXA2 autoloader is about $2300 USD and holds 1.7 or 1.9TB compressed.

  129. PaperDisk, anyone? by TLSPRWR · · Score: 1

    Stock up on printer paper and ink cartridges.

    Come to think of it, you may want some filing cabinets as well (Only ~1.5 mb per 8.5x11 sheet).

    1. Re:PaperDisk, anyone? by wheels4u · · Score: 0

      It has the adventage of being double sided.

      --
      11 1101 1011111 0100 000 110 1011111 0101 10 01 1011111 101 1 011 1011111 0 1111 11 111 1011111 101
    2. Re:PaperDisk, anyone? by Jack+Schitt · · Score: 1

      What would you get with PaperDisk + MicroFische?

      I seem to remember that a MicroFische cassette will hold about 10,000 (single-sidded) pages and they do have twain compatible MicroFische readers.

      (For those of you who don't know, MicroFische is an analog optical system of storing dcouments. Think automated microfilm on a reel)

      --
      This message brought to you by Jack Schitt's Previously Shat Shit
  130. This is why I hate Ask Slashdot by Propaganda13 · · Score: 3, Insightful

    Not enough feedback or information!

    OK, 1TB/month that doesn't say much.
    Always look at different levels of case scenarios and work from there. I usually start with loss of building by fire and work down through limited hardware failure or data corruption.

    There are several factors that determine how often you should backup. Here's just a couple of questions to answer.

    How much is the data worth?
    How much is your time worth? If you lost a day or week of processing time.
    Is your work time dependent? (deadlines)
    If you lost the data, did you lose the data completely or just lost processing/analyzing time on the data that you can get from your clients again?

    How long do you have to store the data, and have it retreivable? One month compared to several years really changes your options.

    How financially responsible are you for the data?

    Multiple backups(daily, weekly, monthly)(full and incremental) in multiple locations are key to a successful backups.
    Raid is for redundacy or performance not backups.

  131. well.. by MasTRE · · Score: 1

    > RAID is a possibility, but it increases the cost significantly.

    Software RAID1 would only double your cost, and let you sleep at night.

    --
    Must-not-watch TV!
  132. Re:Exabyte is crap by Anonymous Coward · · Score: 0

    Sent back two of them as they died quickly. They don't work with Adaptec 2940's.. even tho they "see" them momentarily. You gotta get the latest (3000? series).

  133. Cheap USB to IDE adapters by Anonymous Coward · · Score: 1, Interesting

    Get a pile of hard drives, and a pile of these:

    USB to IDE cable

  134. KISS: dot bomb RAID or JBOD on ebay by potus98 · · Score: 1

    While this certainly isn't a very original idea, I'm still amazed at what I can get on ebay. Yes, it's cliche, but I often forget about it when I start soliving problems similar to these. I get all engineerie and over analyze the challenge.

    Large chunks of disk-based storage can be found on the cheap. With the advancements in software (read OS with Linux) based RAID, even JBOD's would work well.

    1-2 TB ain't what it used to be!

    --
    This one gang kept wanting me to join cause I'm pretty good with a bo staff.
  135. Highpoint external SATA by Anonymous Coward · · Score: 0

    Instead of firewire or USB get Highpoint SATA. It's faster and about the same price.

  136. Check out... by La+Camiseta · · Score: 1

    Ibrix has a really good clustered FS setup. Plus you can just plug in more systems when you want them.

    Another thing that you may want to check out something liket GPFS if you want to build your own filesystem cluster.

  137. Tape reliability... DLT by bani · · Score: 1

    DLT and AIT are far more reliable than hard drives.

    Unlike hard drives, I don't worry about losing all my data if I drop a DLT on the floor. You can practically run over the DLT cartridges with a tank and not worry about data loss.

    DLT and AIT are specifically designed to last a long time and avoid tape wear.

    You can pick up a 35gb DLT IV drive off ebay for ~250 and a stack of tapes for ~100. Turn off the built-in drive hardware compression, run your data through bzip2, and sleep soundly knowing your DLTs are far more reliable backup than any hard drive.

    Don't trust your important data to DAT -- the rube goldberg tape mechanism means many munched tapes. :-(

  138. What about aggregate speed? by Bananas · · Score: 2, Informative
    Something I've noticed in all posts, the price is a prime factor, but the original poster also seems concerned about access times (hence DVDs not being an option, due to the time it takes to retrieve data).

    For simplicity, I'm not going to go into RAID tradeoffs, etc. and just stick with "striped data", which gives you maximum bang for the buck. You should draw up a simple spreadsheet with the following headings:

    1. Media Type: Enter the type of media, it's manufacturer, etc. here. Example: HDD, Hitachi, Model XYZ, 320Gb.
    2. Size: How much raw data the medium will store. All figures should be in Gb.
    3. Speed: The expected data transfer rate of a single unit of storage. IDE drives vary and can range from 5mb to 30mb/sec, tape also ranges. All figures in Mb/sec.
    4. Watts per Unit: how much power does each media unit draw? Tape drives will be difficult here, but HDD units are typically around 20-30 watts. Go conservative and plan on allocations of 30 watts for HDD units.
    5. Cost per Unit: How much does it cost for 1 HDD/Tape/whatever?
    6. Cost per Gb: [Size] divided by [Cost].
    7. Units Needed: Given a target of 1024 Gb (1 Terabyte), how many units of storage are needed to reach that size, assuming data striping and no RAID-5? Forumula is '1024' divided by [Size]. then round all decimals up to the nearest whole number.
    8. Expected Size: Take [Units Needed] times [Size]. If you have 4 units of 320, you'll end up with 1280 Gb (sans redundancy).
    9. Total Cost: cost of non-redundant array. [Cost Per Unit] times [Units Needed].
    10. Aggregate Speed: Assuming a 1:1 ratio of controllers to units, what kind of speed can we expect? [ Units Needed ] times [Speed]. All figures in Megabytes. Note: a huge array of 1+ TB can be made unusable if you can only process 10 megabytes a second.
    11. Power Consumed: [ Units Needed ] times [ Watts per Unit ]. Important - your power supply should be rated at about 120% of this figure to make everything work reliably. Also, if you're going above 400-500 watts, then plan on some additional cooling - there will be an increase in BTU's

    It's not exactly a great spreadsheet layout, but it should be enough to enter everything in and start seeing what is practical and what isn't. I'm sure that someone else would be able to enhance this a little further - any takers?

    By the way, you really should think about RAID-5 at the very least. All it would take is just one drive to hose your data completely. Besides, as the array grows in size, the price tradeoff becomes smaller and smaller, to the point where it's really not worth your time to stripe all of your data without redundancy. I believe that the md drivers in linux support up to 32 devices per RAID set. That takes your overhead from 1/5 of your array (in a 5-drive setup) down to 1/32 of your array.

    A SAN-style setup lends itself well to this, but the price is very prohibitive to "the common man", as it requires very expensive hardware. You can emulate something like this via GFS support in Linux, which (theoretically) would allow you to aggregate your data.

    If there is a requirement to keep the data online at all times, you'll need to spend more on some PC cases, as well as some networking to string the units together. Pick a reasonably-priced case that will house all the media units, have adequate power (at least 250 Watts, 300+ would be ideal) and keep them cool. Use a motherboard that is reliable, and can adapt to several different clock speeds for a given CPU; you'll want something that can be thrown out for less than $99.00 if it should go bonkers on you, but if the CPU burns up, you should be able to still get parts off the shelf and get the Motherboard running again. Stick will the "commodity" or low-end CPUs, as (a) they tend to be cheaper, and (b) having been through a complete lifecycle, any bugs or issues with the CPUs will be well-known by now. Don't worry about the speed of the board or CPU at this point, as most "modern"

    1. Re:What about aggregate speed? by pe1chl · · Score: 1

      >HDD units are typically around 20-30 watts

      SCSI drives, maybe. Or older types of drives.

      A typical modern 300GB ATA drive runs at 10 watts in-use (and less when idle).

  139. Xraid by Phrack · · Score: 2, Insightful

    I haven't tried it myself, but Apple's Xraid appears to be gaining in popularity as a reasonably priced bulk data storage solution. It reportedly works with Linux, Windows, Netware and, of course, Macs.

    If that doesn't suit ya, and it's bulk storage without necessarily speed you're looking for, check into the ATABoy line from Nexsan.

    --
    Dump the IRS - http://www.fairtax.org
    1. Re:Xraid by Lukey+Boy · · Score: 2, Funny

      I do believe that this is the first time I've ever seen the word Apple used in the same sentence as "reasonably priced".

  140. Re:Cheap harddrives would make a good backup syste by slashname3 · · Score: 1

    Wow, ignorance gets demonstrated in such amazing ways.

  141. I can't get you cheap terabytes... by ConceptJunkie · · Score: 1

    ... I know where you can get a cheap eye.

    --
    You are in a maze of twisty little passages, all alike.
  142. Outsource it by bcjanes · · Score: 1

    Outsource it, pay 1000 people in an impoverished country to memmorize all of it !

    --
    Linux is unix training wheels, while BSD *is* unix.
  143. NAS Anyone? by Phredd · · Score: 1

    How about a gigE NAS solution here? You can get a 3TB (raw) NAS solution from EMC these days for $6k (it is an AX100). The question remains, what is your data worth?

    --
    Phredd - "I have found people tend to take you far less seriously once you start waving your genitals at them..."
  144. Mmmm, hard disks by Anonymous Coward · · Score: 1, Informative

    Well, not this is a soluation, but if you just have lots of money to blow. I just put some of this stuff in at work.

    I've got an EMC Clariion CX200 fibered to the servers, that means its a SANS. Its 3 TB. I think it was $40k. Then I've got a Win2k3 Appliance Server fibered to the EMC Clariion CX300 with 12 TB. It shares out through various filesystems ( nfs, smb, ftp etc.. ). So its' my NAS. The CX300 I think was about 100k. The SANS holds the online data, the NAS holds the archived data. Oh, and the NAS head is fibered to a Dell PowerVault 12TB LTO2 tape jukebox. Which backs up the NAS. It was pretty cheap, i think it was 18k or so.

    Oh yes, and just for the slashdot crowd. Yes, this equipment will be holding fMRI studies. As well as MRA, CT, DR, CR, CTA, US and variaty of other modalites. Being a PACS engineer is fun, although, I do make less then a teacher. Fucking Economy.

  145. You need DMF by csoto · · Score: 1

    SGI's Data Migration Facility. It works on IRIX, but maybe they have a Linux version now (or get a MIPS SGI on eBay). It migrates older files to tape automatically, or to another filesystem, if you happen to have a big huge array somewhere...

    --
    There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
    1. Re:You need DMF by csoto · · Score: 1

      Apparently, they offer DMF 2.8 for their Linux-based Altix line of servers, which aren't so horrible in and of themselves.

      --
      There exists no way of exchanging information without making judgments. --Bene Gesserit Axiom
  146. RAID okay for this task by StRex · · Score: 1

    Read the question the person asked again. They're looking for long term storage, not a backup. The redundancy of RAID provides some assurance that your data will be around for the long term.

    1. Re:RAID okay for this task by timmarhy · · Score: 1

      no i think you need the question again. "long term" implies keeping the data safe and offline, it certainly doesn't imply any need to have it instantly accessable. if he is running a business and living off this data, then he's a complete fool for not having a raid + backup of this data in the first place. what if theres a fire/spillage? whats his raid going to do for him then? ohh wait.. HE NEEDS OFFLINE BACKUP!

      --
      If you mod me down, I will become more powerful than you can imagine....
  147. Want your data to last a million years? by digitalgimpus · · Score: 1

    Try a stone tablet and a chissle.

    Proven to last a long time.

    Though quite labor intensive. Could employ slaves? ;-)

  148. How about compression? by Anonymous Coward · · Score: 1, Informative

    >>"Lately, I have been looking into different bulk data storage options available to a common man. My work depends on generating, storing and analyzing a large amount of data -- averaging about 1 TB per month. I would like to have a storage system which is automated, fast, reliable and most importantly does not cost the price of an eye. Right now, I have a 4 node Linux cluster with 10 large hard disks (total capacity 1.6 TB); data storage roughly costs about $0.60/GB (excluding the cost of PC hardware). But long term storage is painful -- DVDs cost about $0.10-$0.15/GB but takes too much human time and leaving data on hard disks makes me nervous because of possible failures. RAID is a possibility, but it increases the cost significantly. I was wondering, if Slashdot readers have any recommendations for a cheap automated way to store and retrieve data."

    First of all, is the 1TB of data that you collect every month mostly different or mostly the same as the data you collected in the past month?

    Secondly, how compressable is the data you are collecting?

    Thirdly, how much random access do you need to the data, or is a serial stream of the data good enough?

    --

    If the data you have is mostly the same from month to month then you only have to perserve the difference between 2 months

    If the data is highly compressable then you can use bzip2 -9 to make the data much smaller, therefore needing a lot less of the media than otherwise.

    A 1TB file that compresses 50 to 1 will only be 20 GB. This will easily fit on 5 DVD's.

    If you collect 1TB of data and diffing it with the previous months data outputs only 100 GB differences, and that compresses down 21 to 1 then you can fit it on a single DVD.

    rsync is also good for copying the data on a system to a remote system.

  149. Linux Software RAID by max+born · · Score: 1

    Your Linux cluster is the cheapest method I've found. I have one such TB box with 10 old/used/cheapo SCSI disks and using software RAID I built about 4 years ago. It acts as a backup server using rsync in a shell script that collects data from 15+ networked servers every night.

    Never had a problem with it.

    Guess that's what the "I" in RAID is for.

  150. storage needs? by dAzED1 · · Score: 1

    does the "common man" need 1.6TB?

    How about doing the raid bit with what you already have.

    1. Re:storage needs? by max+born · · Score: 1

      does the "common man" need 1.6TB?

      When the IBM introduced the first PC back in th 1980s, it was often described as a solution without a problem.

      If you could put 1.6TB on something the size of a postage stamp, the "comman man" would soon find a need for it.

    2. Re:storage needs? by dAzED1 · · Score: 1
      yes, the "common man," pushed heavily by MS's need for fresh money every year and also by the gaming industry, has pushed the PC to places we didn't expect 20 years ago.

      But today isn't 20 years ago - he has the storage today. A simple mirror would yield 800Gb, which is 20x more than most people have anyway. The only people I ever see with a full hard drive are those who have Just stop and think about 800Gb. 20 years ago, people could imagine things that would take up that much space. Today, the only thing the "common man" would need that much space for is HD videos...and if they're dealing with HDTV, they're not going to whining about the costs of RAID.

      Again - we're not talking about what he'll need in 20 years, or even what he'll need in 5 years - we're talking about what he needs today. If he doesn't want the expense of backup options (dvd would be cheap compared to the hassle and expense of getting a tape setup) and doesn't want to RAID, then he'll just have to accept data loss as a probability.

      I've installed very large clusters, and multitudes of sunfires...but I have no problem getting all my vital stuff onto a single cdr. Email archives, documents...those things haven't really changed much. Buy the damn music cd's, and the dvd's, and then you don't need to back them up (or just DL it again if you lose them). Applications? App installs shouldn't be in the 1.6Tb space anway, but even if they are then reinstall is always better.

      My point is that unless he's doing something illegal, is stupid, or both - a cdr (or at worst, a dvd) should handle everything without issues. Manage your data, figure out what is actually unique (documents, email archives, etc), and realize that they aren't really /that/ much bigger than they were 5 years ago. Plan backups appropriately.

  151. more about the app. by Anonymous Coward · · Score: 1, Informative

    I wish the poster told us more about the application. Is the data from sensors (someone mentioned FMRI), or is this from sort of simulation modeling?

    This might sound off topic, but I have used different compression schemes (like wavelets) for sensor data, and regenerative techniques with statistical sumeries that gave me me about 2000:1 equiv. compression. That still does not address the 1TB/month backup problem, but might help reduse the problem somewhat.

    As for backup. In the past I have chosen to go with a modified cluster solution, where I set up a data server in another building and auto-backuped up the new/unique data to it. The reason I chose slow/large HD's instead of tapes is my experience with using tapes over a course of 10 years. I cannot even get replacement tapes if I needed them not to mention the actual tape drives...

    How long do you realistically need to keep the data around for? Can you recycle the media (say after 1 to 2 years? Does it need to be truly perminate? ...

  152. What are your near- and long-term requirements by TBone · · Score: 4, Informative

    I looked through some of the answers here, and as near as I can tell, you've got a bunch of home hobbyists telling you how to back up your home computers. Perhaps all your needs entail is a computer with an external IDE drive array and 4-10 200G SATA drives in it. But from your initial post, it's not clear what you need your offline storage _for_.

    First of all, you mention that you generate and use 1G of data a month. What happens at the end of that month? Does all of the data become useless? Is some of it carried through? Is it useful for historical processing for some time after it's not "live" any more? The disposition of that offline data is important; you can't determine how you can most effectively back up your data until you know what you need to do with that data once it's backed up.

    Since no one cares about backing up old data that they never use any more, I'm going to assume you need this data in some form in the future. I'm also assuming that your data ages out completely every month.

    Realistically, you have two options: Large redundant disk arrays, or tape. Various factors give credence to one or the other.

    First of all, get off of the SATA hacks, and realize you're going to need to go to SCSI, whether you end up with disk or tape. You're backing up data, you're going ot want it to be reliably written out, and SCSI is the de facto standard for backup architecture. Yes, you pay more for it, but there's a reason for it: the SCSI equipment I manage at work fails a fraction of the percentage of time that the various IDE/ATA systems fail. While SATA is marketed as a consumer technology, it will never meet the rigors of being a reliable backup methodology.

    • Media Cost: Tape wins over disk here. LTO tape is running, at a quick check, for about $75 retail for 200/100G tapes. Even assuming only reasonable compression, you're looking at 150G for $75 bucks. And that is single-cart pricing; tape pricing quickly drops if you're ordering in bulk (typically in packs of 10, then at the 3-packs level, then more, check with your preferred media vendor)
    • Hardware Cost: Disk wins, but it's a double-edged sword - every disk you own has electrical and mechanical failure chances. The more disks you have, the more likely you are to lose one of them. The more you're storing on disk, the more you open yourself to a catastrophic failure of those disks themselves. High-end fast tape drives and libraries are expensive, but they just _work_. You plug them in, load your preferred tape management software (hell, run mtx for that matter), and start backing stuff up. No formatting, settings up arrays, hot-swap schedules, anything like that. But you pay through the nose for it - expect to spend into the $10K range for a large-scale tape storage solution that you could match (in short-term storage duration) for a couple of thousand dollars for a disk-based solution.
    • Hosting Space: Try to store 10TB of disk, and you'll need an air conditioner in that room just to cool down the disk cabinet and controllers. 10TB of tape just sits there though; you can store 4TB of tape online in a small 3U (about 6 inches) tape library - that's 24 tapes, and such libraries typically also support two drives. Go to 5-6U, and you can get 4 drives and over 50 tapes. If those were 200GB LTO tapes, you'd be looking at up to 10TB of storage available online, or easily offline and off-siteable. In addition, tape is easily expandable. Need more storage space? Buy another tape. No new hardware needed, no power concerns, just drop it in the drive or library and go.
    • Speed: Disk definitely has an edge. Set up an decent SCSI RAID5 array (real hardware raid across multiple disks on separate physical controllers, not this playtime software 0+1 homebrew IDE raid crap) and watch your write speeds triple. If you need to back up that 1 TB overnight, you don't have much of a choice but to go to disk in some form. But again, you pay a price for it. The speed you save in the
    --

    This space for rent. Call 1-800-STEAK4U

    1. Re:What are your near- and long-term requirements by Anonymous Coward · · Score: 0

      Sigh... yet another IT graybeard perpetuating the myth that SCSI is the only thing that works. Did you miss the title of the story? "for the common man" - not "for the IT manager with an institutional budget at his command". 3Ware and others have excellent SATA RAID solutions. Many manufacturers offer 5-year warranty continuous duty cycle SATA drives - you just don't want to pick the wrong ones when buying your hardware.

    2. Re:What are your near- and long-term requirements by pe1chl · · Score: 1

      >First of all, get off of the SATA hacks, and realize you're going to need to go to SCSI

      You must have missed the fact that per-unit capacity for SATA drives is significantly higher than for SCSI drives.
      And then I'm not even talking about per-megabyte cost of SCSI storage when compared to SATA storage...

      It seems like you are using a hammer, and now see every problem as a nail.

    3. Re:What are your near- and long-term requirements by 0x0d0a · · Score: 1

      First of all, get off of the SATA hacks, and realize you're going to need to go to SCSI, whether you end up with disk or tape. You're backing up data, you're going ot want it to be reliably written out, and SCSI is the de facto standard for backup architecture. Yes, you pay more for it, but there's a reason for it: the SCSI equipment I manage at work fails a fraction of the percentage of time that the various IDE/ATA systems fail. While SATA is marketed as a consumer technology, it will never meet the rigors of being a reliable backup methodology.

      The SCSI/SATA/PATA debate is silly, and (no offense to you, as I have no idea of your particular situation) often perpetuated by people who make purchasing decisions but don't know enough to evaulate their systems reasonably.

      None of the three standards is inherently less reliable than the others. Many SCSI drives push density limits much less than ATA drives, though this is not a constant. In addition, because SCSI drives are often used in high-load environments, they often have higher rotational speeds and operating temperatures (and sometimes even require active cooling systems). These are negative reliability factors -- heat is a huge factor in drive data reliability.

      It's not too hard to get a general feel of how reliable a drive is. Take a look at how high the density is, as this is the biggest factor -- drives of a particular generation with low density are likely to be more reliable. Unless you are working with a rack that absolutely will not allow drives in it to undergo impact, rated impact can be significant, as desktops and towers can get knocked about (note that notebook-style 2.5" form-factor drives can generally handle about an order-of-magnitude increase in impact over desktop-style 3.5" drives -- Seagate, for instance, makes the "Savvio" line, an enterprise-clas 2.5" series). Look at rated (while-operating -- non-operating means little) impacts. Operating temperatures should be as hot as possible -- the bigger the margin between you and the maximum operating temperature, the better.

    4. Re:What are your near- and long-term requirements by cowbutt · · Score: 1
      None of the three standards [SCSI/SATA/PATA] is inherently less reliable than the others.

      Technically, that statement is correct, of course, but in practice the interface you choose determines the quality of mechanism you can buy with it; SCSI drives are engineered with different cost/reliability/longevity assumptions and tested individually, ATA drives are batch-tested. For more details from the horse's mouth, see This paper from Seagate on the differences.

      As it happens, I use quality PATA drives (WD Special Edition, right now) in all my personal systems, so I'm not even a SCSI bigot ;-). I chose WD SEs because the price/performance/reliability was right for me and they are one of the few ranges of ATA drives that still come with a 3 year warranty. Most ATA drives only come with 1 year (especially OEM models) and most SCSI drives come with 5 years.

      --

  153. Beam It Up, Scottie by Bob+Munck · · Score: 1
    You didn't say when you'd want to retrieve the data or how much you'd want to spend to do so, so I made some assumptions:

    Beam the data into space. When you want to retrieve it, you'll have to go catch it.

    You don't want the beam waving all over the place because of the Earth's rotation, so aim it at the North Star (assuming you're in the northern hemisphere). If this activity continues for a long time, you'll have to compensate for the precession of the Earth's axis and for various smaller wobbles; I believe a correction every century or so will suffice.

    Note that if the speed of light really is an absolute limit, you'll have to count on the universe being curved and wait for the data to come around again. Again, you didn't say anything to indicate that this might be a problem.

  154. Easy Solution. by AvoidTheNoid · · Score: 0

    1. Disassemble 4 node linux cluster. 2. Reassemble 4 node linux cluster as a robot. 3. Give robot a pencil and notebook. 4. "Mr. Robotensen, take this dictation..."

  155. Big, Fast, Reliable, and Cheap? by mdielmann · · Score: 1

    Well, not too demanding, are we?

    --
    Sure I'm paranoid, but am I paranoid enough?
  156. Post it to the usenet news by MichaelCrawford · · Score: 1
    You can store immense quantities of data as ascii-armored usenet posts.

    --
    Request your free CD of my piano music.
  157. Re:options options, what is your time and data wor by wik · · Score: 2, Informative

    There's no reason why you couldn't read each of the DVDs in serially and incrementally rebuilt the lost DVD. On recovery, you should only need enough space to hold a single DVD to rebuild the remaining disk.

    A disadvantage is that the data cannot change while you write all N+1 DVDs and restoring would require lots of DVD swaps (regardless of whether you've lost a DVD or not) and the ability to incrementally write files with gaps in them (not an issue with most filesystems).

    --
    / \
    \ / ASCII ribbon campaign for peace
    x
    / \
  158. Where is the Holographic Memory When You Need It! by rocker_wannabe · · Score: 1

    Hopefully we wont have to wait long for Holographic Memory to become commercially available. It looks like the space program is one of the few to actually use Holographic Memory for anything now.

    --
    "Meaningless!, Meaningless!" says the Teacher. "Utterly meaningless!"
  159. What I would do if I were you... by Thagg · · Score: 1

    If you really have 1TB/month of fMRI data, I'd look very carefully into algorithms for compressing it. I would think that a customized compression scheme for a particular type of data might yeild quite impressive results, reducing the backup problem by an order of magnitude or more. From your description of the value of the data and the cost of recovery, some time invested in a compression scheme would be well spent.

    Thad

    --
    I love Mondays. On a Monday, anything is possible.
  160. Rsync is my friend by mcrbids · · Score: 1

    As a freelance programmer and sysadmin, I admin about 20 boxen for various customers. Virtually all Red Hat systems under various support contracts.

    Many of these servers are using hardware or software RAID, but one of the terms in most every contract I've ever signed includes the term "regular, off-site backups".

    In my case, I discovered rsync and wrote a nice, easy-to-use backup system based on rsync I've called Backup Buddy. This allows me to not only backup data, but with a minimum of additional storage usage, view my backups as a set going back through time to any point in the last (typically) 45 days, seamlessly.

    With this tool, I manage about 2 TB of data on 4 different backup servers, all remotely.

    My own backup server for my own stuff is a recycled AMD K6 system with an PCI IDE card and two HDD, 120 & 160 GB put together using Logical Volume Manager. I've fallen back on these backups innumerable times - and I can't say how nice it is when a restore can be done in 2 minutes flat from any nearby workstation.

    I also have two primary servers at different locations (hint: they are 200+ miles apart) on separate networks in case of catastrophe - they're also mirrored via rsync nightly, and a switch from one to the other takes about 3 hours.

    Uptime is important, and I think this is proof that even for small (1 employee) businesses can have a reliable, effective backup solution!

    --
    I have no problem with your religion until you decide it's reason to deprive others of the truth.
  161. ICI Optical Tape / CREO Vancouver by anubi · · Score: 2, Informative
    I haven't seen any mention of these guys here, but a few years ago, I remember a company, CREO, working on a data recorder which used a spoolable optical tape. I believe this tape was made by ICI over in Europe.

    There were several packaging options for this tape.. including reels of 2" wide tape and cartridges.

    I've lost track of what happened to it. All I remember is that this tape existed at one time and some research was being done to make data recorders of phenomenal storage capacity.

    Back in the early 90's, there was one company in Campbell, California, known as "LaserTape" which was trying to design a tape drive for the PC which used cartridges of this tape. I have lost track of whatever happened to the company.

    --
    "Prove all things; hold fast that which is good." [KJV: I Thessalonians 5:21]

  162. Many houses burning down? by Pac · · Score: 1

    Ya know, mate, there is this new thingie over which you can send your data for a ride over a distance, to be stored in another machine elsewhere. Them guys call it the Internet.

    1. Re:Many houses burning down? by Rich0 · · Score: 1

      Uh, then the original post should have mentioned that the cluster was to be distributed across multiple sites.

      Most people who have 100's of GB of data don't also have multiple co-location sites with huge pipes. I could have a cluster of linux servers at home on my 100MB ethernet in no time at all - and for a little cash I'd be at GB. With a few spare computers on hand that GB cluster might only cost me a hundred dollars, and a stack of hard drives for storage might cost me a few hundred more.

      On the other hand, if I want to distribute that same network to provide disaster protection, then I need to rent space for my servers and have a high-bandwidth link between them. If I send just 100GB per day between the computers I'd need 10Mbps of bandwidth - which would cost me thousands of dollars each month for each distributed node.

      At those kinds of costs he should just have a single RAID5 server and buy the biggest tape drive there is - he'd break even in probably only a month or two. It is a LOT cheaper to tape-backup 1TB of data and put it in a storage shed than to maintain a high-bandwidth colocation site...

  163. DSL, porn and the common man by Pac · · Score: 1

    Do you have any idea how large a short porn movie can get? Imagine a DSL or cable connection running 24/7 downloading all movies available many times over (we're talking common man here, we know he won't check the file size much less hash and be easily fooled by a simple name change, so "Paris Does It.mpg" and "ParisHiltonHomeMovie.mpg" will both be downloaded). One can easily reach the terabyte needs this way.

  164. Aww send me an invite! :-P by opweirdisntit · · Score: 0, Offtopic

    hey ya folks...lol i want a gmail invite:) POR FAVOR (spanish for please) ...gracias (spanish for thanks) email -> opweirdisntit@yahoo.com i'd like my gmail account to be nitinshantharam@gmail.com thanks in advance :P

  165. Inexpensive, Reliable, Efficient... by frank_adrian314159 · · Score: 1
    ... pick any two.

    You may call this Adrian's Law of Backup because I've never seen all three at once (and two of them together is still pretty damn hard).

    --
    That is all.
  166. The cheapest way to do this by symbolset · · Score: 1

    Today Media at $/TB is going to be IDE@200GB$508/TB.

    Hire a guy to build you servers. You'll need one server per month, figure about $300 for the cpu and OS (this is slashdot. figure it out.) That's $1308 per month, plus whatever you pay your idiot nephew to set it up.

    The good news is every month it gets cheaper.

    Hint: Since your backup stream is 3mbps continuous, I'm going to guess you're using gigabit so add $100 per server for a switch network and NIC.

    Throw the old servers in the attic as business needs require. Once it's built in our AO it costs about $1.00 a month worth of juice keep a server up, so the issue is really the business need to have the data be unavailable.

    --
    Help stamp out iliturcy.
  167. How about a nice terabyte sized external... by k31bang · · Score: 1

    How about a nice terabyte sized external hard drive? Only $1,199.00.

    --
    -+-=-+-=-+-=-+-=-+-=-+ *** http://www.mountainfort.com *** +-=-+-=-+-=-+-=-+-=-+-
  168. X Serve RAID by Anonymous Coward · · Score: 0

    3$ / GB for 400 Mps xfers and fully redundant fiber channel storage.

    http://www.apple.com/server/

  169. Re:Firewire drives? by silentbozo · · Score: 2, Informative

    Word of warning. Don't cheap out on Firewire hardware - it's touted as being bulletproof, but in practice I've found SCSI-voodoo-like interactions between cheap cards/cases, and questionable power supplies. I've pretty much given up using Firewire for applications where I need to swap drives a lot, as weird crap happens, just at the worst possible moment.

    In those applications, I've gone with dedicated ATA/133 cards with a nice roomy case with a bunch of removable drive bays. It's a pain to have to shutdown to swap drives, but less of a pain than Windows bluescreening, rebooting, and "fixing" your attached Firewire drive, scrambling all of the data on it, and making it impossible to run a recovery (no, I didn't have a backup of that data...)

    I've also had weird crap happen with my Macs as well - some hardware doesn't show up unless you have it plugged in on startup. In theory it was a great idea (mix Firewire cases with removable drive bays), in practice, you're asking for trouble if you're using cheap parts (ie, bottom-basement cases, with cheap cables.)

  170. A cheap, simple solution by Cabeiroi · · Score: 2, Informative

    I'm not sure what your price range is, but one method I've had success with is a Promise SATA add-in card and removable hard drive enclosures. SATA is hot-swappable and combine that with a cheap hard drive enclosure ($10-$30+) with any SATA hard drive of your choice and you have a relatively cheap solution.

  171. How about abusing physics law's? by OlivierB · · Score: 2, Interesting

    I remember reading one day about some research somebody did about abusing the network capacity to store data. Basically he would send mail to himself via a third party smtp server. Of course he would tell his destination server to ignore his messages until a set date, then refuse the messages which would then be bounced back to his originatin email acocunt. By having a roll on that he achieved some pretty amazing storage for FREE! with ultra reliable ISP grade mail backup. Now aplly to same principle to space! Saw you have a server on Mars. You could transmit to Mars the data in full before MArs even started receiving the Data. When Mars would receive the data it would immediately send it back, not even waiting for the message to be completely received. Thus the data would not use any storage on mars either. At this point you have achieved media less storage. And have abused the network capacity of Space. Talk about the geek factor in that!. I don't really wan't to model this network's capacity but everybody here understands that it is a function of the transmission rate, celerity, distance with "relay" server. Of course there is an amount of data for which you will start needing some sort of storage on both servers. This will noly happen if the data has time to do a return trip to and from the relay server in less time than one can transmit the data in full. Improve the transmission rate and your network "memory capacity" multiplies.

    --
    Artificial intelligence is no match for natural stupidity
  172. Don't Backup SPAM by dionisus · · Score: 1

    You know, there's really no need to backup all your SPAM. After filtering out the SPAM, I'm sure you can use the floppy method mentioned already ;)

  173. Gmail by foniksonik · · Score: 1

    When public Gmail accounts are available... just get yourself a couple hundred of 'em and write a script to partition and email yourself your data... as well as a script to download and rejoin your data... dude, save thousands by using a free service with ultimate backup, etc.

    man, I wish I had a need of this myself... I'vd just come up with the holy grail...

    --
    A fool throws a stone into a well and a thousand sages can not remove it.
  174. Re:Hard copy - "Infinite" data on a piece of paper by rixster · · Score: 1

    What would be really interesting would be an informal contest to see who could store the most data in a single page of paper.

    My entry :
    Please open your browser at www.google.com and type in what you are looking for. Then write the answer in the space below.

    --
    Two wrongs may not make a right, but three ....
  175. Not enough info to size a solution. by oneiros27 · · Score: 1
    Your main limiting factor of designing your backup strategy is the time to recovery, in your typical situation. (Time to recovery typically being a function of the cost of downtime)

    Here are a few sensitive variable which may affect your solution:
    • time to recovery
    • initial capital outlay
    • total cost of ownership
    • time window for backups
    • cost of downtime
    • man-hours of intervention required.
    • type of recovery needed
    Some less significant variables include:
    • power consumption
    • space required
    • support contract options


    For instance -- you have 1TB/month of data. That's fine. I am going to make an assumption that your work in some way is either based on a weekly (256GB/week), or daily (~51GB/day) cycle. From that, we have to look at how long you can have a load on the system as you back it up on a daily/weekly basis.

    When you have to recover, is it going to be a disaster situation (ie, restore the entire system), or used as an 'undelete' type repository (restore files x,y and z). In the first, you're limited by physics ... busses can only push data so fast; disks can only spin at a certain rate, etc. In the second, seek time is an issue -- you can get by some of the later problems by using a database that catalogs the location of files, but make sure you don't get something that can't do a restore without the database, or you're hosed in a full recovery situation.

    blah...I could go on for hours on the subject, about the type of stuff that would be useful to consider before designing a backup solution. Quite simply, you want the price of the solution to be lower or equal to the benefit that you'd get out of it -- it's not typically worth
    paying 3x the money for a 10% reduction in recovery time, unless there are some odd factors (ie, users get their month free if you have an outage of more than 12hrs).
    --
    Build it, and they will come^Hplain.
  176. How much is the data worth? by Anonymous Coward · · Score: 1, Insightful

    Ok, a few questions:

    1) How much is the data worth per day, week, mouth, year? Your final solution should reflect these data points.

    2) How quickly do you need to have access to it? Quicker means more money longer lowers the price, but add complexity.

    3) How stable does the data need to be? Is year old data worth the same as current data? What about 2, 3, 4 years later. Do you need to get that data back?

    4) How much physical room is available for the backup systems and offsite storage? Is it climate controlled, yet convenient? Is it in a different state to avoid disasters?

    5) How secure does the data need to be? Is this your customers' credit data that cannot be leaked or there are federal fines or will it just be inconvenient?

    A storage engineer would use your answers to help design a total solution. If your data isn't worth very much, then you've also shown that by this study. OTOH, if it is worth millions, don't expect to "get by" with a $20k answer from ./ readers.

    I work where there are daily penalties of $400k if we make a mistake or our systems go down. Other systems will cost $5M / hr if they aren't up. What do you think the cost of our backup and recovery system is? We have data stored in multiple locations - near, a little further and on the other side of the Earth. It takes a little longer to get the data back the further away it is. I can imagine insurance and banking where the cost of data is in the $10M per second.

    How much is your data worth?

    1. Re:How much is the data worth? by Anonymous Coward · · Score: 0

      nice try at panning...

      you simply reworded everything the parent post said but did it in a way in an attempt to make you more "informative".

      try adding to the discussion instead of acting like an overpaid consultant that is all pissy because someone that is a "commoner" knows as much or more than you do.

      space is not a concern, if your data is $2B then building an addition to the office is a tiny part of the storage solution's total cost. $30K for a office to house the $180K backup system is nothing.

  177. Work Demands should be realistic. by Linus+Sixpack · · Score: 2

    My work demands 1TB a month ....

    It sounds like you need a good cost benifit analysis and an idea of a budged.

    First RAID your existing data.

    Second Replicate any working solution you have now identically for next month and backup hardware.

    Have a serious talk with work as to what is expensive and what you can afford. What happens if a data set is lost? How much damage\cost would that incur? I would look int AIT drives from Sony.

    It sounds like you are in a frame of mind where you see everything as expensive. This will heavily influence your decision. Walk through a data disaster scenario with your backers and examine your costs in that light.

    ls

  178. Ask Google ... by DK_SplatMan · · Score: 1

    With the new G-Mail concept, Google have shown the world that they are capable of storing TBs of data without too much trouble.

    And it is not data that can be lost without problems (as suggested elsewhere) because we are talking about user-data! Not just search-engine index data.

    Why don't you ask them how much an online storage area would cost? Other suggestions here run from 500 USD to many thousands of USD each year ... plus the trouble of setting it up, keeping it safe, and paying the electrical bill ;-)

    How much online storage space can you buy from Google if you pay them 100 bucks a month? Give'em a call and ask... :-)

  179. Optical Jukebox by permaculture · · Score: 1

    http://www.google.com/search?sourceid=mozclient&ie =utf-8&oe=utf-8&q=optical+jukebox You might consider an 'Infinite Storage' optical jukebox. We ran one for awhile back for user files, but it wasn't fast enough with small files and it was replaced by SAN. Perhaps with your large files it'd perform better.

    --
    Environmentalism is the new Victorianism. Everyone ties on a green corset and pretends we're virtuous.
  180. 400GB SATA drives? Where? by Joseph+Vigneau · · Score: 1

    Can you please point a link to where I can buy a 400GB SATA drive? I know they've "announced" them, but they aren't for sale yet. Why are you recommending a solution that you cannot have possibly implemented yourself?

  181. duh by Anonymous Coward · · Score: 0

    hey asshat,

    if you are generating a TB of data a month, you are not a "common man"

    stupid troll

  182. Cheap plug, but I *am* trying to be helpful! by proggoddess · · Score: 1

    I'm a SW engineer at ATTO Technology, and one of the products here is a RAID-capable disk array. I truthfully don't know all that much about its behind-the-scenes workings as I am on a different project. The Diamond is capable of holding 24 ATA disks for a total of 7.2 TB for less than $10/GB. It does Fibre Channel or SCSI, and it can emulate a tape drive to be used with your favorite backup software.

    Here's the link for more info:
    Diamond Storage Array

    --
    --The Programming goddess from Gorflaz
  183. blue laser DVD by kazem · · Score: 1

    Later this year Panasonic, I think, will be releasing DVDs that hold 50 GB on a side using a blue laser. Of course, that's still not even close to a TB.

  184. Don't help him... by IsaacW · · Score: 1

    Anyone who needs 1TB of storage per month must be either a spammer or an *gasp* evil data-miner!

    ;-)

  185. shouldn't it be... by Extrymas · · Score: 1

    cat 1tb.txt > /dev/lprn0

    1. Re:shouldn't it be... by nazsco · · Score: 1

      well... er... i... i renamed my cat command to echo... yeah... i like it better that way. echo is much prettier than cat, and i use cat, i mean, echo a lot more than echo... wich i renamed to not_echo... yeah, thats it.

  186. Paper isn't the answer... by abb3w · · Score: 1



    A freind of a freind works at Dow Corning. He mentioned at one point that Dow uses a direct-to-microfilm printer for some of their most critical data. Expected media life is over 100 years given modest climate control. The pricetag, however, is doubtless out of this guy's range-- six figure, as I recall.

    --
    //Information does not want to be free; it wants to breed.
  187. You know what I'd really like by Babel · · Score: 1

    I have an office full of computers, relatively new-ish, all running Linux, all with 80-160GB hard drives.

    I section off a part of each disk for OS -- root, usr, var, etc.

    The rest is /home.

    What I'd really like is to tie the /home partitions together in one single network-wide, redundant file system.

    Here's how I'd plan it out:

    Start with something like the NBD (kernel level network block device). Have a distributed block hash of some kind, mapping each virtual block to one or more physical blocks. Make that layer support redundancy, so that each block was repeated (or 3 for 2 checksummed, as in RAID) elsewhere on the network, so that the distributed block driver could survive at least one machine going down (later: build in multiple levels of redundancy so that blocks get mirrored in multiple places, and the virtual block driver could survive multiple machine failures).

    Build a file system on top of that, just like regular NBD.

    Now I'm not a file system guru but I expect this can be done with the right amount of cash inflow.

    1. Re:You know what I'd really like by SuiteSisterMary · · Score: 1

      Why not just do it the way UNIX types have been doing it for thirty years; /home mounted via NFS?

      --
      Vintage computer games and RPG books available. Email me if you're interested.
    2. Re:You know what I'd really like by Babel · · Score: 1

      Because it's not distributed, it's not redundant, and it's not scalable in the way I want it to be.

      The scheme I'm proposing would be able to tie together 40 x 40GB /home partitions into one redundant, scalable, network-wide /home partition of about 800GB or so.

      Server-less networking for the masses. :)

  188. go for #2 by Ba3r · · Score: 1

    I had a friend who built a coffee table from 4 Mac Classics he got for 5 dollars each at a yard sale (only one booted). Thus, I can fairly say 20 dollars is reasonably priced for an Apple themed coffee table. And that would be the 2nd time you heard it.

  189. Re:400GB SATA drives? Where? by chrish · · Score: 1

    IIRC Seagate's 400GB disks aren't scheduled for availability until the fall.

    Paper launches suck.

    --
    - chrish
  190. Re:options options, what is your time and data wor by swb · · Score: 1

    There's no reason why you couldn't read each of the DVDs in serially and incrementally rebuilt the lost DVD.

    That's kind of where I was going; the DVDs are a storage medium for a multi-part archive, and a HDD internal to the jukebox provides the working filesystem once sequential reads of the DVDs are done.

    I know that some tape drives have memory chips for labeling and positioning data, but I've often wondered if you couldn't have a storage device with a built-in HDD and tape-type unit that would implement a HFS invisible to the OS. The HDD keeps frequently accessed data and transparently reads/writes from the tape drive. It's probably mechanically impractical and too expensive, but an interesting idea.

  191. StorageTek by garretwp · · Score: 1

    You can also save some money and get a Storagetek silo that can hold over 7000 tapes and have around 7 petabytes of tape storage at your finger tips like my work has :)

  192. hard drives??? by WINSTANLEY · · Score: 1

    I know hard drives go bad, but:

    1. if they are
    cheap enough, how about multiple drives per backup
    maybe in conjunction with multiple copies per drive?

    2. if drives do go bad, isn't data retrieval from
    dead drives a pretty advanced science?
    Sure it is pricey but the total cost again depends on the failure rate.

    If cheap drives have a failure rate of say 15%
    in the first year, using a second drive pushes the
    overall failure rate to around 2%.

    --
    It is by coff... er, will, alone I set my mind in motion...
  193. Um, It's Called a Database by blooba · · Score: 1
    Well, um, what you have is something we technology professionals call a database. More specifically, it sounds like you have a data warehouse. You need to think about things like mean-time-between-failure (MTBF) and mean-time-to-recover (MTTR), system response time, dollar cost of downtime, etc. Or, maybe you should just step aside and let the grownups handle this? Hmm?

    Seriously, though, if you want cheap online backups, go with IDE RAID. It's slower than SCSI, but a lot faster than the other options.

  194. I've had good luck with Promise Drives by TrogL · · Score: 1

    They're a hardware RAID

  195. Re:400GB SATA drives? Where? by da5idnetlimit.com · · Score: 0, Flamebait

    "Can you please point a link to where I can buy a 400GB SATA drive?"

    Here ? ... well, this one is the closer to my home... about 10 minutes on foot, in Paris (France)

    I know the description is in french, but "HITACHI DESKTAR 7K400 400 Go Serial ATA 7200t 8Mo" is quite understandable, I hope...

    "I know they've "announced" them, but they aren't for sale yet."

    It seems that chrish (the one that answered your post) have the same delusion :

    " IIRC Seagate's 400GB disks aren't scheduled for availability until the fall.

    Paper launches suck."


    Well, the world is not all Seagate centric, it seems...

    "Why are you recommending a solution that you cannot have possibly implemented yourself?"

    Well, Joseph, any other witty/uninformed comment on 400Go Sata availability and me implementing (or not, btw) them ???

    I mean, this is /., and taking my post at face value would be a leap of faith for someone that has been on slashdot for soooooo long (number 514...nice...).

    Maybe you just need to relax. Or read something else than slashdot, and keep informed on what's hot, or, more to the point, on what's out in the market...

    I mean, before posting, that is.

    Any other flame/comment ?

    Yours Faithfully,

    da5id

    --
    It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
  196. Thanks to all for lots of useful suggestions by Vigyaan · · Score: 2, Informative
    Few points are emerging (I am still reading the responses, so pardon me if your comment didn't make it here):

    Hard drives based solution seems to be (currently at least) cheap + easy to use for immediate use

    DVDs are cheaper for long term storage but automation devices are still not commonly available (plus capacity per DVD is small)

    Software RAID is slow for writing but okay for reading data

    Tapes may be an viable alternative for long term data storage but the tape drives require an initial investment

    Some readers have mentioned "LaCie Bigger Disk". $1199 for 1 TB disk space ... a price to pay for convenience.
    Since a lot of you have asked me, now I will explain the nature of my data, its storage and analysis.

    I am a scientific researcher working in computational biology. I do atomistic modeling and generate snapshots of protein conformations along MD trajectories. The data is analyzed several times to calculate different quantities. Those of you who do this kind of stuff know that we can collect and store only a fraction of data we want. I generate this data on supercomputers and then compress it (using bzip2 and gzip) to store temporarily and permanently.

    The data generated is ususally analyzed within the next month. A good fraction of the data (about 70-80%) needs to analyzed again several times for different quantities in the next few months. In my experience 80% of data is usually discarded after a year of so. Therefore 2.4 TB/year need permanent storage.

    So 500 GB at least is required for "daily use", 1-2 TB would be nice to have for intermediate use and over 2 TB will need "permanent" storage.

  197. somewhat cheap solution, 3 x RAID5 by Anonymous Coward · · Score: 1, Interesting

    Assemble a 1500$ machine. Tyan, AMD, lots of ram over processor power. Use a big, nice case like the Codegen S-101. This case has 11 x 5,25" bays, plus an additional 3,5" bay. Ditch the included psu and use something in the 650 - 750 W range.

    Use 2 5,25" bays for a dvd burner and a tape unit. In the remaining 9 5,25" bays, assemble three Promise sata enclosures, these can hold 4 disks each, and are 3 bays high. Then plug three Promise S150 SX4 RAID 5 controllers with 128 MB PC133 ECC each. Now populate all the raid enclosures with 400GB Seagate disks, as soon as they're available, for a capacity of 1,1444 TB x 3 = 3,433 TB grand total.

    Initial cost:
    1 x computer = 1500$;
    3 x S150 SX4 + 128mb = 900$;
    3 x SuperSwap 4100 = 750$;
    12 x 400GB HD, 200$ each? = 2400$:
    total = 5500$, 1,60$ per GB.

    Have handy some 4 unit sets of SuperSwap 1100 enclosures for additional 4 disk raid 5 sets.

    If you use a mobo with integrated Giga Ethernet, you can expect a network throughput about 60 to 75 MB/sec.

    This setup will provide you with online access to the current data set, plus two backups, and as many as you can pay backup sets in closet storage.
    You can reduce the initial cost reducing online raid sets, 1000$ each.

    Hope it helps,
    Bug Eyed Hardware Nut.

  198. RAID + Salvage by Havokmon · · Score: 1
    because it protects against device failure, not *user* error. if you delete a file from a raid array, it's gone. that's part of what offline is all about.

    So put it on a Netware box ($60 SBS shop.novell.com). Salvage has been around since the early 90's. IMHO there's no reason for an OS to not incorporate it's own restore feature these days.

    --
    "I can't give you a brain, so I'll give you a diploma" - The Great Oz (blatently stolen sig)
  199. Indeed (Re:Wirewire drives?) by big+ben+bullet · · Score: 1

    and here's the link
    LaCie Bigger Disk

    only problem i have with it is that they claim that 1 terabyte = 1,000,000,000,000 bytes

    not even 1,000 Gb
    let alone 1,000,000 Mb

    one of the oldest tricks in the book but still an oh-so-common practice

    1. Re:Indeed (Re:Wirewire drives?) by cdrudge · · Score: 1

      Seriously, does anyone still get caught by that? I mean, sure it was a big deal in 1995. At least now they always have it on the box or in the specs.

    2. Re:Indeed (Re:Wirewire drives?) by tyrann · · Score: 1

      That's correct.

      1 Tb = 10^12 b. A terabyte is a decimal-based number.
      1 Tib = 2^40 b. A tebibyte is a binary-based number.

      LaCie is not doing any trick there.

    3. Re:Indeed (Re:Wirewire drives?) by big+ben+bullet · · Score: 1

      i think you'd be suprised by this...

      most of them don't read slashdot though :)

    4. Re:Indeed (Re:Wirewire drives?) by big+ben+bullet · · Score: 1

      Didn't know that

      after a wikipedia search i found out this is also the case for

      Gigabyte (GB) -> Gibibyte (GiB)
      Megabyte (MB) -> Mibibyte (MiB) (without the sunglasses 8|
      Kilobyte (KB) -> Kibibyte

      and (for now still futuristic)
      Petabyte (PB) -> Petbibyte (PiB)
      Exabyte (EB) -> Exbibyte (EiB)
      Zetabyte (ZB) -> Zebibyte (ZiB)
      Yottabyte (YB) -> Yobibyte (YiB)

      and as a sidenote: this last one is so big that if 10% of the earthlings would have a completely filled GMail box that would be a capacity of just 0,00063 YB

      Does anyone have any invites left?

  200. Too slow? by kingLatency · · Score: 1

    I can't speak from experience, but I've heard that OS X's built-in software RAID is terribly slow. Perhaps that wouldn't matter for backups rather than regular use.

    --
    "I've got to stop masturbating! It makes me too lazy! Stop it, Albert. Stop it." -- Albert Einstein
  201. cups! by nortcele · · Score: 1
    One hundred million pages. Assuming he has a good laser printer with infinite toner, let's say he can print 60 ppm or one page per second. It would take one hundred million seconds to print the data, which is 1157 days, or a little over 3 years.
    He would have a bank (or beowulf cluster :) ) of printers via CUPS, and do duplex as well. The print option is still a contender!
  202. Sony AIT drives, 1.3TB per cartridge. by Moderation+abuser · · Score: 1

    If you really need that much data, it will fairly quickly become cheaper than hard disk storage. Add something like Bacula to back it up.

    --
    Government of the people, by corporate executives, for corporate profits.
  203. HMM., by way2trivial · · Score: 1
    printers claim resolution at what dpi? 2880 x 1440 for an epson 2200?
    which does full bleed printing (edge to edge)
    8.5/11 letter paper therefore can have 24480 bits per line, and 15,840 lines for a total of 387763200 bits per sheet, for a total of 48470400 bytes per sheet?

    --
    every day http://en.wikipedia.org/wiki/Special:Random
    1. Re:HMM., by way2trivial · · Score: 1

      this was going to be decent sarcasm about printer manufacturers, I was interrupted at work.. damn job.,,,,

      --
      every day http://en.wikipedia.org/wiki/Special:Random
  204. c'mon, it's only pr0n by Bad+Boy+Marty · · Score: 1

    Let's sort this out. Slashdot/Speakeasy is offering a 6 megabit DSL service; there are 86400 seconds in a day; we can average 30 days in a month. So, if all that bandwidth is dedicated to downloading pr0n, it amounts to about 1.76 TB/month.

    (Of course, that same figure would apply to downloading anything at full bandwidth on such a connection, like MP3's, MPGs, etc.)

    --
    RHCE; are you certified? Karma: ambiguous.
  205. DVD-duplication station. by rew · · Score: 1

    There are these DVD duplication stations that have a DVD burner and an robotic empty-burning-done mover. you can put a pile of some 50 DVDs on there, so you need to feed it with a new pile of empty DVDs every week or so.

    Use a RAID array of 6+1+1 to buffer the data: 6 data disks, one redundancy disk, and 1 hot spare. Remember the hot spare. Your MTBF goes up enormously through the hot spare. Trust me. Or do the math yourself. But please do replace it once one of the disks has failed, and the hot spare has come into action....

    Of course, if you require online accesss, then the offline dvds are useless.....

  206. Re:options options, what is your time and data wor by wik · · Score: 1

    If you're willing to play disk jockey, I could imagine doing this completely in software with a single DVD drive. After all, this would probably be part of a backup application, anyway. I don't see any reason to dedicate hardware to this. Furthermore, it is certainly possible to have more data than available DVD drives, so having all DVDs online at the same time is impractical in the general case.

    Building a simple hack couldn't take more than a day of hacking (at least to write a proof of concept).

    --
    / \
    \ / ASCII ribbon campaign for peace
    x
    / \
  207. Static raid for backup through par / par2 by chkn0 · · Score: 1
  208. Yeah ! Flamebait !!!! by da5idnetlimit.com · · Score: 1

    My first attempt at flamebait is duly recognized by at least on slahdot moderator. Yeah !

    OTOH I was really pissed off at seeing someone blatantly calling me a liar...when that person could have taken the time to google for "400Go SATA" and get some answer before posting...

    This post could also have finished as informative, I gave the link asked, Insightfull, for the Seagate part, or quite anything else, this being slashdot and moderators being known for their heavy use of the crack (crap?) pipe.

    My point ? well, I'm still waiting for Joseph to present some sort of excuse, at least recognising he was wrong... again, this is /., we deal in strong opinions, not in nice social intercourse.

    so, here is My Formal Mea Culpa : HEY JOSEPH !!! YOU WERE WRONG !!!!AND I'M STILL RIGHT !!!

    Lol.

    Let's see if this also get a flamebait mod... Ahh, the nice smell of Karma burning 8)

    --
    It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
    1. Re:Yeah ! Flamebait !!!! by Joseph+Vigneau · · Score: 1

      My point ? well, I'm still waiting for Joseph to present some sort of excuse, at least recognising he was wrong... again, this is /., we deal in strong opinions, not in nice social intercourse.

      The Hitachi 7K400 is still incredibly difficult to find. I have been waiting to build a SATA based machine for a while now, so I've been paying attention to my usual sources (NewEgg, PriceWatch), and still cannot find this item listed there. Perhaps it is only in limited release in Europe? That's the only place I can find it, and I can only find a very few online retailers in Europe that carry it. Some of those have the "not in stock" message...


      so, here is My Formal Mea Culpa : HEY JOSEPH !!! YOU WERE WRONG !!!!AND I'M STILL RIGHT !!!


      Whatever. How many of these things have you bought, anyway? Are they any good?

    2. Re:Yeah ! Flamebait !!!! by da5idnetlimit.com · · Score: 1

      I took a nice 8 discs and a pair or 3ware Escalade 8506-4LP (max 2TB/array, so I have two arrays in RAID0)

      The system runs on debian and provides Fileserver for a small video-editing company.

      you can find a test of the hdd here

      For the case, I took the Cooler Master STC-T01, a bit overkill, but I was a anxious of overheating ...also, it provides room for another 4 disks, but I will need another psu to power 12 disks....

      The disks are quite silencious, and the 3Ware card allow for staged disk spin-up, so I was able to run it all on only 1 PSU.

      Overall, that makes a 3Tb Fileserver for 4500Euros (including a cheap AMD XP 2400 + 1 Gig ram and Gigabit ethernet), which is quite nice if compared to NAS appliances offered elsewhere.

      Of course I don't have the whole collection of utilities to manage the box that comes from major vendors, but Webmin is quite up to the task...

      Also, having 8 disks in two RAID0 offers no security/redundancy whatsoever, but I was just asked for a giant fileserver, not a Disaster Recovery implementation 8)

      --
      It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker
  209. errata on price by da5idnetlimit.com · · Score: 1

    make it 5500 Euro, I forgot to add the 3ware cards to the price....

    --
    It takes 40+ muscles to frown, but only four to extend your arm and bitchslap the motherfucker