Slashdot Mirror


The Amazing $5k Terabyte Array

An anonymous reader writes: "Running out of space on your local disk? How about a Terabyte array for only a few thousand dollars. This article at KCGeek.com shows how to put together 1000 Gigs of hard drive space for the cost of a few desktop computers." I could rip my entire anime collection for instant access! Rip all my CDs and still have .9 Terabytes left! Maybe Mirror Usenet! I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences").

448 comments

  1. The Amazing Calculus Giveaway by Anonymous Coward · · Score: 0

    I love calculus so much, I want to give it to everyone! Come, get some integration!

    1. Re:The Amazing Calculus Giveaway by Anonymous Coward · · Score: 0


      Int[Log[x],dx] = x Log[x] - x

      happy now?

    2. Re:The Amazing Calculus Giveaway by Anonymous Coward · · Score: 0

      I love you! Yes! :)

    3. Re:The Amazing Calculus Giveaway by MoogMan · · Score: 1

      Int[Cow, dx] = beef + C
      Int[beef, dx] = mince meat + C

      Beware! Dont try to integrate chicken_soup by partial fractions...

  2. RAID by Ectropy · · Score: 0, Redundant

    I wonder if it has the capacity for any RAID form, or if it already has a RAID system built in.

    --
    Kyle "DotCom" Lynch :: http://www.kylelynch.com
    ...I need some cheeze-its...
    1. Re:RAID by Dman33 · · Score: 2

      Not to be rude, but if you RTFA you would find out in the third paragraph.

      But to answer your question, it looks like an IDE Software RAID5.

    2. Re:RAID by Sepper · · Score: 3, Informative

      Actually, if you did read the article, you would find that the proposed systems is build on ATA100 supported by RAID5 software... which mean that the last of the 8 160GB drive, would be used for parity and that leaves *ONLY* (7*160GB)/1024= 1.09375TB! Now, i know that hardware RAID5 is expensive, but just think for a second: you would have hot-swapable secure-as-long-as-only-1-hard-drive-fail personnal massive-and-fast storage system... A dream system :)

      --
      I live in Soviet Canuckistan you insensitive clod!
    3. Re:RAID by Ioldanach · · Score: 2, Insightful

      So, then, I'm confused... He's trying to use software raid, but he has 4 Promise FastTrak 100TX2 raid controllers. WTF? First off, each of those cards supports 4 drives on 2 channels... Why does he need 4 cards when he only has 8 drives? He only needs 2 cards. Second, why is he using expensive raid hardware (that doesn't even support RAID 5) when he's using software raid!?

      All he needs are two of maxtor's cards, which you can buy packaged with the drive for an extra $13. Not only that, but his prices on hard drives are way too high. 8 drives (2 with maxtor's ide cards) are $2122, per pricescan.com. Since he lists $500 for the ide cards and $3000 for the HDD's, that's a savings of $1378.

      Then, he quotes $500 for 2 GB of ram. At $70/.5GB sticks that's $280. $500 for a case??? Try $365.

      That said, the $5720 price he quotes is high by $1733. You could build one of these for just under $4000.

      Ok, I admit, I didn't include shipping.

    4. Re:RAID by edmudama · · Score: 5, Insightful

      > He's trying to use software raid, but he has 4
      > Promise FastTrak 100TX2 raid controllers. WTF?
      > First off, each of those cards supports 4
      > drives on 2 channels... Why does he need 4
      > cards when he only has 8 drives? He only needs
      > 2 cards.

      I'm a firmware engineer for Maxtor... if you're going for performance, you want 1 drive on each bus, and you don't want to use the motherboard connectors. With 2 drives on each bus, you are limiting the average transfer rate out of cache to 50% of the max transfer rate. On a modern drive with their 60-65MB/sec channel rates, you cannot stream sequentially off of 2 drives without saturating an ATA-100 cable. Even running ATA-133 won't help starting a year from now.

      Additionally, every bios I have looked at sucks in terms of performance. In most cases they have small DMA FIFOs which stutter the pipe during high speed transfers -- they literally hang the DMA lines while they empty their fifo into memory, then come back and grab another 8 words or something sad. They also tend to be very poor managers of the IRQ line. This causes delays at times when your hard drive could be giving you more data, but the host hasn't gotten around to asking for it yet.

      All the 3rd party cards have like 2Kbyte FIFOs which prevents any overrun from occurring, which alone is quite helpful in high bandwidth applications.

      The cards we include with our drives are in the lower end of Promise's spectrum... you can spend more and get more performance if you want to, which is what I suspect the author of the original article did.

      --eric

      --
      More data, damnit!
    5. Re:RAID by Alan+Partridge · · Score: 1

      you're right to be confused, I can't understand the logic of this at all. He's using 160GB drives when his ATA cards only recognise 130GB. Why? Why is he using twice as many ATA cards as he needs? Why does he need so much RAM? Why didn't he use 1000Base T? Why bother doing this at all when the next iteration of HD tech will allow maybe half a TB per unit anyway? This is geek for geek's sake.

      --
      That was classic intercourse!
    6. Re:RAID by Alan+Partridge · · Score: 1

      isn't this all irrelevant over 100base T? This is NOT a performance set-up by any means.

      --
      That was classic intercourse!
    7. Re:RAID by Ioldanach · · Score: 1

      Ok, I'll grant the need for one drive per bus. So get 4 of the cards Maxtor includes instead of 2. Still, he's setting up a fileserver for a 100Mbit ethernet network. That's what, 12MB/sec? I doubt he'll ever see any added benefit from using the high end promise cards vs. the low end ones, even if he switches to gigabit ethernet at 120MB/sec. Even a switched gigabit network with saturated pipes in & out isn't going to move more than 240MB/sec, which is probably approaching the practical limit of his system. It just seems that the bootable card with onboard RAID is wasted if he's not actually *using* the onboard raid.

    8. Re:RAID by xtink · · Score: 1

      this is a raid5 system that means every bit has to be written twice if you have a chance some time look at the disk access light on a high end raid system the drives are active all the time due to the overhead. just because you can only push so much data over your network connection doesn't mean you want to limit your IO bandwidth why do you think they use SCSI ultra 160 in high end hardware raids. and remember this is designed to be a network storage if your running on a switched network so it could be possible that the drives are reading and writing at the same time so now you have some one on the network reading a file at 12MBs and you have some one else writing a file at 12MBs your already up to 24 MBs of data you have to figure at least another 12MBs for the system to write the parity your now up to 36 MB of data flowing over the bus

      --
      I've never noticed it before but my thinking cap does sort of resemble a hockey helmet
    9. Re:RAID by Toraz+Chryx · · Score: 1

      "you cannot stream sequentially off of 2 drives without saturating an ATA-100 cable."

      Um, I was under the impression that ATA didn't support concurrent transfers?..

    10. Re:RAID by alefnort · · Score: 1

      I think he was right with "WTF?", an ATA100 controller won't address up to 160GB. Please tell me im wrong, I just sprung for a 4 channel ata133 controller. Same set-up, 1.2TB software raid0 or jus short of 1.0TB raid5. I called mine terasaurus.

    11. Re:RAID by Anon-Admin · · Score: 1

      After the return problems I have had with Maxtor over dead drives I will never buy a maxtor drive again! Although I can not argue with you comments. I must ask why a smart guy like you is working for such a crappy company?

  3. I'm sure I'll figure a way to fill it... by DNAGuy · · Score: 4, Insightful

    Its only a matter of time 'til video becomes as commonplace as MP3's on our drives. 100 Gigs is what...20 movies??? I don't see my appetite for disk space slowing down any time soon.

    Hmmm...video; logfiles that don't roll over - ever; online network backup... I'm sure to figure out a way to fill that terabyte. :)

    --

    BRENT ROCKWOOD, EST'd 1975

    1. Re:I'm sure I'll figure a way to fill it... by Dark+Legend · · Score: 1

      Yay full quality DVD rips, no crappy Divx lovely Mpeg2, so I already need over 200 gigs at that estimation to house my collection.. damn, does anyone make a DVD video jukebox?

    2. Re:I'm sure I'll figure a way to fill it... by darien · · Score: 1

      I'm sure we could all fill any arbitrary amount of space; but by the time more than 1% of us actually has a genuine use for this kind of capacity, we'll probably be able to buy it in a single drive from PC World for £199.

      I'm not sure if this is wonderful, or very depressing. It certainly doesn't make the article any less fun.

    3. Re:I'm sure I'll figure a way to fill it... by mgv · · Score: 2

      Its essential that we move to this level of secondary storage, or there is a real danger that tertiary storage systems (such as tape, DVD) may actually be able to keep up!

      Seriously , the big problem here is not having the data online, but figuring how to recover it if you lose it.

      Not that RAID is a bad thing, but I have seen RAID systems go down - I lost a day's work (not archived by myself) when my web hosting company's raid system failed completely. (They were most apologetic and offered some compensation, but the data was very gone for all their customers - I believe they bought new RAID systems from another vendor immediately thereafter).

      My 2c worth.

      Michael

      --
      There is no cryptographic solution to the problem where the intended receiver and the attacker are the same entity.
    4. Re:I'm sure I'll figure a way to fill it... by BlackSol · · Score: 3, Insightful

      Redundancy is a better solution than disposable media backup. Often more expensive, but infinately more reliable.

      Code Versioning/document management on changing files to maintain history.

      Your web hosting provider had 1 Raid system, thats only 1 level of redundancy (I know multiple disks - but on 1 system). If you want to truely ensure data you need redundant systems, such as networked backup to 2 additional machines that also utilize raid.

      If the data is critical you need to examine points of failure. Thats what clustering, and load balancing offers - total redundancy.

      --
      $sig=$1 if($brain =~ /idea\s+(.*)/i);
    5. Re:I'm sure I'll figure a way to fill it... by trezor · · Score: 1

      Hey! DivX at 1.5 Mbps (double CDencoded) with 448 kbps AC3 is so close to MPEG2 that it doesnt really matter. I for once can't see the difference. And you cant hear any difference, because you use the original DVD audiostream.

      You get a -hell- lot more quality into the flicks buy doing a double cd. That's the only way I bother doing it anyway.

      So you will prolly need just 75 gigs to achieve full quality :)

      --
      Not Buzzword 2.0 compliant. Please speak english.
    6. Re:I'm sure I'll figure a way to fill it... by grazzy · · Score: 1

      100 gigs 20 movies? well, its not. most movies available today in decent quality (1 ½ hour) are around 1.2gb in SCVD format and maybe ~700 mb in divx.

    7. Re:I'm sure I'll figure a way to fill it... by albino+eatpod · · Score: 1

      I think he's referring to the fact that most DVDs (single-sided, double-layer) are around 5-9GB each.

    8. Re:I'm sure I'll figure a way to fill it... by morcego · · Score: 1

      Thats what clustering, and load balancing offers - total redundancy.

      There is no such thing as total redundancy. Thats a plain fact.
      Even if you have load balancing and a huge cluster, you still don't have it.
      Add to it: autonomous energy system (generators), separated networks and access points, and you still don't have it.
      At IBM, our contingency plan took even nuclear wars in consideration. And even with that, we didn't have total reduncance, the same way our systems were not 100% secure.
      I think the lesson here is the same regarding security: a false sense of security is the worst security problem you can have.

      --
      morcego
    9. Re:I'm sure I'll figure a way to fill it... by Anonymous Coward · · Score: 0

      But what happens when you have most of the movies at Blockbuster. I burn them now, but it would be great to have them all available over the network.

    10. Re:I'm sure I'll figure a way to fill it... by guile*fr · · Score: 1

      who would cares about your e-business in case of nuclear war?

    11. Re:I'm sure I'll figure a way to fill it... by morcego · · Score: 1

      Anyone who operates in as many countries as IBM does.

      --
      morcego
    12. Re:I'm sure I'll figure a way to fill it... by Zeinfeld · · Score: 2
      Not that RAID is a bad thing, but I have seen RAID systems go down

      I once saw a shaddowing controller fail in such a way that it managed to corrupt both of the RAID 5 arrays it was driving. Had to bring the system back up from the first level backup.

      Soon after that we switched to using EMC gear.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
    13. Re:I'm sure I'll figure a way to fill it... by MADCOWbeserk · · Score: 1

      I have 300 or 400 divx... I'm going to need a couple.

    14. Re:I'm sure I'll figure a way to fill it... by lazn · · Score: 1

      Perhaps true, but 1TB = 1000GB not 100GB.

      EACH DRIVE in the TB server is 160GB

    15. Re:I'm sure I'll figure a way to fill it... by Telastyn · · Score: 2

      What's the law?:

      "Any hard drive, no matter the size, will be 95% full in 3 months."

    16. Re:I'm sure I'll figure a way to fill it... by grazzy · · Score: 2, Funny

      well, dvds belong in a shelf, not on your harddrive :-)

      shelf: 20$
      storage: 100 dvds

      soo, basiclly you can store 100x7 gb on a shelf for 20 bucks. thats cheap!

    17. Re:I'm sure I'll figure a way to fill it... by Anonymous Coward · · Score: 0

      No one will ever need more than a terabyte of disk space.(yes I know but I will never leave Bill off the hook). Now if he then said, "but you will always want what you can afford"....

    18. Re:I'm sure I'll figure a way to fill it... by 4of12 · · Score: 2

      Redundancy is a better solution than disposable media backup.

      This reminds me of a time when I was doing my PhD thesis.

      After working on the manuscript for the better part of a year, I began to get Beautiful Mind syndrome about losing it catastrophically due to fire, flood, etc.

      So I started to regularly ftp updated versions to a supercomputer site some 400 miles distant.

      Just in case....

      --
      "Provided by the management for your protection."
    19. Re:I'm sure I'll figure a way to fill it... by rpseguin · · Score: 2, Interesting

      > Its only a matter of time 'til video becomes as
      > commonplace as MP3's on our drives. 100 Gigs is

      > I don't see my appetite for
      > disk space slowing down any time soon.

      True enough. I disagree with Cmdr Taco's comment:

      "we now have the ability to store anything we reasonably could need"

      I used to say the same thing a while back, thinking I could never fill a disk. That was a 5M Sider drive for an Apple II...

      I just wish the stupid BIOS and drive manufacturers would get their act together on drive limits...
      Nobody will ever need more than 500M...
      Nobody will ever need more than 2G...
      Nobody will ever need more than 8G...
      Nobody will ever need more than 32G...

      How many times can you shoot yourself in the same foot with the same gun?

      > logfiles that don't roll over - ever; online

      That is a terrible architecture for storing log files... Makes them very hard to search, modify, ... You'd be better off creating a tool to iterate over a set of files for you.

      > network backup... I'm sure to figure out a way
      > to fill that terabyte. :)

      No problem there.
      A terabyte just isn't that much when you start to think of volumetric data, CFD, physics calculations, FEA, ...

      Personally, I'd really like to stop seeing all of this spinning media and start seeing solid state stuff with much higher densities...
      Frustrates me seeing people talk about 500 terabytes in a test tube. Forget that, just get the stuff working and tell me where to place my order for something I can use. :-)

    20. Re:I'm sure I'll figure a way to fill it... by susano_otter · · Score: 2

      Yeah, but I/O is painfully slow...

      --

      Any sufficiently well-organized community is indistinguishable from Government.

    21. Re:I'm sure I'll figure a way to fill it... by kesuki · · Score: 1

      If you have 200+ anime DVDs believe you me you don't want to mpeg-4 compress them. Burnt in subtitles suck so that adds an extra step of ripping the subtitles with vobsub (several extra steps on most DVDs). you have to rip Two audio tracks (english and japanese) because you want to have both. Plus you loose all those extra features.
      I'd put money on the UPS or FedEX guy delivering Taco with the needed hardware shortly. The ability to access some 200 Anime DVDs over the network with the full features of the DVD *drools* the cost of the hardware is only $25 per DVD too. Although if you go with a single processor system with only 2 IDE controllers and only 5400 rpm 40 GB drives (*8) you can build a system for under $1,000 that has 1/3 the capacity of this monster $5,000 system.
      While the maxtor engineer was right about 160 GB drives (needing an entire bus to themselves) the older slower 40 gig drives don't throttle ata100 quite the same. At anyrate I can't see my potential need for storage being alieviated until I can obtain 1TB for under $75. By then who knows what people will dream up to fill my storage capacity.

    22. Re:I'm sure I'll figure a way to fill it... by afidel · · Score: 1

      you might want to read This before putting all your eggs in the EMC baskit.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    23. Re:I'm sure I'll figure a way to fill it... by savage_panda · · Score: 1

      I'm fairly picky about backing up my data. now I could do backup all my work, and make backups of backup directories, and backup those backup directories in the future. Cause you never know when the backups fail, and the backup's of the backup's can fail too. To be truly save, the backup of the backup of the backup directory will be the last and final safe guard. Thou space requirements would increase polynomialy. If I have 10 gig now, and backup every day, and make the backup of the backup every week, and make the backup of the backup of the backup every month, I would use that terra byte up in 3 months.

    24. Re:I'm sure I'll figure a way to fill it... by LoseNotLooseGuy · · Score: 1

      Plus you loose all those extra features.

      I suspect that you would not "let loose or release" all those extra features; it is more likely that you would fail to retain those features. I believe the word you were looking for was lose.

      Congratulations! You have been participant #12 in my campaign to rid Slashdot of this error.

      --
      Proudly correcting Slashdot's most irritating linguistic error since 2002.
    25. Re:I'm sure I'll figure a way to fill it... by Alan+Partridge · · Score: 1

      I'm wondering at exactly what point Americans will reach their laziness limit. Maybe there isn't one - maybe only a total dreamworld that can be lived without ever getting out of bed will suffice. Is that why you people like the godawful Matrix so much? Why does anyone need instantaneous access to a two or three hour movie? What's ten seconds putting the disc in compared to watching it? MP3 makes sense because of the "Juke-Box" factor - iTunes on random is like a radi station that only plays music I like - I've got about a week and a half's play time in there already. But video? Nope.

      --
      That was classic intercourse!
    26. Re:I'm sure I'll figure a way to fill it... by Alan+Partridge · · Score: 1

      that rule always USED TO hold true for me, but when I bought my current machine in October 2000, I put 2 43GB 7200RPM IBM Deskstars in it and they're not even half full yet. I'm as surprised as anyone, but there you go.

      --
      That was classic intercourse!
    27. Re:I'm sure I'll figure a way to fill it... by susano_otter · · Score: 2
      Convenience != laziness, I think... movies take up less space on your hard drive than they do on a shelf (what with individual packaging, easements for the shelf-unit, &c.)--and don't forget that a hard drive full of movies can be stored anywhere; a shelf has to occupy valuable wall space, include enough clearance to get to the movies in question, and be conveniently near the viewing device.

      Cataloging, indexing, and searching your copious movie collection is a lot less painful if you can eliminate the whole shelf thing as well. Jukebox issue aside, I'm sure you could come up with a similar list of reasons why having 1.5 weeks of music on your HD is vastly superior to having it on a shelf.

      Finally, a memo to the humor-impaired lobe of your brain: the I/O comment was a joke. See, I meant to highlight the absurdities of... oh, never mind. You didn't get the Matrix, you obviously wouldn't get this. But hey, at least you have a week and a half of random music you like to take your mind of it!

      --

      Any sufficiently well-organized community is indistinguishable from Government.

    28. Re:I'm sure I'll figure a way to fill it... by Dahan · · Score: 1

      You're my hero :)

    29. Re:I'm sure I'll figure a way to fill it... by LoseNotLooseGuy · · Score: 1

      I am grateful for your support, Dahan. Perhaps you will remember me the next time you have modpoints, so that my words to the skeptical masses may carry more weight.

      --
      Proudly correcting Slashdot's most irritating linguistic error since 2002.
    30. Re:I'm sure I'll figure a way to fill it... by Alan+Partridge · · Score: 1

      was the Matrix supposed to be humorous then? No wonder I didn't get it. I thought it was just another piss-poor Reeves vehicle with a stock-Hollywood meaningless plot and some criminally underused supporting cast members. Keanu Reeves is almost unique among actors that I've seen in that he has never been in a decent pic and has apparently no acting skillz whatsoever. I've seen chair legs with more personality than KR.

      --
      That was classic intercourse!
    31. Re:I'm sure I'll figure a way to fill it... by Alan+Partridge · · Score: 1

      and henceforth I shall call myself... DefinitelyNotDefinatelyGuy. Wait, what about ThanNotThenGuy? Yep, that's the one for me!

      --
      That was classic intercourse!
    32. Re:I'm sure I'll figure a way to fill it... by LoseNotLooseGuy · · Score: 1

      Alan Partridge, your participation in the cause is gratefully noted. However, I believe the responsibility for "then/than" was filled eearlier today. On the other hand, DefinitelyNotDefinatelyBoy would be a welcome addition to the team...

      --
      Proudly correcting Slashdot's most irritating linguistic error since 2002.
    33. Re:I'm sure I'll figure a way to fill it... by adamjaskie · · Score: 0

      heh yeah i have had that problem
      I bought a nice large (at the time) 60GiB drive several months ago, and within a couple months, it was more than half full, and you could buy a 130GiB drive for the same price!!!

      --
      /usr/games/fortune
    34. Re:I'm sure I'll figure a way to fill it... by adamjaskie · · Score: 0

      I saw a nice solid state drive somewhere. Dont remember where though. It hooked up via external SCSI, and stored everything on 30 gigs of SDRAM. When it had to power down, it copied everything to a pair of mirrored 30 gig hard drives. It even had a built in UPS with enough power to put everything onto the hard drives if power failed.

      --
      /usr/games/fortune
    35. Re:I'm sure I'll figure a way to fill it... by grazzy · · Score: 1

      my point was really that you cant make a comparisation like that. if you are going to store movies on your harddrive you dont do it in raw mpeg2 uncompressed dvds. you re-code them to something like divx or SVCD to store them more efficently.

      i wasnt joking, if i would actually OWN a DVD-movie i would never go true the trouble of encoding it, and then putting it on my harddrive.

      the processe of ripping a dvd properly takes like ~20hrs with a decent computer.

    36. Re:I'm sure I'll figure a way to fill it... by susano_otter · · Score: 2

      Indeed it does. Me, I'm quite satisfied with a cabinet full of movies and a PS2... Now all I need is a Lego Disc Robot.

      --

      Any sufficiently well-organized community is indistinguishable from Government.

  4. Expanding junk by Dark+Legend · · Score: 0, Troll

    Now that just roxxors my boxxors! that's a lot of space!I would still end up filling it with useless junk, my crap tends to expand to fill the HD space strange that...

  5. The Amazing $5k Terabyte Array by Anonymous Coward · · Score: 2, Interesting

    yeah , with 160 gig ATA drives out now,
    you can do it with 6 drives vs. 10 drives,
    and alot of motherboards come with onboard
    RAID, and if you use software RAID via
    win2k or Volume manager type app for Linux
    it would rock .

    Cheap too, at $260 per drive per pricewatch .

    Peace out...

    1. Re:The Amazing $5k Terabyte Array by Anonymous Coward · · Score: 0

      The amazing thing is that you're over have way there (as far as raw storage space is concerned) using a cheap mother board with 4 IDE channels and 4 160 GB drives. Of course, you don't have any free space for a CD and no speed or security increases from RAID.

    2. Re:The Amazing $5k Terabyte Array by Lumpy · · Score: 2, Offtopic

      please tell me how you get 6 IDE drives on a pc that gives you any performance in a rad function... U160 SCSI drives will give you at least a 70% speed increase and a 80% increase in reliability....

      If I had to store a terebyte of information I'd be an idiot to use consumer level storage (IDE).

      Ever wonder why real servers uses SCSI?

      --
      Do not look at laser with remaining good eye.
    3. Re:The Amazing $5k Terabyte Array by RazzleFrog · · Score: 2, Interesting

      Just get an IDE RAID card for less than $50. That frees up primary and secondary for DVD,Burner,etc. That is assuming of course that you have a free PCI slot.

    4. Re:The Amazing $5k Terabyte Array by RazzleFrog · · Score: 3, Informative

      I believe that Promise makes the SuperTRAK Pro series of ATA RAID cards that support up to 6 drives and RAID 5. I haven't used them personally but they do exist.

      I agree that on a server or a professional workstation SCSI is the way to go for speed and reliability. But for the home consumer who wants to work with digital video the cost of a SCSI RAID set up is extremely prohibitive.

    5. Re:The Amazing $5k Terabyte Array by mprinkey · · Score: 2, Informative

      on a server or a professional workstation SCSI is the way to go

      I do wish to avoid yet another SCSI/IDE flamefest, but I would point out that this configuration is like most of its ilk--it is basically network attached storage. That means that no one will be reading or writing from the server system itself, but will be accessing the raid array through a network link via NFS and/or SMB. In my experience, performance of Linux Software RAID5 on Promise IDE controllers with 80-GB Maxtor 5400-RPM hard drives can exceed 50 MB/s write and 70 MB/s read. SMB/NFS even over Gbit ethernet will be hard pressed to saturate that.

      Having built many of these low-budget raid5 arrays, I cannot concur that SCSI and/or hardware RAID is necessary to see acceptable performance. <Horror stories about Hardware IDE RAID5 controllers deleted.>

      I do admonish would-be builders to include an extra hard drive in the raid array as a hot spare. For four drive arrays (3 data + 1 parity), it may be unnecessary. For larger system (7 data + 1 parity), I think a hot spare is a worthwhile investment. Also, avoid 7200-RPM drives if possible and actively cool all of the drives in the array. One or two fans blowing on the array can make a big difference.

    6. Re:The Amazing $5k Terabyte Array by Junta · · Score: 2

      Well, the reliability is a point taken, one device fails on an IDE chain and the whole chain may collapse depending on the problem.

      The performance is questionable. IDE is behind SCSI, but not nearly as badly anymore. And how are you accessing this storage space typically? Through a network whose speed is likely not to exceed 100 MBit, so network is essentially the bottleneck for throughput, unless of course you have some data processing application or something like oracle, in which case your point may be valid, but still, SCSI is still overpriced for what it gives... If only it had prevailed more, then it would be cheaper and this debate would be moot...

      --
      XML is like violence. If it doesn't solve the problem, use more.
    7. Re:The Amazing $5k Terabyte Array by Anonymous Coward · · Score: 0

      IDE == consumer
      Scsi 160 == I sleep at night knowing my drives will be fine even if I lose one.

      There is something to be said of a server with IDE, it is a cheap piece of crap. Sure 180 gig of IDE is only about $375 and scsi 160 180 gigs is $1,682 but it is scsi and I can fit 15 devices per cable.

      Sure it cost more but it is worth it.

    8. Re:The Amazing $5k Terabyte Array by Lumpy · · Score: 2

      This is nice when taken as a over simplified example. but that is not the case. I can access all 15 of my scsi devices and tell them to do things seperately and they will perform the job. 2 devices on the IDE chain? if one is doing a job the other has to sit and do nothing. the communication system built into SCSI gives the largest performance gains... This is why a SCSI-II hard drive and controller still feels snappier than a Ultra 33 IDE drive... yes the IDE drive is theoretically faster than the IDE, but as soon as I access the CD and drive at the same time the SCSI devices cintinue to fly while the IDE devices start falling down waiting for each other.

      They really really need to design a IDE-II specification that gives the SCSI performance traits to IDE.

      --
      Do not look at laser with remaining good eye.
    9. Re:The Amazing $5k Terabyte Array by Cramer · · Score: 2

      While I agree, SCSI drives are simply better drives, cheap is a very powerful motivator. IDE is about one tenth the cost of SCSI. So the IDE array will last a year -- how long do a lot of companies last these days? Over time, the SCSI system may, ultimately, be cheaper -- the cost of replacing failed drives, the downtime for rebuilding and restoring the array, lost productivity of a missing database, drugs for the admin headaches...

      I've built a 1.04TB array. It's an impressive hack of a system. Out of the 16 drives for the array, four (4) were defective right out of the box! And two of those replacements were suspect. After a month of handling a full news feed (120G+ per day) we've worked most of the kinks out of it (I don't recommend w2k for a drive array.)

      BTW: I used a pair of 3ware Escalade (6800) controllers. They take alot of the suckiness out of IDE (tho' it's a cabling mess.)

    10. Re:The Amazing $5k Terabyte Array by Lumpy · · Score: 2

      Also, avoid 7200-RPM drives if possible and actively cool all of the drives in the array. One or two fans blowing on the array can make a big difference.

      This is important in any drive array. IF you dont have them spaced apart and have cooling fans on them, no mater what they are, you are asking for failures and short life spans. I was am azed at the differences on my SCSI drives, seperating them another 1/4th of an inch and adding a fan blowing at each bundle of 3 drives caused a temperature drop from 110DegF to 76DegF or only 4 degrees above the server rooms ambient temperature. (Hey, I'd rather have it set for 60 but the traffic and billing ladies complain..they keep their work area at 80!)

      The suprising part is that the ML530's have a spot to place a fan in the drive cages, yet no fans installed.

      --
      Do not look at laser with remaining good eye.
    11. Re:The Amazing $5k Terabyte Array by shic · · Score: 1

      Why not use a handful of these cheap and cheerful servers as your backup facility? IMHO, the issue of reduced reliability is outweighed by the following factors:

      For many deployments, it is important to minimise down time - the cheap TB server can easily be switched into service for an emergency situation - this is less easy if you have backup tape.
      The incompetence problem is reduced by fully automatic systems - hence reducing the risk that someone may "forget to put the tape in" the night before the disaster.
      Remote physical locations can be used for backup servers like this - hence reducing the likelihood that the off-site tape (containing the only copy of the data after the fire) happens to have been buried by your neighbour's dog.

      Lets face it... the days of tape backups are (hopefully) numbered.

    12. Re:The Amazing $5k Terabyte Array by 4Runner · · Score: 1

      Well...we built a 5 75GB IDE raid 5 server over a year ago. It is almost like the system on kcgreak but since we built it last year, it has smaller drives. It uses extra ATA100 IDE controllers. Each IDE drive works as a Master. It has about 280GB of storage out of 37GB. This system has been flawless over the past year. The system also has a 35GB tape that backs up the entire system over 10 days. We have two complete sets of backup tapes for it. But, we have yet to use the tapes. So far, we have lost 3 SCSI drives over the past few years, and we have yet to lose a IDE drive. SCSI might be faster but you can't say it is more reliable. Both have moving parts, and most SCSI/IDE are built from the same moving parts, just different onboard controllers. Also, on the RAID-5 system we get about 10MB/sec writing, and 60MB/sec reading. Remember, most networks can't handle more than 10MB/sec anyway. So, if you have network storage, it really doesn't matter if you can do more than what the network can handle. Our raid-5 system has multiple ethernet cards for mutiple networks. I would like to see people with Hardware raid-5 (SCSI) post their performance numbers and they will be surprised at how low they are. In my opinion, it is not worth spendings thousands on a hardware Raid 5 solution. Mind you, I do have a Raid-0 SCSI system built for speed. It writes/reads around 120MB+/sec.

    13. Re:The Amazing $5k Terabyte Array by Anonymous Coward · · Score: 0

      There has to be a good way to get groups of drives that aren't manufactured in the same
      batch. I got a bad batch of IBM deskstars
      once. 3 out of 8 were bad. I sent back four drives, got four new ones and it's all ok now.

      If there's a defect in one batch the odds are that many of your drives will go south, reducing the efficacy of the R part in RAID.

    14. Re:The Amazing $5k Terabyte Array by RazzleFrog · · Score: 1

      The server case I use from Antec (lost as to what model and too busy today to look it up) also has the space for the fans but no fans. The fans are cheap enough of course.

    15. Re:The Amazing $5k Terabyte Array by edmudama · · Score: 3, Interesting

      > They really really need to design a IDE-II
      > specification that gives the SCSI performance
      > traits to IDE.

      They already have it -- tag command queueing has been in the ATA spec for years, since ATA-5 I think. Most vendors either have command queueing IDE drives, or are coming out with them soon.

      http://www.t13.org for more info on the various ATA specifications

      --eric

      --
      More data, damnit!
    16. Re:The Amazing $5k Terabyte Array by jandrese · · Score: 2

      "Most vendors" assuming most vendors are IBM. AFAIK, the only company with CTQ on ATA drives is IBM (and this is on the ill fated DeskStar line). Additionally, the CTQ on the ATA devices is not as sophisticated as it is on the SCSI (shallow queues ar e common), but it should be only a matter of time before this is resolved.

      --

      I read the internet for the articles.
    17. Re:The Amazing $5k Terabyte Array by ahodgson · · Score: 1

      http://www.3ware.com/

    18. Re:The Amazing $5k Terabyte Array by MShook · · Score: 1

      The only issue right is the need for the these drives by the government (these drives are SCSI-3, 160 G each):

      From: YEAHRIGHT@cdw.com
      Sent: Monday, January 28, 2002 11:59 AM
      To: Whatever
      Subject: RE: CDW Order XXXXXXX

      I have a new update on these drives. I just received the below email from Seagate. It is not good news. This change in status was as unexpected and I appologize. Per Seagate any order the government places has priority over
      any other users order. So it seems we got screwed by the gov.
      Let me know what you want to do.

      --------------
      ST1181677LCV, ST1181677LC, and ST1181677FCV just went on EXTREME allocation until March (at the earliest). The Department of Defense has taken allocation on all of the drives we had allocated for distribution for the end of January and all
      of February. These increases were unexpected and we are working to get the build schedules adjusted to accommodate for the increased demand.

      I will be working with your purchasing departments and distribution partners to direct product into your warehouses, but I must forewarn you that the outlook is not good. March is the most recent delivery date, however, that is NOT a definite. Please call me if you have any questions or need anything else. I assure you that the difficulties you experienced in
      getting a response last week will NOT happen again.

      Regards,

    19. Re:The Amazing $5k Terabyte Array by mprinkey · · Score: 1

      This is important in any drive array. IF you dont have them spaced apart and have cooling fans on them

      Excellent point. Most cases place the hard drive bays so close that there is neglible space to allow even active cooling flow. I typically have to fabricate mounting brackets for the large arrays (typically 10 to 12 drives) to make them all fit in a 4U case. When drilling holes, I partition the space evenly and typically end up with 1/4 to 3/8 inch gap between. Sandwiching them all together cuts your effective heat transfer area by almost a factor of two for two drives and a factor of ~5 for six drives.

    20. Re:The Amazing $5k Terabyte Array by edmudama · · Score: 2

      Yes, you're right, and the drive is actually slower issuing queued commands than standard read DMA commands.

      Standard ATA queue depth is 32, which I believe is the same as SCSI, though there are side projects in ATA land to increase this because of potential performance gains. (more things you can choose from, better odds of being able to choose something easy)

      --
      More data, damnit!
    21. Re:The Amazing $5k Terabyte Array by mikolas · · Score: 1

      "Also, on the RAID-5 system we get about 10MB/sec writing, and 60MB/sec reading. Remember, most networks can't handle more than 10MB/sec anyway."

      Yeah right. Usually you will have a gigabit fiber from the file server to the switch and from there on gigabit fiber to other servers and 100 megabit lines for the workstations. 100 megabit file serving network is hardly enough for any serious use and you would have such if you need that amount of storage.

    22. Re:The Amazing $5k Terabyte Array by valenti · · Score: 1

      Most of the "better" IDE Raid cards use 1 drive per channel. For example, Adaptec 2400.

      And the typical Dell PC with a hard drive and CD is shipped with each device on it's own channel.

  6. Actually by IAmATuringMachine! · · Score: 5, Interesting

    Actually a DNA sequence is only about 3GB for a human - you're anime DVDs might take more space, at least until you compress them. Then again, DNA should be fairly trivial to compress highly. Let Z = CA, Y = TG, .....

    --
    "Computer Science is no more about computers than astronomy is about telescopes."
    -E. W. Dijkstra
    1. Re:Actually by skroz · · Score: 2

      Why stop there? You could store four base pairs per byte with the most basic of compression schemes. You could probably compress it down much, much further.

      --
      -- Minds are like parachutes... they work best when open.
    2. Re:Actually by Quixote · · Score: 4, Funny

      Why stop there? You could store four base pairs per byte with the most basic of compression schemes. You could probably compress it down much, much further.

      But be careful with that compression thing! If you compress the DNA too much, you could end up like Minime

    3. Re:Actually by dNil · · Score: 5, Insightful

      You are correct that the human genome is "only about" 3 giga basepairs of sequence, but to only store that would be rather egocentric. There are as of Dec 3 2001 some 14396883064 bp in the GenBank, and the amount of sequence information still grows roughly in a exponential manner.


      Now, this will not hit the TB line anytime soon. The trouble starts if you are involved in genome sequencing. Then you need to store the raw data for all that sequence. Each some 450 bp of sequence is reconstructed from about 5 - 10 different fairly high reslution gel images (in the ballpark of 150 kBi per image). Also, recall that even short stretches of the sequence can be accompanied with a lot of annotating information, such as names and functions of genes, regualtory elements or pointers to articles explaining the experimental evidence for such. This mutiplies the storage requirement with quite a factor - nothing a neat little linux box with a huge RAID-array cannot handle though. Thats how we handle the sequencing data from Trypanosoma cruzi, by the way.

    4. Re:Actually by IAmATuringMachine! · · Score: 2, Interesting

      Good insight - I suppose I was just considering the cheap thrill at showing that it can be trivially halved, but no doubt if one is looking at base pairs alone they could probably compress it by a factor of eight. But the other poster was correct in observing that there is a plethora of other meta-information that goes along with it, such as what the various base pairs code for. Then again, if we wanted to be all GATTACA, they would probably do the simple compressed file (seemingly of a third of a gig) and the hardware would would decode it and calculate the meta-information for my insurance company.

      --
      "Computer Science is no more about computers than astronomy is about telescopes."
      -E. W. Dijkstra
    5. Re:Actually by IAmATuringMachine! · · Score: 2

      I made a grammar error (you're instead of your) - it must be that my compressed DNA didn't unzip properly.

      CATcoyboynealGTTA....

      --
      "Computer Science is no more about computers than astronomy is about telescopes."
      -E. W. Dijkstra
    6. Re:Actually by airuck · · Score: 1

      There is a lot of sequence redundancy in the public databases, because most of the sequences are not assembled into complete genomes. I work with mouse and human and the best assmblies are at Celera. I think each assembly is under 1 GB as compressed flat files (which is how they are stored and searched with BLAST and FASTA), but that only represents the assemblies. The whole package is about 30GB and includes annotations and mapped sequence features (pre-computes). Although this data is not public, I believe its size is a fair estimate of what the public will grow into. Of course, once you have all that data, you will be wanting to decorate it with your own work. I think data transfer may be a larger issue. Tape and DSL?

      --
      First entomology, then virology, and finally bioinformatics systems. Bugs follow me wherever I go.
    7. Re:Actually by mairas · · Score: 1

      Indeed. One might even consider using lossy compression for packing the DNA.

    8. Re:Actually by SpaceLifeForm · · Score: 1

      High-tech debugging so to speak.

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
    9. Re:Actually by cburley · · Score: 1
      a DNA sequence...

      Someone should make a movie about a bunch of Open Genome zealots using personal computers with huge hard-drive arrays to store and experiment with DNA sequences.

      To play the genome-equivalent of RMS, I nominate:

      Gene Hackman

      (Think about it. ;-)

      --
      Practice random senselessness and act kind of beautiful.
    10. Re:Actually by Anonymous Coward · · Score: 0

      It is a lot easier to store the analog form of DNA than in digital. A few drops of blood would easily do.

    11. Re:Actually by LegendLength · · Score: 1

      Might be a tad harder to copy or manipulate that analog information though.

    12. Re:Actually by jpostel · · Score: 2

      Too bad there is no moderation for '+1 Corny'

      ;)
      .

      --
      Ummm, Jon, aren't you supposed to be dead...? - Otter(3800)
  7. "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 2, Informative
    What is he talking about with the DNA sequences?

    human = 3 billion base pairs
    = 6 billion bits of data
    = 7.5e8 bytes
    = 7.3e5 kilobytes
    = 715 megabytes
    < 1 gigabyte

    Sure, lots of other life forms have been sequenced too, but most of these have much smaller genomes than humans.

    So how would you need a terabyte to store DNA sequences?

    1. Re: "I need to store DNA Sequences"??? by John+Betonschaar · · Score: 1

      >> Sure, lots of other life forms have been sequenced too, but most of these have much smaller genomes than humans.

      Actually, they don't :-P

    2. Re: "I need to store DNA Sequences"??? by nick255 · · Score: 2, Insightful

      OK, I'm not an expert in this area, but I think when people do research into DNA sequences they get DNA sequences from a large sample of people so they can look for statistical links between certain gene sequences and various properties. Therefore they will need reasonablesamplesize*sizeofsequence, so if you have a sample of just 1000 people you could easily be getting into terrabyte land. (Then they need a highly optimised version of diff to spot the differences in the sequences!)

    3. Re: "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 0

      so, I need 6 billion gigs to archive every person on the planet, never mind all the animals and hybrids we create! I need more space man!

    4. Re: "I need to store DNA Sequences"??? by Maddog+Batty · · Score: 1
      715MB per human genome.
      0.08% variation between one human and the next.
      Therefore 57MB required per human.
      1TB = 1048576MB

      Therefore 18000+ genomes could be fitted into 1TB. I guess compression would allow this to be increased somewhat.

      --
      wot no sig
    5. Re: "I need to store DNA Sequences"??? by dNil · · Score: 1

      You are right in the sense that to create the sequence, data from several individuals are used, but no more than 5-10 are actually required. Look up a tutorial on shotgun fragment assembly if you are curious to know more! Larger samplesizes can be effectively handled by just looking for differences between the individuals and the "concensus" sequence for that species - somewhat like what encoding a movie does, storing only differences from the last frame. Have look at for instance hgvbase to see all human sequence variations we know so far.

    6. Re: "I need to store DNA Sequences"??? by wilgamesh · · Score: 1

      Don't forget genomes have "low" variations from person to person. i.e. my genome and your genome differ only by about 1%. I mean, that translates to 30 million bases still, but that's low compared to the whole genome.

      So even better is when you start adding genomes of other humans into your database. Since person-to-person variation of genomes is quite small, on the order of at most 1 per 100 bases, that means to store the *difference* from the 1st genome requires only about 10 MB.

      So to store all human genetic material without compression, by storing it as *differences* to the first genome, assuming the human genome is uncorrelated and can't be compressed much, we would require only

      6x10^9 x 10 MB + the first original 1 GB

      ~60000 TB.

      That's only in the millions of dollars range.

    7. Re: "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 0

      looking for differences between the individuals and the "concensus" sequence for that species - somewhat like what encoding a movie does, storing only differences from the last frame.

      MPEG encoding for humans? Oh no! Kidnapster could become a reality! Run, Lucy Lu!

    8. Re: "I need to store DNA Sequences"??? by Carpathius · · Score: 1
      I think people are missing the most interesting thing about this.

      Imagine if we could use DNA strands -- or something the same size -- to store data. Your terrabyte would fit on the head of pin...

      Sean.

    9. Re: "I need to store DNA Sequences"??? by Big+Dogs+Cock · · Score: 1

      Yeah, but you're missing the most interesting thing about that. It took frigging years to sequence the human genome. Can you wait that long to access your porn? Hell, it's worse than a dialup.

      --
      "Under the iron bridge, we fist" - The Smiths, Still Ill
    10. Re: "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 0
      Sure, lots of other life forms have been sequenced too, but most of these have much smaller genomes than humans.

      Actually, they don't :-P

      Yes, they do.

    11. Re: "I need to store DNA Sequences"??? by UCSB_Levendis · · Score: 1

      I need to store suffix arrays of DNA sequences. Multiply any arbitrarily huge number by four, and then you see the terabytes coming in handy. Now I just need a terabyte of RAM.

    12. Re: "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 0

      meep zorp

    13. Re: "I need to store DNA Sequences"??? by Anonymous Coward · · Score: 0

      I work for a biotech company. Believe me, we have terrabytes worth of DNA sequences.

    14. Re: "I need to store DNA Sequences"??? by jerrytcow · · Score: 1

      human = 3 billion base pairs = 6 billion bits of data

      The way I figure it, you would need at least twice this. 3 billion pairs means 6 billion actual nucleotides, but since there are 4 possible nucleotides, so you would need at lease 2 bits to describe each one.

    15. Re: "I need to store DNA Sequences"??? by Rob+Simpson · · Score: 1
      Correct... while there are life forms that have sequences much larger than humans...

      http://gnn.tigr.org/articles/02_01/Sizing_genomes. shtml

      ...we haven't bothered with sequencing anything larger than our own genome.

  8. Need for memory/storage by morie · · Score: 5, Funny
    I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences"). slashdot

    Nobody should ever have need for more than 640 kB of RAM Bill Gates

    Simularities anyone?

    --
    Sig (appended to the end of comments I post, 54 chars)
    1. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      Nobody should ever have need for more than 640 kB of RAM Bill Gates

      Well, that WAS true until Bill Gates made it his personal mission in life to obsolete all the latest greatest technology within 1 year of it being released. A base Windows 9.0 installation alone will probably take 750 gigs and include nothing more than the libraries, Internet Explorer 10, Solitaire, Minesweeper, and Notepad 2006.

    2. Re:Need for memory/storage by Chelloveck · · Score: 3, Funny

      Personally, I won't be satisfied until I have enough storage to catalog the quantum state of every particle in the universe.

      --
      Chelloveck
      I give up on debugging. From now on, SIGSEGV is a feature.
    3. Re:Need for memory/storage by Jenming · · Score: 4, Funny

      i store my DNA sequence. I actualy have lots of copies incase i lose some or it gets corrupted.

      --
      Morpheus, God of Dreams.
    4. Re:Need for memory/storage by phurley · · Score: 1

      Of course you don't mean the quantum state of your media right?

      --
      Home Automation & Linux -- now I know I'm a geek
    5. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      And, I won't be satesfied till I have enough bandwith to update your catalog 30 times every second. :)

    6. Re:Need for memory/storage by Amazing+Quantum+Man · · Score: 2

      Are you certain about that?

      --
      Fascism starts when the efficiency of the government becomes more important than the rights of the people.
    7. Re:Need for memory/storage by VoiceOfRaisin · · Score: 1

      ya, funny.
      but bill gates never said such nonsense

    8. Re:Need for memory/storage by Rude+Turnip · · Score: 1

      I suppose you consider flushing the toilet a transfer to the offsite backup facility?

    9. Re:Need for memory/storage by jblake · · Score: 1

      I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences"). - slashdot
      Nobody should ever have need for more than 640 kB of RAM - Bill Gates

      Simularities anyone?


      Yes, they're both talking about memory needs, but there is a key difference...

      "we now have the ability to store anything we reasonably could need [right now]"
      vs.
      "Nobody should ever [in the future] have need for more than 640 kB of RAM"

      It's implied. Of course we understand that with larger available disk space people will make programs or rip DVDs or what have you that will fill it all up, but this terrabyte array isn't exactly going to be an everyday/everyuser product.

      --
      I just found a new sig.
    10. Re:Need for memory/storage by SmittyTheBold · · Score: 2

      I keep many on-site, but I prefer my off-site partial backups. I don't have a real schedule yet, but ideally I'd make a back-up at least daily.

      The backup medium isn't as reliable/error-proof as one might hope, but it's all I have for now...

      --
      ± 29 dB
    11. Re:Need for memory/storage by omnirealm · · Score: 3, Informative

      From a Huntsville Times (Alabama) interview with Bill Gates:

      QUESTION: "I read in a newspaper that in l981 you said '640K of memory should be enough for anybody.' What did you mean when you said this?"

      ANSWER: "I've said some stupid things and some wrong things, but not that. No one involved in computers would ever say that a certain amount of memory is enough for all time."

      --
      An unjust law is no law at all. - St. Augustine
    12. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      would it not be impossible to store the quantum state of every particle in the universe? to do that would require more particles than there are in the universe... I guess compression might be one way...

    13. Re:Need for memory/storage by faboo · · Score: 1

      One major difference could be thought of as timing.
      When Gates made his statement you could buy computers with 512k _soldered_ to the motherboard. To think that memory capacity would increase by only 25% in a reasonable period of time (say 5 years) is more than presumptuous.
      Currently, we are at a point where consumers are just being able to buy singular drives approaching 200GB. Now, as you can buy games nowa days that take up several gig of hard drive space, it's still presumptuous to state that you'll never need more than a terebyte of space, but not nearly so rediculous as Gates (factor of ~10 vs. ~1.25).

    14. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      Been done already.

    15. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      Wasn't there a Dr. Who episode about that?

    16. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      Including the ones in your storage?

    17. Re:Need for memory/storage by Anonymous Coward · · Score: 0

      Hmm, dunno, lemme check on the cat...

    18. Re:Need for memory/storage by morie · · Score: 1

      however, slashdot did

      --
      Sig (appended to the end of comments I post, 54 chars)
    19. Re:Need for memory/storage by jpostel · · Score: 1

      My wife and I decided to combine our DNA as an experiment. It took a little while to cook, but I think the results are good.

      http://members.bellatlantic.net/~dbbowser/baby/i nd ex.htm

      We'll probably try a few more combinations just to make sure the results are consistent.
      .

      --
      Ummm, Jon, aren't you supposed to be dead...? - Otter(3800)
    20. Re:Need for memory/storage by Alsee · · Score: 2

      Including the ones in your storage?

      Of course. The best part is that that data doesn't need error checking. It is always correct, even if it gets currupted.

      -

      --
      - - You can't take something off the Internet! That's like trying to take pee out of a swimming pool.
  9. Cost Per MB by JohnHegarty · · Score: 3, Informative

    1 Terabyte = 1024GB = 1048576 MB

    $ 5,000 /1048576 is a price of $0.0047 a mb.
    Or another was $4.88 for a GB.

    Now who remembers when harddisks where more than $10 a mb.

    1. Re:Cost Per MB by Tim+C · · Score: 2

      Not quite - most (if not all) hard drive manufacturers define megabyte as 1000*1000 bytes, gigabyte as 1000megabytes, etc.
      (See, for example, the note on http://maxtor.com/products/diamondmax/diamondmaxpl us/QuickSpecs/42093.htm stating that "1GB = 1 billion bytes")

      Therefore, in *this* case, 1TB = 1000GB = 1000000MB, which puts the price up a little (although not much, I'll admit :-) )

      Cheers,

      Tim

    2. Re:Cost Per MB by JumboMessiah · · Score: 2, Insightful

      Yup, except that to a hard drive manufacturer:

      1 Terabyte = 1000GB = 1000000 MB

      Their marketdroids have a bad habit of rounding the values down and evening them off. This allows them to post bigger numbers on the actual size of the drive since dividing by 1000000 instead of 1048576 yeilds a larger end result.

    3. Re:Cost Per MB by GigsVT · · Score: 1

      Well I know you might have just been oversimplifying for the benefit of the less informed, but they aren't really rounding.

      In the SI system, mega means million, kilo means thousand, not 1024 for kilo and 1024^2 for mega like computer geeks use it to mean. The hard disk manufacturers are just using the terms more correctly, because it is to their advantage.

      I'm sure almost everyone is familiar with Mebi and Tibi now, so I won't go into all that.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    4. Re:Cost Per MB by SWPadnos · · Score: 1

      $10 per MB - that's cheap.

      In high school (way back when the PC was relatively new), the computer club had to raise $800 to get an exernal hard drive to run our BBS - a 10MB hard drive (the Sider, for anyone who remembers).

      That's $80/MB, and we got a good deal.

      It's amazing what 20 years has done to the price of hardware.

      --
      - The Sigless Wonder
    5. Re:Cost Per MB by PowerGeek1 · · Score: 1

      I do... 40MB Seagate Hard Drive 5.25" Diameter Disks pretty cool. I still have a few around in one of my GHz machines. I put them in because they look cool with my window kit. All they use is one ISA card controller.

      --
      ~Dvorak GOTO1 GOTO GOTO :GOTO GOTO :GOTO GOTO1
    6. Re:Cost Per MB by Anonymous Coward · · Score: 0

      Heh. $10/MB is nothing. The very first consumer hard drive that I remember was the Apple Profile. It cost (retail) $5000 for a 5MB drive. That was $1000/MB back in 1982

    7. Re:Cost Per MB by Anonymous Coward · · Score: 0

      Back in my Mac days, I paid $450 for a 20MB extenal hard drive. At least it was SCSI.

    8. Re:Cost Per MB by slakdrgn · · Score: 1

      Well I guess thats one way to test legacy support :) I can see it in linux now.. give a user 40MB Quota in his user directory, just mount one of thoes puppys in there.. (just hope its a dialup user so he won't notice the speed) I've always thought thoes things were ugly tho.. but to each his own.. :)

    9. Re:Cost Per MB by Rupert · · Score: 2

      Shouldn't that be tebibyte?

      --

      --
      E_NOSIG
    10. Re:Cost Per MB by An+Onerous+Coward · · Score: 1

      Don't forget that the $5000 buys the whole computer, not just the eight hard drives. So the cost of storage alone is $2.93/Gig.

      Between this and dog food that makes its own gravy, we are truly living blessed lives.

      --

      You want the truthiness? You can't handle the truthiness!

    11. Re:Cost Per MB by Anonymous Coward · · Score: 0

      Mebi? Tibi? Screw that.

    12. Re:Cost Per MB by Anonymous Coward · · Score: 0

      HAH! US$10/MB? How about $1500 for a 10 MB drive? And in 1983 dollars!

    13. Re:Cost Per MB by athakur999 · · Score: 2

      I don't think it's marketdroids messing things up, it's the techies. SI defines "mega" to be 10^6. Somewhere along the line, techies decided "mega" meant 2^10.

      --
      "People that quote themselves in their signatures bother me" - athakur999
    14. Re:Cost Per MB by bughunter · · Score: 1

      I agree. Start talking mebibytes and tebibytes and you'll sound like Mushmouth on the old Fat Albert cartoons.

      --
      I can see the fnords!
  10. Nothing special. by Night0wl · · Score: 2, Interesting

    A terabyte isn't any thing special. But it's cool to see someone doing it. I was bored once one night. For a mere 36K you could, assuming you already own a Thunder K7 w/ the on-board SCSI pluss needed components, put together your self some really big storage. Using those 181GB Seagate SCSI drives.

    U160 and all of it churning at 10,000RPM. For a grand total of a few GB short of 5.5 Terabytes.

    But assuming you can affoard Thirty 1200$ drives you should be able to spring for a nice U160 SCSI RAID Card with an external connector ;p

    I couldn't even find a case with enough room for 30 hd's.... and I don't want to even think about cooling.

    But I wont have to worry about that. I can't even affoard a 9gb scsi drive at this point.

    --
    Computational Madness in a round package.
    1. Re:Nothing special. by Anonymous Coward · · Score: 1, Insightful

      But I wont have to worry about that. I can't even affoard a 9gb scsi drive at this point.

      And at this point, why even bother anymore? Performance isn't really that different anymore. A SCSI 73GB drive is almost $800 whereas you can pickup an 80GB UltraDMA 133 drive for $160. SCSI is dead. People would be crazy to pay those kinds of prices for SCSI over IDE anymore just because they think it is cool to have SCSI. Believe me it's not. It doesn't make a damned bit of difference. The IDE drives are just as fast as the SCSI drives, if not faster for average everyday use and cost 25% of the SCSI drive. Unless you're building a huge server array and have shitloads of money to throw away, just use IDE in your systems.

    2. Re:Nothing special. by GigsVT · · Score: 1

      Here is your case

      We used this case. You can put blocks in that allow you to install three 3.5 inch drives into two 5.25 slots, with a cooling fan, so this case holds something like 40 disks. It has two sets of two redundant power supplies available too, and room for two motherboards. It costs about $800.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    3. Re:Nothing special. by millwood · · Score: 2, Interesting
      HP virtual disk arrays

      Heard a rumor that they may be considering support for IDE in something like this.

      --

      "Hello, World", 17 errors, 31 warnings
    4. Re:Nothing special. by loraksus · · Score: 2

      Cooling? Bah, it's winter, put a fan and two box filters on either side of it and replace the furnace :)

      --
      1q2w3e4r5t6y7u8i9o0pqawsedrftgthyjukilo;p'azsxdcfv gbhnjmk,l.;/
    5. Re:Nothing special. by Aexia · · Score: 1

      IIRC, the Washington state legislature does something like this in the Capitol building. They take advantage of the heat generated by their servers and pipe it in to the rest of the building.

    6. Re:Nothing special. by Anonymous Coward · · Score: 0

      Since it is SCSI, you can have it in as many PC tower cases as you need. I would say 4-5 larget tower cases would have enough cooling & drive bays.

  11. 3Ware Escalade IDE-RAID by rdl · · Score: 5, Interesting

    I've been using these for a long time (6200 dual-port in hardware-mirror, up to the 8-port cards for large disk configs), and they're very fast and reliable. Cheap, too.

    $500 for an 8-port 64-bit RAID controller, looking to the host like a single scsi device per logical volume, seems like the best deal available. Along with a motherboard with sufficient slots for gig-e and these cards (easy to get 4 64-bit slots...maybe you can get more with 3-4 buses), and a 4U rackmount case with 16 drive bays, and you can have 4U of rackmount storage for $5k, too.

    I've been using setups like this for clients, as well as for private file storage (divx, mp3, backups, etc.), and know of people using them for USENET news servers (one of the most demanding unix apps for reasonably priced hardware).

    It goes without saying you want a journaled file system or softupdates when you have disks this size, and ideally keep them mounted read-only, and divided into smaller partitions, whenever possible. e2fsck on a 300GB partition with hundred of open files is painful.

    1. Re:3Ware Escalade IDE-RAID by mtfbwy · · Score: 1

      Yep, the 3Ware controllers are awesome and they now have a newer, faster version that rivals the most expensive controllers.

  12. How many genes do you have? by lhdentra · · Score: 1
    unless you define "Reasonable" as "I need to store DNA Sequences"


    Unless the Human Genome project re-invented the CD...

  13. Great! Where's the backup solution? by danimal · · Score: 5, Insightful
    I would rather spend the money on good disk storage with an integrated or integral back-up solution. Why? Well, as cool as all that storage it, what happens when it goes *poof* and you can't get it back. You're screwed.

    Yes, this is a groovy/geeky/cool solution for under your desk, but at least spend the extra dollars for a SCSI card and tape backup unit. You could fit the whole thing on a few DLT's. You can also keep incremental backups to keep the tape swapping to a minimum.

  14. It's easy. by Phosphor3k · · Score: 0

    Step 1) 1 x Promise 6 channel PCI ide raid controller, 99$US.

    Step 2) 12 x Maxtor 160gb ata133, 270$ each.

    Step 3) 1920gb of Pr0n and other goodies.

    1. Re:It's easy. by x1l · · Score: 0

      ummmm that would be raid 0, or jbod. Everyone know you shouldn't keep porn on raid 0

      with raid 0, you lose one disk and say goodbye to your p0rn.

      do raid 5 and you only have 1760 GB of usable porn storage space, as one disks worth of space is used for parity. even then, 12 ide disk in a raid 5 set is kinda iffy. I would expect you would see a failure in one of those drives. you get what you pay for.

  15. A much better article, also pointed to by /. by Thagg · · Score: 5, Funny

    Check out this article referenced by slashdot on July 20 2001.

    The nice thing about this article is that the people building it at SDSC really took extreme care in getting quality components that would work together to build a reliable, solid system, and still didn't spend more than $5K for a terabyte file server. In particular, the tradeoff of disk speed vs. power consumption was extremely insightful.

    I built one of these to their spec for my company, and I couldn't be happier. It's worked flawlessly since then. It's not clear if the Escalade boards are still available -- 3ware had said that they were discontinuing them, but they still appear to be for sale.

    thad

    --
    I love Mondays. On a Monday, anything is possible.
    1. Re:A much better article, also pointed to by /. by captaineo · · Score: 2

      Escalade originally planned to discontinue their IDE controllers, but due to public demand they decided to continue production...

    2. Re:A much better article, also pointed to by /. by felicity · · Score: 2, Informative
      It can actually be a bit cheaper:

      Promise FastTrak 100TX2 * 4 $500
      Maxtor DiamondMax 160GB Drive * 8 $3000
      Maxtor DiamondMax 20GB Drive $80

      You can get an Escalade 7850 for $550 or less, which is a single 64-bit card instead of the 4x Promise controllers. I don't know why there's a 20GB drive in there, maybe a boot drive? At $3k for 8 160GB drives, that's $375 each. Looking quickly at pricewatch, you can get the same Maxtor 160GB drives (5400RPM -- yuck!) for around $260 each. 8*160*(1000/1024) = 1250MB (actual MB) = 1.22 TB for a total of 550+8*260 = 2630 instead of 3580. Plus you have 3 PCI slots more than you had before.

    3. Re:A much better article, also pointed to by /. by jandrese · · Score: 2
      Be careful with those Promise controllers. Promise only supports 1 controller in a system. With the (not Fasttrak) 100TX2s, I can max the system out with only two contollers and 8 drives.

      BTW 5400 RPM has several side benefits for a design like this.
      1. You're likely saturating the PCI bus anyway, so anything faster is likely wasted
      2. 5400RPM drives draw less power (unless you're comparing them to 4800 RPM drives) than most other drives, alleviating strain on your power supply.
      3. 5400RPM drives generate less heat, and are easier to keep cool.
      4. Because they are running at lower speeds, lower performance drives tend to last longer than their equivelent faster drives (the 7200 or 10000RPM equivelent Maxtors), although this is highly dependant on the particular drive.

      Remember when you're planning on exceeding the design specifications of your system to account for all of the side effects, or you are likely to end up with fried power supplies and overheating prematurely dying drives.
      --

      I read the internet for the articles.
    4. Re:A much better article, also pointed to by /. by bigmammoth · · Score: 1

      (5400RPM -- yuck)
      I run 5400rpm drives they are quite cool and get sustained transfewer rates of 11.1megs a second, can transfer a full 650meg divx video in 55 seconds accross a 100mbs network. What more could you hope for exactly .. your not going to run gigabit switch, and your not going to run super raid cuz your not runing giagabit network anyway. and you can take thouse extra 3ms and think about world peace.

  16. It's just IDE Raid 5 folks.... Move along. by RobL3 · · Score: 2, Informative

    I hate to rain on everyones parade (I really do). But this is just a typical IDE raid 5 setup with bigger disks. Not exactly slashdot worthy IMHO. If you're thinking about doing somthing like this, Raid Level 5 is not a bad choice if you don't need redundancy. For more raid info check out:
    http://www.acnc.com/04_01_00.html

    1. Re:It's just IDE Raid 5 folks.... Move along. by sluggie · · Score: 2

      maybe it's your definition of redundancy, but if one drive fails in a RAID5 array nothing breaks. Isn't that some kind of redundancy?

    2. Re:It's just IDE Raid 5 folks.... Move along. by chill · · Score: 1

      >maybe it's your definition of redundancy, but if one drive fails in a RAID5 array nothing breaks. Isn't that some kind of redundancy?

      Exactly -- especially if you also include a hot spare in the array.

      --
      Learning HOW to think is more important than learning WHAT to think.
    3. Re:It's just IDE Raid 5 folks.... Move along. by Junta · · Score: 3, Informative

      What do you mean "if you don't need redundancy", the only RAID level that doesn't offer redundancy is RAID-0. RAID-5 can tolerate single disk failures, and if you do multiple levels of RAID-5 you can tolerate more failures (depending on how you configure it). The common configuration of RAID-5 with available hot-spares is quite sufficient in all but the most critical configurations, especially if it is a system that is closely monitored. Sure, you can build RAID-1 arrays of N drives where you can tolerate up to N-1 drive failures without problems, but for one space is used a lot less efficiently and for another write performance decreases for every extra level of redundancy you add, but that is overkill for most situations, the chances that multiple drives will fail simultaneously (or within a few hours of each other) is significantly remote compared to single drive failure probability.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    4. Re:It's just IDE Raid 5 folks.... Move along. by j.e.hahn · · Score: 1

      You're of course overlooking the other reason people in the high-end do RAID 0+1 (as it's called): performance. RAID5 relies on the storage of parity to disk. Computing that parity (whether in hardware or software) takes CPU. This typically makes RAID5 arrays slower for read and write than a mirror or a stripe. Now if you build a stripe of mirrored drives... You get serious capability. Writes are massively fast (writing across multiple spindles.) Reads are also faster as you have 2 spindles to pull from.

      We use this on our DB servers whenever performancs is critical. It's just amazing the throughput you can get.

      But,somehow, I don't see myself affording several Sun D1Ks for my home server.

    5. Re:It's just IDE Raid 5 folks.... Move along. by Junta · · Score: 2

      Not to nitpick, but the parity computations only impact writes, not reads, on reads RAID-5 is essentially a RAID-0. a RAID-1 of RAID-0 does provide stellar read performance, even compared with RAID-5, and, admittedly, the write performance would also be better (same amount of data being written, no computation or additional reads required, as is the case with RAID 5 where you have to both read and calcualte before the write can even begin). Still, on my 100-base network connection, the system could do a lot and I would never notice...

      --
      XML is like violence. If it doesn't solve the problem, use more.
    6. Re:It's just IDE Raid 5 folks.... Move along. by foobar104 · · Score: 2

      Not to nitpick, but the parity computations only impact writes, not reads, on reads RAID-5 is essentially a RAID-0.

      Depends on the RAID controller. Some-- although damn it because I can't remember their names-- do parity on the read and on the write for an extra level of error checking. The idea is to catch bad data before it gets to the OS.

    7. Re:It's just IDE Raid 5 folks.... Move along. by Stochi · · Score: 1

      one thing to mention though is the loss of space due to configuring RAID-0+1 vs RAID-5. let's say you have 8 drives (10GB each). with RAID-0+1 you get 40GB (4 drives for primary, 4 drives for mirror). with RAID-5, you would get somewhere around 70GB (all 8 drives configured for RAID-5, parity typically takes up about 1 drive).

      so, it basically comes down to: is the loss of space worth the increased speed? and that is a question that should be asked on a case-by-case basis.

      also, when you use hardware RAID controllers, the parity calculations are handled by the controller, so no CPU time is used. RAID-5 should still have slower access in theory... in practice, i don't think the difference is all *that* great.

    8. Re:It's just IDE Raid 5 folks.... Move along. by Anonymous Coward · · Score: 0

      My experience with RAID-5 is that software RAID-5 provides much better performance than lower-end RAID-5 controllers. Even a Sun StoreEdge seems pokey on writes compared to a PIII or Athlon server doing software RAID-5.

      Hopefully the $5K+ RAID solutions are faster, but I've personally been disappointed by many hardware RAID-5 solutions.
      If you want performance, go RAID 0+1. This is where cheap IDE disk shines - you can get a fast RAID 0+1 setup for less $ than the same amount of RAID-5 SCSI disk.

  17. Re:Another similar article by Anonymous Coward · · Score: 0

    Oh well... it'll get the Opera users, maybe.

  18. How to use the disk space by MosesJones · · Score: 4, Funny


    1) "Compress" at a higher rate than the CD uses (I've seen this)

    2) Use POV Ray to render Lord of the Rings for the cinema

    3) Keep every src and every .o from every build you do

    4) Set the Linux swap space to be "500Gb" because you've upgraded the Kernel to the new VM stuff and it looks cool

    5) Install Windows XP+ in two years time, with Office XP+.

    Imagine that "Minimum Reqs: 1TB of available disk space"

    It will happen

    --
    An Eye for an Eye will make the whole world blind - Gandhi
    1. Re:How to use the disk space by trezor · · Score: 1

      Sad and sadly needless to say... you're right. Gimme more space, and I'll find a way to use it. If not now, give me (and my friends at microsoft :) some time. If not humans, at least men tend to be turned on by size. Huge sizes!

      --
      Not Buzzword 2.0 compliant. Please speak english.
    2. Re:How to use the disk space by timerider · · Score: 1

      yea...

      data is in fact a so-called 'ideal gas' as it always tends to fill all available space.

      and there's still one valid point missing:
      how the f**k should one backup that stuff???

      bye
      [l]

    3. Re:How to use the disk space by olman · · Score: 1

      You think 1TB of HDD space is bad?

      Imagine installing that puppy from 20 DVDs..!

    4. Re:How to use the disk space by donglekey · · Score: 1

      1) "Compress" at a higher rate than the CD uses (I've seen this)

      That isn't compression, its interpolation between the samples.

      2) Use POV Ray to render Lord of the Rings for the cinema

      How would using a slow renderer take up more hard drive space?

    5. Re:How to use the disk space by MosesJones · · Score: 2


      1) It isn't compression its stupidity.

      2) Just because POV is slow doesn't stop a 3 hour movie at Cinema res using multiple terrabytes of disk space. Anyway Beowulf cluster POV for speed :-)

      --
      An Eye for an Eye will make the whole world blind - Gandhi
    6. Re:How to use the disk space by Anonymous Coward · · Score: 0

      Lets see.. WinXP: 1 CD. Average Linux distro: 3-7 CDs.

    7. Re:How to use the disk space by WNight · · Score: 2

      How much do you get paid to troll? Must be a good bit, or you must be really bored.

      I assume you've never installed Linux because then you'd know that of the 4-8 disks you get usually only two are asked for during install and even then the full install including apps and games comes to under a GB.

      Contrast that to Win2k. Sure, the OS is on one disk (and only takes 350MB or so) but Visual Studio is two disks, the MSDN libraries are on three disks (at least, probably more by now) and Office 2k is another disk (and takes about as much space as the OS.)

      When I've got a computer fully installed for use at work it takes about 4.5GB.

      This may be overkill for 99% of people, but then the extra six or seven disks of Linux are too. (And they contain similar stuff, extra utils, office suites, docs, source, etc.)

    8. Re:How to use the disk space by Anonymous Coward · · Score: 0

      Still too small to download every files off Morpheus. You'll need 400TB for that.

    9. Re:How to use the disk space by RogerWilco · · Score: 1

      2) Use POV Ray to render Lord of the Rings for the cinema

      How would using a slow renderer take up more hard drive space?

      Oh I can do that :)

      [4096x2048, AA 0.3]
      Width=4096
      Height=2048
      Antialias=On
      Antialias_Threshold=0.3
      Initial_Frame = 1
      Final_Frame = 540000
      (3Hrs@50fps)

      Or something like that, would use 13.5 TB.

      --
      RogerWilco the Adventurous Janitor
  19. Redundancy? by evilviper · · Score: 2, Interesting

    I'm sure some poor fool will do something like this, fill it up with data, then have ONE hard drive go bad, making everything practically useless.

    What we need isn't larger hard drive storage (not that it's a bad thing) we need more speed, and a cheap, gigantic & ultrafast tape backup system to backup all the data. Some PC designs that use better cooling methods would be very nice as well.

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    1. Re:Redundancy? by sluggie · · Score: 2

      Read the article. it sais "RAID5". Do you know what RAID5 is?

    2. Re:Redundancy? by GigsVT · · Score: 3, Insightful

      In case you didn't notice, it's RAID5. One hard disk could go bad with no issues other than slowdown.

      They could also do what we did with our IDE TB. We used three RAID5s in hardware, each with hot swap. In theory, if they failed just right, we could lose up to 6 drives without losing any data.

      The three RAID5s are hardware RAID0ed together. The worst case scenerio is a simultaneous failure of two drives on the same array. But we saved so much money using IDE that we just built two complete systems for less price than SCSI. So really, we would have to hit the worst case scenerio twice at nearly the same time to have a total loss.. It gets less and less likely.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    3. Re:Redundancy? by Anonymous Coward · · Score: 0

      So really, we would have to hit the worst case scenerio twice at nearly the same time to have a total loss..

      Or one flood.

    4. Re:Redundancy? by GigsVT · · Score: 1

      We are actually addressing that. We plan to eventually send one of the servers off-site to our other location in a different state.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    5. Re:Redundancy? by evilviper · · Score: 1

      Yes. My comment was not on the article, but on some user comments preceding mine.

      --
      Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant
    6. Re:Redundancy? by Anonymous Coward · · Score: 0

      how does this get modded to plus 2? this is one
      of the least informed comments i've seen on /. in
      ages.

    7. Re:Redundancy? by DrVxD · · Score: 1

      ...off-site to our other location in a different state

      Or, even better, open an office in another Solar System :)

      --
      Not everything that can be measured matters; Not everything that matters can be measured.
  20. Yeah, just like ... by Troed · · Score: 0, Redundant
    "640kb should be enough for anyone"

  21. Done before by IceFox · · Score: 2

    In fact I remember reading somewhere about a year ago on the linux terminal page about how they put a tb server together for right around 4K I can't find the link, but if someone does please post. But grabbing the third largest drive (100GB) out there will save you a bundle and you still only need 10.

    --
    Do you changes clothes while making the "chee-chee-cha-cha-choh" transformation sound?
    1. Re:Done before by Ioldanach · · Score: 1

      But grabbing the third largest drive (100GB) out there will save you a bundle and you still only need 10.

      Actually, it won't save you a bundle. The maxtor 160GB drive he's using is $262 ea, for a price of $1.64/GB (roughly). The cheapest 100GB drive is $193, for a price of $1.93/GB.

      As you can see, the price per gig of the maxtor is significantly less. 8 160 GB drives comes to $2096 and holds 1280 GB, which, at RAID-5 with one disk for redundancy is 1120 data, 160 spare. 11 100GB disks costs $2123 and holds 1000GB data, 100GB spare. Costs more, holds less.

    2. Re:Done before by Anonymous Coward · · Score: 0

      Also, if you are concidering power usage, the 10 100Gb drives will use alot more power.

      Those 160's will pay for themselves at that rate..

    3. Re:Done before by Danielle+Gatton · · Score: 1

      He was almost right.... sort of. If you drop down to the fourth smallest disk size, it's $114 for 80GB Maxtors, or $1.425/GB. That's substantially cheaper than the 160GB drives, but still not cheap enough to make up for the fact that you'll need another controller card (or a more expensive card with more ports), and probably a bigger, better ventilated case, and probably a better power supply.

  22. Re:Great! Where's the backup solution? by drsoran · · Score: 2, Insightful

    That reminds me, I don't know where the hell the tape manufacturers think they're marketing to, but with 80 GB hard drives common now, it's rare to find a tape backup solution that is affordable for a consumer that can handle that much. By affordable I mean drives around $250 and tapes under $10/piece for at least 50GB of storage. I've seen some of the proprietary drives but the tapes cost almost as much as the drive! 5 or 6 years ago the backup drives available to consumers could handle backing up the entire average hard drive of the time onto a $15 tape (Travan), but now people are probably just doing without backups which is a disaster waiting to happen.

  23. 64 bit address space by QuantumG · · Score: 2

    pfft, these days people are demanding a terabyte of RAM.

    --
    How we know is more important than what we know.
  24. Not really THAT surprising . . . by div_2n · · Score: 1

    It seems to me that having the ability to throw together a low cost TB storage array while novel, is not that big of a surprise. As most of you probably do, I remember a time when hard drives were a novelty themselves coming in sizes of like 10 MB.

    The natural progression of storage space seems to be one where the price is constantly dropping while size is constantly increasing. Is it really going to be all that long before you can buy TB sized storage devices in a single unit?

  25. Re:Great! Where's the backup solution? by Greyfox · · Score: 2

    How about another terabyte array and rdiff? While Joe Average User probably isn't going to be able to afford to do that, he's probably not going to be able to want to build the first one either. If you're a small to medium size company, it'd probably be worth considering. I think by the time you start talking this price tag, you'd be considering some of the mainfraime storage companies for DASD and backup though. IBM's 2105 "Shark" machine will go larger than 11TB now, IIRC, and I'm sure the other "big iron" shops have similar solutions.

    --

    I'm trying to teach myself to set people on fire with my mind... Is it hot in here?

  26. 1000 gigs???? by sweat · · Score: 1

    Am I the only one around struggling along with a 10 GB hard drive?? What could someone need 100 GB for, someone tell me.

    1. Re:1000 gigs???? by ichimunki · · Score: 1

      Audio and video. Sans compression if possible.

      Real life databases for actual businesses with more than two products and employees.

      Computer based animation comes to the amateur market.

      And these are just a few of the serious ideas I could come up with. I can imagine two silly ideas just to use the space for every serious idea.

      --
      I do not have a signature
    2. Re:1000 gigs???? by trezor · · Score: 1

      I got 50 gigs. I freak out if I ever drop below 10 (true). I like to have the ability to store uncompressed music, in many channels. I'm a musician, and I -never- ever compress any sound until mastering is done. Thats at least 250 megs pr. master. Still that's pretty small size. When you rip a DVD, it's fairly common to just decrypt the data and store it on your disc. Could be 5-6 GBs. Not that you need to, but it's practical. And if I want to borrow a DVD, and the guy is short on time, I got space to make a copy! How neat :) If you should choose to compress this further on, to DivX on a 2CD setup, you should at least account for all compressed data twice before being done. So one DVD rip could easily use 10 gigs. And ofcourse we're all lazy. "Ill encode it later". So all of a sudden you need 10 gigs more. And this is just work. You need entertainment. All those MP3s, MPGs and AVI files... Using a 100 gigs isn't really a man's job. It's a lazy man's job. If he were effective, he would'a found a way to use less :) Trezor

      --
      Not Buzzword 2.0 compliant. Please speak english.
    3. Re:1000 gigs???? by Carpathius · · Score: 1
      Last summer I bought the Archos MP3 player -- this month I upgraded it from 6G to 30G, and had to add a disk to my Windows system to store the additional MP3s.

      I found it amusing, though. My other system, the one I actually do work on, is a Linux system with a 2G and a 4G disk -- if I put the drive from the mp3 player into my Linux box, I double it's space.

      Not that I really need it, though -- I've got probably 3G free on that system.

      Sean.

    4. Re:1000 gigs???? by Anonymous Coward · · Score: 0

      My dual G4 800 has 100GB of storage in it. Filled it up last week. About to order 160GB Maxtor drive in external FireWire enclosure.

      Everyone needs space.

      blakespot

  27. 2TB for $8300 by GigsVT · · Score: 5, Interesting

    Inspired by Slashdot's earlier story that was nearly identical, and with the help of Peter Ashford from ACCS, we built two servers, both with capacities well over a TB, for around $8000 each. They have the capacity to expand to 3TB if need be.

    Story here

    As far as performance:
    (from my memory)
    EXT3: About 16MB/Sec block write, 45MB/sec block read
    ReiserFS: About 20MB/sec block write, 130MB/Sec block read (that's no typo).
    XFS: About 30MB/sec block write, 85MB/sec block read.

    It seems that file system plays a large role in performance. The arrays are three RAID5 in hardware using Linux software RAID0 on top of the RAID5 arrays to tie them together.

    IDE RAID controllers are 3ware Escalade 7810. Write performance can be greatly increased by using 7850 cards that have more cache.

    We stuck with XFS, Reiserfs had a bigfile bug, files created over 2GB would lock up the computer basically. XFS in general seemed much more mature, reiserfs seems more like someone's college thesis project, that they never cleaned up to be production grade.

    We experimented with different RAID0 stripe sizes, the hardware RAID5 stripe size is fixed at 64k, there are 7 active disks in each array and one hot spare. Stripe size tweaking seemed to mostly trade off read for write speed, within a certain range of values, with a taper off in performance at either extreme, (down around 8k stripes, or over 1024k stripes)

    We eventually went with 1024k stripes. That is what the benchmarks above reflect. The variance in file system performance could very well be due to interactions with stripe size, but there seemed to be common themes (reiser always read fastest no matter what stripe, XFS was always better at writes)

    I have been in so many arguments with SCSI zealots on here over this RAID... I wish people would understand what price/performance ratio means. IDE isn't a superior technology, but every now and then, it is the right tool for the job, when price is a goal too.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  28. Uhm, redundancy in posting? by VWswing · · Score: 3, Interesting

    Is this any more special than the last time
    slashdot announced an amazing terabyte arrayHere

    Seriously though.. People's numbers are pretty far off. This can be done for about 3000.. Pricewatch
    has 160 gig drives for $259 .. 10 of these would give you over 1 terabyte in useable space in raid 1.. Or if you just cared about write performance, 6 of them for $1554 would give you a terabyte of useable storage.. another $600 to throw together a cheap pc and cheap ide raid cards.. you get it for under $2500.. big deal.

    Lately I'm realizing how awful IDE really is.. I finally got around to throwing 2 36 gig ultra 160 drives on my box with an adaptec scsi card, running ext3 on top of a raid mirror.. more space than I need (I just keep all my mp3s on an IDE raid.. since my dragon motherboard has ide raid built in).. Since I've gone to scsi life has been happy. I can do things while compiling, while vacuuming my db, etc..

    Funny how mac used scsi before the rest of us, huh?

    --
    "And how can this be? For he is the ..."
    1. Re:Uhm, redundancy in posting? by GigsVT · · Score: 1

      $600 will not buy you a system to support a TB RAID.

      I've actually designed one of these, and it's a little more involved than you might think.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
    2. Re:Uhm, redundancy in posting? by Junta · · Score: 2

      And now Mac uses IDE... I use IDE too, and it is pretty clearcut to me that while SCSI drives offer better performance, IDE drive offer better bang for the buck. Recent IDE drives are nearly indistinguishable (to me) from SCSI drives in terms of desktop application performance. Now when you are building a raid array with, say, a high-load database on top of it, then yes, SCSI will be worth it. On the other hand, if it is a single user workstation, I don't really see much of a reason to go to SCSI. In fact, I'm currently building a network file server with raid with no more than 3 or 4 users over a 100-MBit connection concurrently, and I see no reason to use SCSI raid over software RAID on IDE, since SCSI increases the cost dramatically and offers little perceivable benefit...

      And as far as this story is concerned, the array I plan to build happens to cost about 1100 dollars and provides 480 gigs of storage (160 more is redundancy, so if redundancy wasn't an issue, it would be 640 gigs, and as we all know, "640 Gigs ought to be enough for anyone"), so a $5k terabyte seems a bit steep when you think about it.

      --
      XML is like violence. If it doesn't solve the problem, use more.
    3. Re:Uhm, redundancy in posting? by VWswing · · Score: 1

      If anything I see more reason to use SCSI on
      a workstation than on a small fileserver..

      Ram is soooooooo cheap right now I can't see why you wouldn't just buy 4 gigs of ram, and cache all the most commonly accessed files in memory.. hell, buy a cheap used xeon box for a grand, and spend another 1600 buying 16 gigs of pc100 memory...

      scsi on a personal workstation.. Here are some of the daily activities that slow me down where scsi would solve the problem :

      Compilations
      Finding files/directory tree searches
      rsyncing my home directory onto another machine

      those three things warranted to me the extra cost of scsi.. now I can compile a kernel while doing all my normal activities..

      --
      "And how can this be? For he is the ..."
  29. Looks great in Opera, but sucks in IE5.5 by Anonymous Coward · · Score: 0

    Can users of other browsers give me some feedback here please???

    Ta...

    1. Re:Looks great in Opera, but sucks in IE5.5 by Anonymous Coward · · Score: 0

      Yes, from a Galeon user:
      Please leave this site and never come back.

      Thank you!

    2. Re:Looks great in Opera, but sucks in IE5.5 by Anonymous Coward · · Score: 0
      I've been here for over two years now, my UID is in the mid 150k's which seemed horribly high at the time, but is now fairly low compared to the supposed half a million registered users.


      Recent events have tarnished my view of slashdot, I am no longer eligible to metamoderate as a result of those events, with no explanation from the so-called editors. Therefore I've decided to fuck about until I get bored and leave for good.


      Can I take it, from your reply, that Galleon stops underlining the text at the word "here."??

      /me chalks up another browser... Great! Thanks for your help

    3. Re:Looks great in Opera, but sucks in IE5.5 by Anonymous Coward · · Score: 0

      Just tried IE6.0, Opera 6.0 and Mozilla 0.9.6

      Seems equally effective in all of them.

    4. Re:Looks great in Opera, but sucks in IE5.5 by Anonymous Coward · · Score: 0

      Haha! No, sorry. It doesn't.

      But: It would still be great if you leave and never come back. Nobody needs you here, nobody wants you here. You are just wasting hard disc space that could better used with "first post" or "imagine a beowulf cluster" messages.

      Thank you for your attention!

      PS: Remember that suicide is an option for you as well! There's somewhere a HowTo about shooting yourself in the head, so please stop looking at goatse.cx and search for it.

  30. YKYHBRSFTLW by Graabein · · Score: 2, Funny
    You know you have been reading /. for too long when...

    The first thing that runs through your mind when you see the above headline is: "Wow, imagine a Beowolf cluster..."

    Argh.

    --
    And remember kids: Never trust a computer you can actually lift.
    1. Re:YKYHBRSFTLW by j1mmy · · Score: 1

      the beowolf cluster jokes got old. now the jokes about beowlf cluster jokes are getting old. please, stop the hurting.

  31. 4 Controllers for $500? by sluggie · · Score: 2

    Why not snap in a Promise SX6000 for like $250?

    This neat piece DOES hardware RAID5, so you don't need a fast cpu&mobo, less RAM, and since it can only manage up to 6 drives you can even have 2 as pseudo hot spare...

    The only drawback is the ability of "only" storing 800GB which is nice at this even cheaper price...

    1. Re:4 Controllers for $500? by GigsVT · · Score: 1

      It's i960 based, so write speeds are going to be a lot slower than software RAID5.

      Also, you can't really put more than one SX6000 in a single system, that sort of hampers scalibility.

      Still, if you wanted 1TB, it might be a good option, so long as you don't care too much about write speeds, or you use it in JBOD mode with software RAID.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  32. Re:Great! Where's the backup solution? by GigsVT · · Score: 1

    We're using rsync over ssh... These IDE TB's were so cheap, we just built two for redundancy. Every night the second one backs up the first one.

    We still have a tape robot, but we will only be backing up the most critical of data, our tape robot is only 1.2TB and cost many times what the TB RAIDs did.

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  33. Why use expensive online storage? by JoeShmoe · · Score: 3, Interesting

    Aren't these types of systems more for archiving massive amounts of data than actively working on it? I mean, how much data can a computer actively process anyway? Wouldn't a 100GB drive meet just about any processing demands (genome tracking, video editing, etc)?

    Why not use slower but MUCH cheaper offline storage? I really like the design goal of

    http://www.dvdchanger.com/

    You can easily get 1TB of storage with such a device for less than $1000. True, only one person can access it at a time but that is only because PowerFile wants to charge more for so-called "networked version".

    In theory, if someone could figure out how to build on of these things, you could throw in a two or three CD/DVD drives for accessing and a 20GB hard drive to buffer images. Boom. Now you have the perfect storage backbone for a house-wide media center. I just wish Linksys or someone would throw a linux thinserver onto of the PowerFile hardware and get me something cheap and network-ready.

    - JoeShmoe

    .

    --
    -- I wonder which will go down in history as the bigger failure: the War on Drugs or the War on Filesharing
    1. Re:Why use expensive online storage? by tsangc · · Score: 2, Interesting
      Wouldn't a 100GB drive meet just about any processing demands (genome tracking, video editing, etc)?

      No. Using a very low video data rate (ie, DV25), you're looking at 3.5MByte/sec. That's only eight hours of video. No one captures only what they're going to use, since that's the whole point of editing--you take all your material, capture some of it, and cut that selection down to a final product. So if you're making a 1 hour production, you might very well have 10-20 hours of footage if not more. Of course, you'll selectively digitize with a batch capture deck, but still...

      And of course, editors might have multiple projects running simultaneously. And most secondary media devices are too slow to restore material to your primary disks.

      DV25 is only suitable for consumers, event videography (weddings etc) and industrial work. Go to DV50 and that datastream is doubled-DV50 (ie, DVCPRO50, DigitalS) is minimum level for broadcast work.

      Now, raise that for HD editing requirements. Sony's HDCAM, a highly compressed solution, runs at 140MBit/sec, uncompressed HDTV is above 996MBit/sec (and that's downsampled and cropped).

      So no, 100GByte is really only scratching the surface of what video editing requires.

      Calum

    2. Re:Why use expensive online storage? by Anonymous Coward · · Score: 0

      Why keep it online? Because it's convenient! And it's cheaper to buy another hard drive than to invest in a serious tape backup system. And burning DVD's is slow.

      The reason I have a 4 x 60 GB Raid 5 system (180 GB of on-line redundant storage) is I have 850 CD's ripped to MP3 and OGG. That's currently 68 GB, and my collection keeps growing at an average rate of about 10 CD's a month - mostly secondhand.

      I don't user napster or morpheus or anything other file-swapping service, I rip it all myself.

      Plus I keep ISO's of the last Mandrake release, a whole lot of other RPMs and software, about 15 different linux kernel builds, a few gigs of high-resolution photo scans stored in a lossless format, .... and lots of other junk.

      It currently adds up to 100 GB, and I like having it all live on my local network. I back it up by mirroring it to a single 120 GB drive on a different computer, which I keep at work. This has the advantage of giving me access to ALL my files both at home and work, and redundancy even if I am robbed or my apartment burns down.

      The only reason I ever delete anything is just to stay organized.

  34. Slashdot by pbrinich · · Score: 1

    unless you define "Reasonable" as "I need to store DNA Sequences"


    Gotta Love slashdot Humor!
  35. DVD Video Jukebox by IsThisNickTaken · · Score: 1

    Sony makes a few. Here are a couple of links. http://www.sel.sony.com/SEL/corpcomm/news/consumer /625.html http://63.224.30.17/NASApp/HE/browser/dvd.jsp?GXHC _gx_session_id_=GXLiteSessionID--26601574686454386 53&

    1. Re:DVD Video Jukebox by Dark+Legend · · Score: 1

      Thanx for the links.. cool.. now all I need is a home entertainment network to connect it to and distribute across the house!

    2. Re:DVD Video Jukebox by Anonymous Coward · · Score: 0

      I have one of the 300+ 1 disc dvd players. Do not spend your money on this junk. I have had to have mine in the shop for more then three months.
      The stupid thing keeps trying to jam two Dvd's into its self at the same time. If you realy want one try the kenwood 400 +3 disc.
      When they work they are great. Take one of these and a vidio modulator so you can insert the dvd'd output on a unsued cable channel and you have a instant hole house vidio DVD jukebox.

    3. Re:DVD Video Jukebox by Anonymous Coward · · Score: 0


      I too have one of the Sony DVD jukes. Mine's a 200.

      I've had it a little over a month and thus far (knock on wood) I have not yet experienced any hardware difficulties.

      But I will say this : the software that you use to interface the juke is an unmittigated piece of shit!

      Sony does great hardware (for the most part). But what they fail at miserably and consistantly is software.

      I truly haven't seen an interface since this badly done since DOS days ! It is just so damned klunky, it seems like it was hacked together by a first year programming student. Way to go, Sony!!!

      Also, may I say that the icons that they use to represent the various genre are just utterly insane!

      I dunno, maybe these are cultural differences vis a vis east/west. But when I think of the category, say...oh I don't know. how about drama? For example : when I think of drama, what usually comes to mind as representative? How about the masks 0f Comedy and tragedy. Instead what you get is a blocky image of a face crying that seriously looks like it was drawn by a derranged five year old.

      And that's just one example! Believe me there are many more, and I could justifiably sit here and rag on it all day.

      The interface is so poorly done it is almost impossible to believe that this is a shipping product by a major company, and not a high school science fair project!

  36. Re:Great! Where's the backup solution? by swestcott · · Score: 1

    I think whith the price of this unit you could easy get two of them and miror each other

  37. Re:Great! Where's the backup solution? by Lumpy · · Score: 2

    If you're going to spend $4K on a DLT drive, spend $8K on a DLT tape library that holds 10 DLT's plus 1 cleaning tape and forget about it. Sure it's only 700Gig of backup but you can always compress.. Otherwise upgrade to a 20 DLT tape library box and call it done.

    --
    Do not look at laser with remaining good eye.
  38. Inquiring minds by enigma48 · · Score: 1

    Hi all, quick question for the brighter bunch.

    Last time I looked at IDE in any technical depth, I only saw four addresses "reserved" for IDE controller use. I guess you can have any address, but the BIOS couldn't boot off any address, it has to know where to look for the controller. Predetermined list of 4 seems to ring a bell.

    Secondly, IDE seems to REALLY hit the breaks when you do two independant operations on two drives on the same channel (say, a read on drive 1 and writer on drive 2). If my 4 controller addresses educated guess is right, and performance does crawl, you'd probably want to have 4 drives on 4 controllers, one each.

    If all the above is correct, this guy is plain wrong. He's published, I'm not, I'm willing to admit defeat - where am I wrong? Do the raid controllers emulate being scsi hosts, run off OS drivers (=likely windows ones), etc?

    Thanks - inquiring minds.. yadda yadda.

    1. Re:Inquiring minds by Tazzy531 · · Score: 2

      The requirements specify 4 PCI RAID controllers. Each of these could potentially handle 4 hard drives. I'm assuming that he's only putting 2 on each so that it doesn't come across the problem of accessing 2 drives on the same channel. In addition to this, there are 2 more on the motherboard, that I guess he isn't using. Secondly, these cards are bootable. So any one of them can be set to boot from and you can boot from any drive. But I don't think he is doing that because he has an additional 20 gig drive that I'm assuming is going on the motherboard. That is where the OS is going to be installed.

      Go here for the datasheet

      --


      _______________________________
      "I'm not Conceited...I'm just a realist..."
    2. Re:Inquiring minds by Ioldanach · · Score: 1

      But why is he using expensive raid cards when he's not using the raid functionality of the card, and instead using software raid? Wouldn't it be cheaper to use the cards you can get with the big maxtor drives for an extra $13 ea?

    3. Re:Inquiring minds by edmudama · · Score: 3, Informative

      > Last time I looked at IDE in any technical
      > depth, I only saw four addresses "reserved" for
      > IDE controller use. I guess you can have any
      > address, but the BIOS couldn't boot off any
      > address, it has to know where to look for the
      > controller. Predetermined list of 4 seems to
      > ring a bell.

      There are 4 addresses, but you can only boot off the first 2 in most operating systems. There are ways to get more than 4 up and running to expand to lots of drives, but not sure what OSs it works with.

      > Secondly, IDE seems to REALLY hit the breaks
      > when you do two independant operations on two
      > drives on the same channel (say, a read on
      > drive 1 and writer on drive 2).

      The issue is that most ATA implementation don't support command queueing, therefore there is no bus release. Each command finishes to completion until the bus is released, while the other drive sits idle. Upcoming drives will be implementing queueing and won't have this performance limitation.

      > If my 4 controller addresses educated guess is
      > right, and performance does crawl, you'd
      > probably want to have 4 drives on 4
      > controllers, one each.

      The secondary port isn't inherently slower than the primary port. However, each port uses a controller address. (0x178 or something for the first, can't remember offhand)

      Best performance is achieved with one drive per cable.

      > If all the above is correct, this guy is plain
      > wrong. He's published, I'm not, I'm willing to
      > admit defeat - where am I wrong? Do the raid
      > controllers emulate being scsi hosts, run off
      > OS drivers (=likely windows ones), etc?

      Everything except ATA hard drives are emulated as SCSI hosts. ATAPI (the CDROM protocol) is simply a packet scsi over an ATA cable. The raid controllers also just use the built-in scsi layer in the OS.

      eric

      http://www.t13.org for the real ATA specs if you're curious

      --
      More data, damnit!
    4. Re:Inquiring minds by ryanwright · · Score: 2

      Do the raid controllers emulate being scsi hosts, run off OS drivers (=likely windows ones), etc?

      Yes. As far as the OS is concerned, each raid controller is one big giant SCSI disk. There are no master/slave disks, each controller has 8 ports for 8 drives - again, only one drive per channel, no master/slave.

      The IDE RAID controller is the thing that makes this work. It takes care of all the issues you mentioned (drive limitations, booting, speed issues, etc). But since you can only have 8 drives on a single controller, you put in multiple controllers. With 3 controllers, you can get 24 drives. At 120GB a pop, that's 2880GB. You'll lose some of that to RAID but you're still looking at close to 2TB. Then you do a software raid 0 on the 3 drives (as far as the OS is concerned, you have three huge scsi disks) and you can create one giant partition with very acceptable performance.

      --
      -Ryan, with the unoriginal sig
  39. Re:Nothing special - bullsh*t by chill · · Score: 2

    1. SCSI can handle between 15-30 devices on one good controller, *not* including support for multi-disk changers via LUNs. Most IDE can handle 2-4.

    2. SCSI drives don't turn into slugs when you access more than one at a time. IDE does. Want to see it REALLY screw up, access an ATAPI CD-ROM slave the same time as a HD Master on the same controller.

    --
    Learning HOW to think is more important than learning WHAT to think.
  40. Why not firewire? by weave · · Score: 3, Insightful
    Maxtor now has a 160 gig external firewire drive. You can chain 62 of these puppies." Screw terabyte, think petabyte.

    I figure this is the easiest way to add as you grow without having to break open the case and try to figure out how to add another damn drive in there. For backup, just have two systems with identical capacities and rsync between the two nightly.

    RAID is nice, but for home use, it's not as nice as a nightly mirror. Why? I've seen RAID controllers fail and take out an entire RAID set. RAID also doesn't deal with the "Holy shit, I just accidently type `rm * ~` instead of `rm *~` problem."

    1. Re:Why not firewire? by Ixohoxi · · Score: 2, Informative
      160 GB * 62 = 9920 GB = approx 9.9 TB
      9.9 TB = approx 0.01 PetaByte

      Don't hold your breath thinking about petabytes.

      Also, RAID isn't for people who make stupid mistakes. Sorry about your 'rm' debacle.

      --
      What's a second? An hour? A day?
      It has much more to do with
      the Earth's rotation than with cesium.
    2. Re:Why not firewire? by j1mmy · · Score: 1

      Also, RAID isn't for people who make stupid mistakes. Sorry about your 'rm' debacle

      97% of all stupid mistakes are made in the home.

    3. Re:Why not firewire? by Anonymous Coward · · Score: 0

      Answer1: Those firewire drives are really just IDE hooked to firewire controller.

      2: With their setup, you would probably have greate aggergate bandwith. Fire wire: 400Mb/s (bits) and, we are talking about potentially reading and writing at about 60MB/s (a bit over 400Mb/s eh?), but to really use it as a NAS, you would have to have either gigabit enet, or a 5 or so 100Mb enet ports to that server, and a nice managed switch to hook it all to.

      So, in this case, the real bottleneck is the network to which this is going to be hooked to.

      Also, for thoe 60 drives and software RAID, you're going to have a helluva lot of CPU and memory. Lets start with a quad Xeon for that sucker, probably 4GB memory too.

    4. Re:Why not firewire? by weave · · Score: 2
      Don't hold your breath thinking about petabytes.

      Thanks for correcting my stupid math error. I have no idea why that warrants a Flamebait rating. :-(

      Also, RAID isn't for people who make stupid mistakes. Sorry about your 'rm' debacle.

      I know. Bingo. That's why one still has to have a decent backup system in any environment. Users (even administrators) make stupid mistakes.

    5. Re:Why not firewire? by weave · · Score: 2
      2: With their setup, you would probably have greate aggergate bandwith. Fire wire: 400Mb/s (bits) and, we are talking about potentially reading and writing at about 60MB/s (a bit over 400Mb/s eh?), but to really use it as a NAS, you would have to have either gigabit enet, or a 5 or so 100Mb enet ports to that server, and a nice managed switch to hook it all to.

      This idea is for a home, not corporate environment. You'd only have a handful of clients, and probably never more than one reading from the mess at once. Also, it's mainly for a digital data store for movies and mp3s and pr0n. That data is only accessed mainly one at a time. Can it despool fast enough to drive your vid player?

      As for backup, I was talking about rsync, not software mirroring. Adding data to this jungle would not be done at high volumes. Move a DVD or two during the day and download your 3GB newsgroup limit and you're talking about 10 gigs to go to the primary unit and then 10 gigs over to the backup unit overnight or something.

  41. Sorry, I just couldn't resist... by jonr · · Score: 2, Funny

    1 Promise 6 channel PCI ide raid controller, 99$US.
    12 Maxtor 160gb ata133, 270$ each
    1920gb of Pr0n and other goodies, priceless!

  42. Linux raid reference by jgercken · · Score: 1

    A little dated but still contains usefull insight.
    Linux IDE-RAID Notes

    --
    Never ascribe to malice what can be adequately attributed to ignorance. -Napoleon
  43. Uses for a terabyte by mikera · · Score: 1

    I can think of plenty of innovations that might start using up your terabyte rather fast.

    One interesting idea is a transactional file system which logs all file operations. Great advantage of this would be that you could "roll back" to previous versions of files if you needed them. Couple that with some redundant error-correcting storage and your days of losing files will be a thing of the past.

    Another great use is for indexes. Indexes generally speed up computing operations, at the cost of significantly increasing storage needs. Chuck some more dish space, and you could effectively have a Google on your own hard drive.... no more time consuming searches, just enter "1999 tax avoidance scheme" and get your hits in seconds.

    p.s. I guess it should be "indices" for all the grammar nazis out there, but I actually think "indexes" works better here, as it is a technical term.....

    1. Re:Uses for a terabyte by terrabit · · Score: 1

      You could run Netware on your file server, which allows you to do this.

  44. Re:Great! Where's the backup solution? by Paul+Johnson · · Score: 5, Insightful
    Absolutely. And to those who say "Just build another one" / "RAID doesn't need backup", I have only one thing to say:

    FIRE!

    Any serious data store needs to include a backup system which allows for copies off-site. Fire is the obvious risk of course, but floods, vandalism and lightning strikes are all possibilities.

    AFAIK the only generally available tape backup for something this big is DLT, which IIRC can now do around 40GB per tape before compression. With the 2:1 compression usually quoted thats 80GB per tape, or around 13-14 tapes for a full backup. So you really need about 30 tapes for a double cycle, and maybe more if lots of the data is non-compressible (like movies). But this stuff ain't cheap. DLT drives start at around £1000 and the tapes cost £55 each. So thats around £2500 = $4200 to back this beastie up.

    Having said that, the possibility of using hot-swappable IDE drives as backup devices is intriguing. Just point your backup program at /dev/hdx3 or whatever. One big advantage is that if your tape drive gets cooked in the server-room fire you don't have the risk of tapes that can only be read on the drive that wrote them. A Seagate 5400RPM 60GB drive costs £110, which is only a third more per megabyte than a bare DLT tape. Two cycles-worth of backup (34 drives) would be £3,700. And you can probably do better by shopping around. For servers with only a few hundred GB on line this might well be more cost-effective than buying a DLT drive.

    We use Amanda to do backups here. Its a useful program, but it can't back up a partition bigger than a tape. So you need to think carefully about your partition strategy. (Side note: you can use tar rather than dump to break up over-large partitions, but its still a pain).

    Suddenly that terabyte starts looking a bit more expensive.

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
  45. Re:Great! Where's the backup solution? by ViceClown · · Score: 2

    Amen to that. I was looking for a backup solution for my 60 gig server a few weeks ago. Know what the most cost effective solution turned out to be??? Another damn harddrive!

    --
    Have a Happy.
  46. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0

    As already stated you could mirror the whole system with another array, but also how much do you actually need to backup. If you are putting your DVD collection on it then you already have permanent backups in the form of the original DVDs.

  47. Oops! by Paul+Johnson · · Score: 4, Interesting
    Sorry, I just noticed a thinko in the discussion of IDE drive costs. The DLT costing assumed 2:1 compression. The disk cost didn't. Assuming compression we can squash 120GB onto a 60GB drive, requiring only 9 drives for a full backup, and 20 drives overall (a couple of spares is always a good idea). Thats £2200 for IDE backup, which is actually cheaper than the DLT solution.

    Does anyone out there actually use IDE drives like this? It seems a pretty obvious thing to do.

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
    1. Re:Oops! by cuyler · · Score: 3, Informative

      If your serious about backing up that much data you could also use a 9840 drive which holds 20gb uncompressed and (they say) 80 gb compressed however in my experience you can get 140gb onto a tape. Also, it'll write faster (when backing pu a terabyte having the backup take 32 hours is not a good idea). The 9840B drives write at up to 50gb/hour but usually run closer to 30-35gb/hour. While DLT drives usually write at about 5gb/hour.

      I haven't tested it out but StorageTek has a drive called the 9940 which has tapes that hold 60gb uncompressed (likely 200+gb compressed), it writes faster (10mb/sec ~= 55gb/hour). Also, the drive itself will put you out $33.5k with the tapes being a couple hundred a piece.

      In this case, it'd probably be better just to have a second 1tb raid - then again tapes are much more stable.

      -Cuyler

    2. Re:Oops! by Anonymous Coward · · Score: 0

      We're working on it right now. Instead of upgrading our 50-tape DLT7000 changer (~2 TB), we're going to get 5-7 2TB IDE servers and use those for daily backups. Offsite backups will still use the DLT drives, but normal daily ("I deleted this file!" or "my hard drive died!") backups will go straight to disk.

      We aren't too worried about fire and so forth because the backup server is in a different building then 95% of the data that it's backing up. We'll still ship data offsite into a tape vault every couple weeks, plus we'll probably use some of the money that we'll save (have you priced ~10TB tape libraries?) on adding a small changer in an isolated office for "live" offsite backup of critical data. So, basically going from DLT as our primary backup medium to IDE RAID5 *shouldn't* cost us anything in terms of real backup safety, and it'll be a huge boost in speed, access time, and affordability.

    3. Re:Oops! by Confessed+Geek · · Score: 1

      I'm also looking at doing backups to IDE RAID rather than tape (keeping backup server off site) but I've had trouble finding Good GPL software that has options for using HD's as your storage medium. Can you suggest anything?

      Thanks!

  48. Tape is better for backup... by morzel · · Score: 3, Informative
    ... for the simple reason that the mechanism (eg: DLT-drive) and datacarrier (eg: Tape) are separated. IDE disks have both in one sealed package, which makes it terribly difficult to get to your data if your stepper motor borks.
    With tapes, you just get a new drive.

    --
    Okay... I'll do the stupid things first, then you shy people follow.
    [Zappa]
    1. Re:Tape is better for backup... by Anonymous Coward · · Score: 0

      > IDE disks have both in one sealed package, which makes it terribly difficult to get to your data if your stepper motor borks.

      How much does it cost to get that stepper motor replaced (or do other such repairs so that the drive is mechanically fixed & you can use it again)?..

    2. Re:Tape is better for backup... by realdpk · · Score: 2

      I've heard horror stories about tape heads being differently aligned, causing any tapes written to with older drives useless. But then again, I've also seen IDE drives go bad in an array such as one of these (this one is 16x100GB, adequately cooled).

      Given that, I prefer using a two-layered backup approach - disk-based as the front-end, for high speed backups and restores, and then back the disk-based solution on to tape. Btw, when doing this, I don't think compression will count at all since the backups are already compressed. Is that right?

    3. Re:Tape is better for backup... by Anonymous Coward · · Score: 0

      Fixing is generally not possible. Data recovery can cost $2k-$3k for a drive with no guarantee of anything useful coming back.

    4. Re:Tape is better for backup... by jpostel · · Score: 2

      On the compression question:

      Some (all new) DLT drives use optional hardware compression. This is important because regardless of how much compression is achieved on disk or through software, the DLT drive might compress it more.

      The compression on hard drives is done in software since it would be done by the backup program or the OS.

      --
      Ummm, Jon, aren't you supposed to be dead...? - Otter(3800)
  49. DNA Sequence by zmokhtar · · Score: 2, Informative

    FYI, the DNA sequence isn't that big. The National Human Genome Research Institute has their 90% complete draft burned on a single CD.

    --
    Why aren't we told when editors moderate our posts?
    1. Re:DNA Sequence by Anonymous Coward · · Score: 0

      That is not the DNA sequences that are important to have stored. If you really want to work on a genome project you have to store the source tree, not the fasta consensus sequence. A fasta sequence on CD has been stripped of all the information that makes the sequence usable to people assembling the genomes.

      Similar to software where a binary provides functionality for most people, but if you want to work on the program you need the source.

    2. Re:DNA Sequence by Anonymous Coward · · Score: 0

      Well, you need that if you're _assembling_ the sequence, sure. But if you just need the consensus sequence to analyze in conjunction with, oh, say, several hundred arrays of time-indexed gene expression data and some transcription factor binding data, then a terabyte is overkill. You'd be good to go with just a few dozen gigs.

  50. Re:Great! Where's the backup solution? by phloon · · Score: 1

    "Absolutely. And to those who say "Just build another one" / "RAID doesn't need backup", I have only one thing to say:
    FIRE!"

    Oh sure.... your house is burning down and you're there thinking "Oh no... my P0RN! I should've gone co-lo!"

  51. What about MTBF? by FirstNoel · · Score: 1

    The more drives you have the more likely the failure?

    Sean D.

    --
    "Hmm. I am to metaphor cheese as metaphor cheese is to transitive verb crackers!"
    1. Re:What about MTBF? by Anonymous Coward · · Score: 0

      no, stupid, the more INDEPENDENT systems that need to fail at the SAME time to result in failure-q you have, the less likely that you result in failure-q. (Although, of course, you're twice as likely to result in failure of one or the other over given period x, which is what you grabbed on to. This simple failure isn't bad.)

      i'm not the orig. poster.

  52. Re:Great! Where's the backup solution? by jd142 · · Score: 2

    True. But one thing I haven't seen yet is the fact that most backups aren't full backups. You do a full backup maybe one a month or once a year. Every other backup is a diff only. So while the initial backup may take several tapes, the nightly backups shouldn't. At least on the type of system where the data is basically the same from day to day, which was the point of the article.

    Plus, as described in the article, where the point was to have a singe hard drive based storage for dvd's and cd's, if there was a drive failure, you could just take the original media and do the rip again. Annoying yes, but doable. You haven't lost data unless the fire burned down your house and melted the cd's at the same time it took out your storage. That's why companies buy fire safes and use off-site storage.

  53. $5k is too much, how about half that cost by rzbx · · Score: 0

    You can do it for a lot cheaper than $5k. About 8 hard drives at 160 GB will only cost about $2,100; that makes a little over a TB of storage. Four drives can be put into one computer, so you need two basic computers, no need for sound, video, periphials, or any other extras. They will only be for file storage so you can easily get those two computers for less then $300 a piece. Just hook up the video so that you can install the needed software. So the price of 1 TB storage should only cost about
    $2,700 wow, almost twice as less. If you have more to spare add another four drives and one more computer and you have almost 2 TB for only about $4,140

    --
    Question everything.
    1. Re:$5k is too much, how about half that cost by MjDascombe · · Score: 1

      twice as less?

  54. We need this and more by mintoman · · Score: 1

    Picture this; your home PC's no long have disk drives. In fact, any device in your house that has a need for data storage (like your VCR/DVD/CD/Game console/Toaster, etc...) has no local storage.

    They all connect to this mammoth central storage unit stuck in a closet or down in your basement using either a wireless device or some sort of networking that is built into every home.

    You get on the 'net using your PC console and order a video/cd/game and it miracously shows up on your home storage device ready for use by whatever device is best suited to using it (i.e. use your stereo to listen to music, not your computer and use your game console to play games, not your computer). You use this "product" as if you own (maybe rent it?).

    The point being you will need massive storage to pull this off - and I do believe this scenario will play out.

  55. thousand hours of video? by peter303 · · Score: 3, Interesting

    Video is the most bulky storage people would save. How much would people want to save for re-viewing? First you have the time-shifting stuff like TiVo/Replay- perhaps a few tens of hours at most. Then you would be your favorite movies and TV series. As video-phone improves you might be saving some hours of friends and relatives video conversations. With infinite storage, the constraint becomes need and time to view all that stuff. And you'll probably be wanting to spend your time looking at new stuff. So I'd guess most people's real needs would be hundreds to a thousand hours. At 1-2 BG per hour, your talking about a terabyte or two.

    I don't include the argument that you'd have trouble finding old stuff. Computer software is more clever at organizing things - far better than material storage. A good recent example of this is Apple's "iPhoto" that much more convenient for organizing thousands of photos than physical albums.

    1. Re:thousand hours of video? by gorilla · · Score: 2

      Depends a lot on what your 'favourite series' is. There are several which have been on for 30 years or more, and amassed thousands of hours of footage or more.

    2. Re:thousand hours of video? by Anonymous Coward · · Score: 0

      I suppose if you REALLY wanted to use up space, you could create a Tivo/Replay-like system that records a multitude of channels... maybe, say 200! also, storing HDTV movies will require much more than said 1-2GB per hour... there will always be a need for more storage space.

  56. Re:Great! Where's the backup solution? by smagoun · · Score: 1

    To back up the parent, fire DOES happen. Once upon a time I worked at a small software company in Waltham. I got a phone call one Sat. morning:

    "Hey, did you hear about the fire?"
    "No, what fire?"
    "The office building burned down last night"

    It was a shock, to say the least. Sure enough, a squatter had been evicted from the building, so he got revenge by torching the place. We were one of about a dozen businesses - including the local newspaper and an Armed Forces recruiting center - that was in the building. Almost nothing survived.

    We had tape backups of all our servers. Guess what - tapes are made of plastic. Plastic melts. We found a few tapes after the fire was put out, but they're not so useful after they've been torched.

    Like it or not, fire happens.

  57. Marvelous... by Anonymous Coward · · Score: 0
    I was going to wait until I got home to check IE6, but I'm so happy, I'm going to upgrade now.

    Thanks mate.

  58. i just wish by mrsbrisby · · Score: 1

    ...this kind of thing was easier. i mean it's great and all: using cheap (as in cost) components, some intellegent construction teams, etc, but in alot of places (even towns that are SMALLER) this is still impossible.

    where I live, I can't run new cables everywhere. I can't even purchase the right to do this, and even if I can get the zoning permits I STILL can't do this.
    [and not can as in a question of ability, but can as in the whiny i-don-t-want-to-face-the-consequences]

    this is because we have a town charter that specifically keeps us technologically backwards (we didn't start getting streetlights until there was the closest this town has seen to a riot since the civil war) - our own telco office continue to use the same wiring that was installed in the 1950's.

    Our local cable company was only allowed to "service" existing cable- and they've been VERY VERY Slowly "servicing it" with fibre. I'm 58 kilofeet from the CO (about 12 more from the cable company) and I still can't get a cable modem [i wouldn't want to - but more on that another time]

    an interesting project that happened recently: called network maryland was bid-won to Level 3 networking (gloriously known as crap thanks to the business practices of most of their clients) - they ran fibre through most of maryland and stopped less than 30 miles from my town -- and have no intention of continuing (btw: i'm still quite a ways from the shoreline).

    i can't tap into that because it's their bandwidth, and i can't purchase it from them because they're not selling (and don't have to - it's already sold to the government) - which as you'll remember won't let me run my own wires because of stupid charters.

    now if i wanted to run cross-lada cables (outside of town boundaries) i COULD do that- but it wouldn't do me any good because i couldn't bring a line INTO town.

    so i'm stuck with a maximum bandwidth purchase of "only a T1 at a time" (and our little cove will run out soon -- there isn't even a full DS3 running into this entire area) from telco, or I can purchase some of the cable company (who actually has less bandwidth than my company) -
    or
    or

    or nothing. we've tried to convince our town "hall" of doing something like this (gigabit ethernet), and we've tried offering to pay for it.

    i suppose we could move, but that's a lot of hassle too -- and we'd be giving up our local business.

    or we could get some kind of FEDERAL responsibility -- get the whole fucking nation up.
    [at this point: you should realize that i, like many of my peers like to pretend that only the US matters... infact: i actually hate it here, but that should only be so apparent]

    so what do we do? how can we sell "this" to our town? how do we get our buerocratic slugs to take something like this.

    more importantly, how was this sold in NZ? and how was it sold in other places it was used?

    1. Re:i just wish by Anonymous Coward · · Score: 0

      > where I live, I can't run new cables everywhere. I can't even purchase the right to do this, and even if I can get the zoning permits I STILL can't do this.

      What does this OFFTOPIC post have to do with large storage arrays?..

    2. Re:i just wish by mrsbrisby · · Score: 1

      it appears to have been posted to the wrong topic.

      it should've gone to the NZ ethernet.

  59. Home File Server Appliance by rlp · · Score: 2

    This would be great for a home file server. Many new homes are being built pre-wired with CAT5 (alas not my old house). Just add a big file server in the basement. With proper wiring, it can act as an answering machine / PBX, personal video recorder, music (MP3) repository, mail server, file server, etc. With RAID, you have less worries about a drive crash wiping you out (though you'll need a disaster recovery plan - flooded basements would be real bad). I've always wanted to do this! Main stumbling block is getting CAT5 wiring from the second floor (where my computers reside) to the basement.

    --
    [Insert pithy quote here]
    1. Re:Home File Server Appliance by zitsky · · Score: 2, Interesting

      I had a similar problem when I bought a house last year. I had a converted garage that I wired for ethernet, and even ran ethernet into the basement. However, I didn't want to install ethernet jacks in the house, as it's about 100 years old, and I didn't want the hassle.

      I settled on using 802.11b wireless to communicate between the house and the office. I know all about the security problems (my address is....) but maybe the newer 802.11g or 802.11a might work for you.

      I have some workbenches in the basement that are about 4-5 feet off the floor. I'm going to install a file server and leave it on one of these benches.

      It's cold and damp down there in the winter. I don't know how well the equipment will take to the humidity. I guess I'll find out!

    2. Re:Home File Server Appliance by Anonymous Coward · · Score: 0

      We run a printer RIP server in a room that is humidified to about 60-70% humidity. Thing is, its' also very warm in there (80 Degreees average), and we have not had any failure. The equipment seems to like it just fine. Infact, there is alot less buildup of dust and crap there, then there is in the rest of the buidling.

      If it freezes in your garage, you may have a problem (dew settles at low temperatures, you know.)

      I'd highly advise you to use real ethernet though, you're going to hate that 10Mbit (that usually operates at about 2-5Mbit). Use a strong coated cable (coated metal cloths lines work nicely for this) to string the ethernet between your house and the garage. Tie the cat5 to the cable, and drill into the house. No need to use panels in the house. Iv'e done this with my place (150! years old!), and networked 3 of my neighbors houses. If you still feel the need for 802.11 (which is nice enough for terminal work, not for huge file transfers).

      Also, use IPsec.

  60. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0


    I once heard a rumor that sony created a tape backup system using Beta tapes that held 100+ terabytes per tape.

    This was in the late 80's early 90's and at the time was just to silly a number so the technology was abondoned.

    drawback: read times

  61. Use "MPEG": people differ by 0.1% by peter303 · · Score: 2, Interesting

    At the most people's genomes differ by 0.1% from each other - much less than that if you are relatives. Therefore you'd record the differences, sort of like several of mpeg algorithms.

    1. Re:Use "MPEG": people differ by 0.1% by Anonymous Coward · · Score: 0

      Yeah and if you maximize the lossy compression you should be able to fit it on a floppy.

  62. Ouch! $160GB disks! by jandrese · · Score: 5, Insightful

    Ironically, I just built something very similar to this a few weeks ago (it runs great BTW), but I spent <$1500US on all the components. The biggest thing you have to watch out for is the Hard Drives. I went for the ones with the best bang/buck ratio at the time (Maxtor 80GB 5400RPM drives). This let me build a system with well over 1/2 a Terabyte of usable space at a fraction of the cost. Additionally, the slower drives require less power and less cooling, making them easier to fit in a standard full tower case with a merely beefy (as opposed to server-class) power supply. I think the processor requirements he stated were a little overboard as well. I've found that disk access tends to be limited by the PCI bus (it doesn't help that I used an older motherboard with 33 Mhz 32bit PCI), especially on writes where you can spread data across the write cache on the drives. Be careful when you build an array like this, ATA *hates* having access to both a master and a slave drive at the same time. Be sure to avoid having two disks on the same plex on the same controller. This was natural for me fortunatly, since I was building two plexes, a "backup" and a "media" plex.

    A final word of warning: Promise ATA100 TX2 controllers may look like a natural choice for a server like this, but they only support UDMA on up to 8 drives at once, and Promise's tech support only supports a maximum of 1 (one!) of their cards in any system.

    --

    I read the internet for the articles.
  63. Mirror Usenet? by Matey-O · · Score: 5, Funny
    Maybe Mirror Usenet!
    Well, exclude the binaries and I can mirror USENET on my Palm III!
    --
    "Draco dormiens nunquam titillandus."
    1. Re:Mirror Usenet? by Cramer · · Score: 4, Informative

      A full USENET news feed (everything one can find) will exceed 120GB per day. It'll almost fill a DS3. (And we were receiving a "crappy" test feed from UUNet.) So, minus @alt.binaries.*, one could mirror USENET for a few years. With the binaries, it'll hold you for about a week, 2 at the most.

    2. Re:Mirror Usenet? by Matey-O · · Score: 2

      Well, yes. That was me using a little Artistic License. a) A Palm III has _2_kb RAM, and b) it subtly inferred that 'aside from pr0n and the Simpsons, there isn't much happening on USENET.'

      --
      "Draco dormiens nunquam titillandus."
    3. Re:Mirror Usenet? by Anonymous Coward · · Score: 0

      Actually, a full feed is more like 300GB/day. Although volume has decreased slightly since DrinkOrDie got busted. :)

    4. Re:Mirror Usenet? by blurzero · · Score: 1

      A Palm III has 2MB of RAM, not 2KB.

      --

      The secret of success is sincerity. Once you can fake that, you've got it made.
    5. Re:Mirror Usenet? by Matey-O · · Score: 2

      Sorry. The Newton had 2 kb of ram....or was it 16kb? I'm getting too old to remember machines that had too little memory. :)

      --
      "Draco dormiens nunquam titillandus."
  64. Video killed the MP3 star!! by jabber01 · · Score: 1

    I doubt that video will ever become as commonplace as music, for the simple reason that I can listen to music while doing something else, on the computer or in the room/house/proximity.

    With video, I sorta have to pay attention to the moving pictures, and that keeps me from getting other things done.

    --

    The REAL jabber has the user id: 13196
    What you do today will cost you a day of your life

    1. Re:Video killed the MP3 star!! by spudnic · · Score: 1

      I believe the point here is to stream the video to your entertainment system. That's what I have set up. It's very convenient to have everything in one place and not have to worry about it. Using a remote controler connected to the serial port, I can choose from any media on my system and have it piped into my stereo/television.

      "Think outside the box" ;) I know it sounds cheesy, but it's true.

      .

      --
      load "linux",8,1
  65. 3ware is Painful! by purduephotog · · Score: 2

    Actually, I assembled a 600 gig storage device using the afore mentioned 3ware controller.

    First, there were hardware bugs and they recalled the controller

    Second, 3ware dropped the product line, but vendors were still telling me it was available.

    Third, they brought it back, and I had to get a drop ship

    I lost about 3 months on design phase due to this little tidbit.

    Now don't get me wrong, it's working now and seems reliable... but... there's always this nagging suspicion that something is going to go wrong and I'll lose all that data.

    1. Re:3ware is Painful! by GigsVT · · Score: 1

      I had the same experience, with hardware incompatibilities and recalls. I think 3ware has it worked out now though, and I am happy with the end result, even though the project was severly delayed due to 3ware.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  66. Re:Great! Where's the backup solution? by imuffin · · Score: 1

    Is there some way to stripe actual systems? Rather than having four or five drives striped in a single system, could I have four or five systems each with one drive, but mounted as one physical volume? How could that be configured?

    While this would be a little more expensive, it would be much more fault tolerant. Nothing short of the switch breaking or two systems breaking simultaneously could bring down the system.

    I would be afraid to put a system like the one described in the article in use for fear that a power supply or RAM chip would go bad, and all of my data would be inaccessable until I could replace it.

  67. DNA? by dciman · · Score: 1

    Actually DNA sequence files are not all the large. At least not what most scientists actualyl deal with on a regular basis. Most of the time you are just dealing with fairly small 500-900 base pair seqments, since that is what you can get reliable sequence data back on. Of course sometimes it is needed to prob this sequence against others findings, via some search engine such as Blast! or the like. If you are talking about whole genome sequences, most still are not that large. Considering that most the genomes sequenced are of bacteria and archea, this isn't that hard to see. E. Coli for instance has a genome size of roughly 4.6 million bases and this is fairly average for bacteria, at least within the same order of magnitude. Currently there are 74 completed genome sequences listed on NCBI's Genome page. So, while this is a substantial amount of biological data, it doesn't amount to that many megabytes of data with respect to a Terabyte storage system. At least not today.... with more Eukaryotic organism sequences being complete the size will of course jump dramatically.

  68. Re:Need for memory/storage [OT] by teslatug · · Score: 0

    Can ANYONE actually find this quote??? I have never seen any actual evidence of Billy saying this. Gates has denied this, and say what you want about his business practices, but he has always been a smart guy. Surely he would never think that 640K would always suffice.

    Bashing Gates/Microsoft = +5 funny
    Acknowledging the truth = -5 troll

    sounds about right

  69. But it can't fit inside a lego brick!!!!!!!!! by heroine · · Score: 2

    Get rid of it.

  70. cost... by simpl3x · · Score: 1

    everybody likes to complain about how expensive macs are, so apple decided to skip the scsi. my dell laptop has gone through two hard drives and a battery in two years. rarely does a piece of apple hardware fail to perform decently, and powerbook batteries never seem to die (within reason of course). 2c.

  71. RAID ain't "secure" by -brazil- · · Score: 2, Informative
    First, you mean "safe", not "secure". Second,
    extend that to safe-as-long-as-only-one-hd-fails-and-you-never-ev er-make-a-mistake. RAID only gives you high availability, but it is by no means a substitute for backups: when important data is deleted by a virus, or accidentally because of a user mistake, even a RAID 1 will just dutifully mirror the deletion.


    Always remember: data that is not backed up might as well not be there in the first place!

    --

    The illegal we do immediately. The unconstitutional takes a little longer.
    --Henry Kissinger

  72. basement? by simpl3x · · Score: 1

    i prefer them in the rack in the living room.

    1. Re:basement? by jo42 · · Score: 1

      ...with lots of blinky lights!

  73. Drive Failure Notification Problem by jayrtfm · · Score: 2, Interesting

    With Raid5 a single drive can fail without causing dataloss.

    How do you know WHEN a drive has failed?

    With the low end IDE RAID cards your notification comes when the 2nd drive fails......

    3Ware's website describes a SNMP monitoring utility for windows, but didn't specifically mention Linux support. Ditto for Adaptec.

    If the raid is done in software, is there a linux program to monitor and notify when a single drive goes down?

    1. Re:Drive Failure Notification Problem by 4Runner · · Score: 1

      I just wrote a script that emails me if it doesn't see all U in /proc/mdstat md0 : active raid5 hdk[4] hdi[3] hdm[2] hdg[1] hde[0] 300272640 blocks level 5, 256k chunk, algorithm 2 [5/5] [UUUUU]

    2. Re:Drive Failure Notification Problem by sydney094 · · Score: 1

      There is a utility that runs under Linux. It creates a web based monitor that lets you see how each drive is doing individually. It will also email you with every error message, should you desire.

      Its a pretty slick setup, IMO.

      --
      "If we knew what we were doing, it wouldn't be called research." - Einstein
    3. Re:Drive Failure Notification Problem by Anonymous Coward · · Score: 0

      Don't happen to know where I could get this particular app?

  74. Asus A7B266-D Motherboard Specs?? by debiansierra · · Score: 1

    I was curious if there were onboard video on the mother board or AGP or if he was going headless. So STFW for"Asus A7B266-D Motherboard Specs" and narrow further and further realising that "A7B266" is not out there. I head over to Asus's MB Section where I see that that model# seems to look correct but can't find a match. I'm assuming it has integrated n-force. Just thought it was a little odd to not be able to find this board, I'm sure if I searched harder I could find it at something like Pricewatch, but I wonder why it's not featured at Asus's site?

    --
    I would like some milk from the milkman's wife's tits
  75. Re:Great! Where's the backup solution? by Cramer · · Score: 1

    Two things... "off site" and "fire proof safe". Even Interpath wasn't stupid enough to keep all the data in one place. There were three sets of tapes. One set (two months old) were off site (at someone's house in a fire box.) One set (last month's full backups) were in the fire proof safe in the basement. And the current backup set was at my desk in two fire boxes. When the alarm goes off, you grab those two boxes. (At the time, someone was always there.)

  76. A Bit of a tweek by mattyohe · · Score: 1

    instead of the 4 TX100's go with Promises new ATA100 6 channel IDE controller SuperTrak SX6000 which will chop about 300 bucks off your final price..

    --
    - what is the definition of simultanagnosia?! I've been meaning to look it up!
  77. 3ware and external raid devices by FreeUser · · Score: 3, Informative

    please tell me how you get 6 IDE drives on a pc that gives you any performance in a rad function...

    I don't know how he does it, but I have personal experience in doing it two different ways:

    1) 3ware IDE RAID controller, has 1 IDE controller per drive on the card (i.e. 8 ide controllers), which the firmware maps to a RAID Device. Depending on the RAID configuration the drives appear as one large SCSI drive to the system.
    Performance is on par with SCSI.

    2) External IDE-SCSI Raid chassis. Again, 1 IDE controller per hot-swap drive, appearing to the system as one or more big SCSI drives, controlled by a standard SCSI controller. Speed and reliability have surpassed that of a $60,000 SCSI solution sold by Sun I happen to have lying around.

    U160 SCSI drives will give you at least a 70% speed increase and a 80% increase in reliability....

    If I had to store a terebyte of information I'd be an idiot to use consumer level storage (IDE).


    Nonsense, see above. This is simply SCSI bigotry (I know, I was once a SCSI bigot too). What you say is only true if you are using low end cards, with more than one device on each IDE bus, which is untrue for mid- and high-level IDE-SCSI solutions such as 3ware and various external chassis systems. We run our entire enterprise on one, and have done so for well over a year, with much better reliablity and performance than an older, very expensive SCSI solution provided.

    But yes, if people are plugging drives into el cheapo IDE "raid" cards like Promise and the like, or worse, into their onboard IDE controllers (most of which are inexpensive knockoffs anyway) then performance will be very suboptimal, and reliability problems (one device taking down the entire IDE bus, etc.) abound.

    --
    The Future of Human Evolution: Autonomy
  78. Re:Need for memory/storage [OT] by thing12 · · Score: 2, Informative

    It seems to be just an urban legend.

  79. Re:Great! Where's the backup solution? by Jack+Auf · · Score: 1

    AFAIK the only generally available tape backup for something this big is DLT, which IIRC can now do around 40GB per tape before compression.

    Um...AIT. As fast or faster than DLT, same storage volume per media unit as DLT, media is cheaper than DLT, because the media is smaller than DLT libraries/autoloaders tend to be less expensive than their DLT counterparts.

    --
    "They that can give up essential liberty to obtain a little temporary safety deserve neither liberty nor safety" - BF
  80. Nice but a 3ware controller perform better. by Anonymous Coward · · Score: 0

    I think a good hardwar controller, like the 3ware
    escalade will perform better than the 4 promise
    cards in the box. You probably don't need such a
    high powered CPU then. Not that 1 GHz is much
    anymore.

  81. Re:Great! Where's the backup solution? by timbck2 · · Score: 1

    How's this for a twisted scenario -- we built a terabyte server (actually a 0.8 terabyte server) to use as "backup" for CDROMs we send to clients! It's data that becomes useless over time, so it's discarded 6 months to a year in the future.

    --
    Absurdity: A statement or belief manifestly inconsistent with one's own opinion. -- Ambrose Bierce
  82. Simpsons reference by Anonymous Coward · · Score: 0

    As seen on the Simpsons:

    Take the integral of 3d(r^2) (where d is a constant)
    The answer is d(r^3) or rdrr ...get it (ha)rd(ha)r(ha)r

  83. Very simple. by Karoshi · · Score: 1

    Store more sequences.

    60 million sequences for a nice datamining job, and you need a really BIG raid.

    --
    Don't answer me. Moderate. Slashdot is about moderation, not discussion.
  84. Re:Great! Where's the backup solution? by battjt · · Score: 1

    I use rsync between two large filesystems across a 4 block 802.11b link.

    There is still a possibility of an intruder deleting both, but highly unlikely because one of the backups is a laptop that doesn't share resources.

    [I still need a versioning filesystem, like VMS though.]

    If both building are consumed in an explosion, then data recovery will be the least of my worries.

    Joe

    --
    Joe Batt Solid Design
  85. So where is the news? by maunleon · · Score: 1

    Let me see... they are telling us that if we put together enough off-the-shelf hard drives with off-the-shelf raid controller, we get a terrabyte. Maybe it's a math question? I don't see the "innovation."

    I guess it has a "wow" factor, but as far as innovation or news? Nah. If you can show me how I can put together a 1TB raid10 array of 10k SCSI drives for under $5k, now that's interesting!

    Where would you use it? In a business environment? Maybe for file storage, because the IDE drives are useless for database or other heavy work. In the home? If you have a separate room just for the setup, otherwise the hard drive noise and especially the cooling fans will drive you crazy. Anybody who's been around a real drive array knows what types of fans it takes to cool down an array, and how loud they are!

  86. A terabyte is nice, but... by Anonymous Coward · · Score: 0

    ...what's the performance like? Please, don't compare it to PC-class performance; compare it to real terabyte-level storage systems. How many hosts (of how many types) can you connect to it concurrently? What about instant snapshots, transparent replication, LUN masking, path management? Does it do background memory/disk scrubbing to catch errors before they affect anything outside the box? Can it handle online component upgrades? What kind of service and support comes with it?

    There's nothing wrong with a cheap terabyte. Just don't think it's really enough to put EMC/IBM/Hitachi/etc. out of business.

  87. An alternate ultrafast solution -- 1.28TB @ $6000 by gerry,Hacker+wannabe · · Score: 1

    How about the following config? 1.28TB @ $6000

    Asus A7M266-D Dual Athlon MB ($.3k)
    AMD MP *2 ($.4k)
    2GB RAM ($.8k)
    Adaptec 29160LP U160 SCSI card ($.3k)
    Promise UltraTrak TX8 ($1.7k)
    8* Maxtor Diamondmax 160GB drive ($0.3 each)
    Gigabit Ethernet Network card ($0.1K)
    Others (.5k)
    Backup (.3k)

    1.28 TB with hotswap etc with respectable performance.

  88. They overpay by AviN · · Score: 1

    8 160GB drives (1120GB of storage) - $1900

    Motherboard with Duron 800MHz CPU and onboard 100BaseTX ethernet - $100

    2 512MB SDRAM - $150

    2 Promise FastTrak/66 IDE RAID controllers $110

    Full tower ATX case - $110

    Total price: $2370

    (Prices are from stores on www.pricewatch.com, rounded up to include shipping.)

  89. Cheaper and more redundant if you use 2 .5 tb serv by HiyaPower · · Score: 2

    You can stuff 8 60 gb disks into an antec server case. With a pair of 1600 XP processors, the total cost is 2 promise cards = $50, 8 drives = $720,
    2 xp processors = $220, mobo = $220, memory = $200,
    case = $150, total is about $1500 for .5 tb and $3000 for the full tb. Further, you have a bit more
    i/o bandwidth with 6 ide controllers, and 2 pci busses than with the single. Also when one of them craps out, the other is still going in all probability. Going to 80 mb drives gives you about the same cost per gb of drive space and lets you put .6 tb into a case. When you are paying for floor space and cooling, the 160 gb drives make sense, but when you are tunning these in your basement, going for two boxes makes it a cheaper and more robust solution.

  90. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0

    So you have a backup tape. If everthing was torched in a fire where are you going to load that tape to???

    I really hate when people jump all over the backup to tape and not really look at offsite locations to load the tape information. This is true backup for businesses and guess what it is not cheap.

  91. Re:Great! Where's the backup solution? by AviN · · Score: 1

    Also, RAID isn't fault tolerant to human or software error (rm -rf /).

  92. Even cheaper by Restil · · Score: 2

    With 120 gig drives, your total cost for a 1 TB array would be about $2500. With 4 IDE ports and a large enough case, you could get all that into one box, then network the beastie.

    Now I just need to find $2500. I know I won't have a problem filling it.

    -Restil

    --
    Play with my webcams and lights here
  93. Re:Great! Where's the backup solution? by Nugget · · Score: 2

    For those that choose to go the "fire proof box" route, please be careful that you buy a unit that's certified to protect media. A fireproof box that will protect papers from catching fire isn't necessarily sufficient to keep tapes and disks from being destroyed by the heat. Make sure you buy one that's appropriate for your intended contents.

  94. Re:Need for memory/storage [OT] by Anonymous Coward · · Score: 0

    Easy way to turn facts into falsehoods: just put whatever you don't want people believing on an urban legends site.

  95. gawd by kin_korn_karn · · Score: 4, Funny

    using a tb array for anime is like having one of your turds bronzed.

  96. Re:Need for memory/storage [OT] by Anonymous Coward · · Score: 0

    Stop whining!

  97. There's a reason they called it "Terraserver" by jefp · · Score: 3, Interesting

    I've wanted a terabyte of storage since the mid-1970s, when I realized that there were approximately a trillion square meters on the Earth's surface. Store one byte of grayscale image for each square meter and that's a terabyte of data right there.

    Of course these days I'd want 3TB so I could store color images.

    1. Re:There's a reason they called it "Terraserver" by chriss · · Score: 1

      You realize that inbetween the mid-1970s and today image compression happened and the oceans compress very well?

      Chriss

  98. Re:Great! Where's the backup solution? by jandrese · · Score: 2
    You've just stumbled across one of the main concepts behind the Storage Area Network. The biggest problem you have is bandwidth. Your average local disk bus (ATA100, or LVD SCSI3) blows away Fast Ethernet, and with RAID3 or RAID5 you need to access multiple machines to do a single write (write the actual data and write the parity data).

    The other problems with your scheme are:
    1. Cost, That's a lot of machines to buy for a single storage array
    2. Admin time. Upgrading the OS for a bug fix is a much bigger pain when you have 8 machines instead of 1
    3. Space. You're building a rack for these machines and they're eating up at least 8U of space, probably more if you want to keep the cost down to Earth
    4. Software. All SAN solutions I know about are proprietary. Nobody builds RAID code that runs over a network.

    It's not a bad idea, but certainly not something that can be done for $5k. I'd think there must be a breakpoint somewhere where it makes sense to build stuff in multiple machines (instead of cramming tons of disks into a single machine), but I think it's not at 1 disk/machine.

    How much uptime you need is purely dependant on you. Since my array is for personal use, I don't mind a bit of downtime when a component fails (since I'm working on the problem myself anyway, it's not like I'd get much use out of it when it was partially down anyway!). If you really really need multi-9 uptime, $5k IDE storage solutions really aren't the way to go.
    --

    I read the internet for the articles.
  99. Re:Need for memory/storage [OT] by thing12 · · Score: 1

    But can you find any source citing him saying that? I've believed it to be true for a long time too, but there's no proof of it anywhere. I want to believe it, but you can't say it's a fact without proof to back it up.

  100. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0

    Linear Tape Open tapes do 100GB uncompressed right now, and the roadmap has them going up to 1.6 TB by generation 4. SuperDLT is also out there.

  101. Re:Great! Where's the backup solution? by ellem · · Score: 1

    Two words...
    9
    11

    gotta CoLo a terrabyte

    --
    This .sig is fake but accurate.
  102. A disk array is not a data warehouse by monkey99 · · Score: 1

    Not trying to be a database snob, but anyone can build a piece of hardware. I takes a lot of diligent work and planning to implement a data warehose

    1. Re:A disk array is not a data warehouse by Anonymous Coward · · Score: 0

      data warehose

      Dude! I love that one. She sucks off all the guys in the FBI's Data Warehouse and at the end does herself with a USB Hub and nail gun...

    2. Re:A disk array is not a data warehouse by Anonymous Coward · · Score: 0

      I think you meant data whorehouse.

  103. Moore's Law and bioinformatics by alispguru · · Score: 2

    Remember, most of the breathless prose about the huge, enormous, gigantic, [favorite-bigness-adjective] amount of information in DNA was written years ago, by biologists. Moore's law has been in effect for some time since then, and the human genome hasn't gotten any bigger in the meantime.

    --

    To a Lisp hacker, XML is S-expressions in drag.
    1. Re:Moore's Law and bioinformatics by Marcus+Brody · · Score: 3, Funny

      Moore's law has been in effect for some time since then, and the human genome hasn't gotten any bigger in the meantime.

      In fact, the EMBL database (all known DNA + protein sequences) nearly tripled in size within the 11 months of Nov. 1999 - Aug. 2000 [Stoesser, 2001]. Shake your Moore's law at that figure, matey.

    2. Re:Moore's Law and bioinformatics by alispguru · · Score: 2

      Very impressive!

      But, will it continue to grow at that rate? Which will peter out first, computer power or bio data accumulation? Moore's Law has worked for 35 years so far, and Moore himself thinks we've got at least another 15 years from now.

      My money is still on the computers.

      --

      To a Lisp hacker, XML is S-expressions in drag.
    3. Re:Moore's Law and bioinformatics by Marcus+Brody · · Score: 2

      I do believe that the DNA/Protein databases have been growing exponentially for around 20 years, since the Swiss Institute of Bioinformatics started the SWISS-PROT protein database.

      I must admit that the figure I gave was bloated due to the last push to get the human genome, and has probably settled down a little since then. But anyhow, biology/CS have been pretty head-to-head on rate of acceleration for some time, and I cant see any decent reason why either field will slow down for some time yet.

  104. DNA seuqneces aren't that big by pclminion · · Score: 2
    I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences").

    Unless Taco is storing DNA sequences from aliens, I don't know what he's talking about. I downloaded the human genome project last year and if I remember correctly it was definitely under a gigabyte.

  105. Re:Great! Where's the backup solution? by jtosburn · · Score: 1

    I totally agree. In fact, I've just been researching backup solutions. A summary my findings so far:

    DDS-4 - 5 tape changer, 20 GB / tape = 100 GB for $2500. 9+ hours to write 100 GB. Tapes are $10 each.

    DLT - 40 GB /tape, $1400, 3:45 to write. Tapes are $55 each.

    Super DLT - 110 GB / tape, $4700, 2:46 to write. Tapes are $115.

    LTO (Ultrium) - 100 GB / tape, $3500, 1:51 to write. Tapes are $100.

    All sizes are native, uncompressed, and times given assume no compression, so if the data set is compressed 2:1, then capcity doubles, as does throughput, and write time doesn't change.

  106. The price... by ellem · · Score: 3, Funny

    1 Terrabyte solution - $2500

    All the pr0n you could ever watch - $1,000,000

    The look on your Mom's face when she clicks on AsianDogAssRape10.mpg - Priceless

    --
    This .sig is fake but accurate.
    1. Re:The price... by marktwain · · Score: 1

      A SETI@home dude. Omigosh. Thanks for brightening my day! (And keep up the good work!)

  107. Re:Great! Where's the backup solution? by Amazing+Quantum+Man · · Score: 2

    [I still need a versioning filesystem, like VMS though.]

    I hate to say it, but SCO (yes, SCO) had a versioning filesystem in OSR5. HTFS (High Througput File System) had versioning support.

    --
    Fascism starts when the efficiency of the government becomes more important than the rights of the people.
  108. I could rip my entire anime collection for instant by Anonymous Coward · · Score: 0

    Yea cause we all know only really cool guys like anime.
    Honestly if anime is not the gayest of gayest cartoons I don't know what is.

  109. Re:Great! Where's the backup solution? by Amazing+Quantum+Man · · Score: 2

    You've just stumbled across one of the main concepts behind the Storage Area Network [snia.org]. The biggest problem you have is bandwidth.

    Dude, that's why most SANs are made out of Fibre Channel. FC is a 1GB transport that has a SCSI protocol on top (FCP-SCSI). 2GB FibreChannel is available, and work is currently under way on 10GB. In addition, FC is full duplex.

    --
    Fascism starts when the efficiency of the government becomes more important than the rights of the people.
  110. DNA sequencing by Cyno · · Score: 1

    Yeah, it sucks. I'm gonna have to wait until 2005 before I can map my family's DNA and search for anomolies.

  111. Re:Great! Where's the backup solution? by Amazing+Quantum+Man · · Score: 1

    Shit. I forgot to put the afteer the first "Fibre Channel". Next time I'll remember to use the "preview" button!

    --
    Fascism starts when the efficiency of the government becomes more important than the rights of the people.
  112. I see a flaw here... by Anonymous Coward · · Score: 2, Interesting

    We (my company) designed a very similar system using a Tyan Tiger200 with dual GHz Cel's etc. The problem is that the drives he lists (the 160GB Maxtors) aren't addressable by the RAID controller he is using (the Promise TX). The Promise card will only address up to 127GB per drive. You have to use a ATA-133 spec controller to get the full capacity out of those drives. We did an array using the TX and WD 120GB 7200RPM drives (with 8MB cache - mmmmmmmmm.....) that flat smokes anything that you can put together with the Maxtor drives. Oh well....

  113. Usenet ? by _Spirit · · Score: 2, Insightful

    I think Usenet is underestimated here. I remember reading on the site of one of the larger ISPs, specialized in good usenet access (ie. 30000+ groups & week+ retention even on binaries groups) that they have significantly more than 1 TB of storage space (don't remember how much, but several TB). So mirroring Usenet might be a tight fit.

    --

    beauty is only a light switch away

  114. Wrong by Sir+Homer · · Score: 1

    Thats our way to read DNA. The human body can read and start duplicating a entire strand of DNA almost instantanusly. That will beat the hell out of your 15000RPM SCSI drive :).

    1. Re:Wrong by netsharc · · Score: 1

      Imagine that, a new P2P sytem.. the RIAA would start trying to prohibit sex and it'll be the end of humanity as we know it. :)

      --
      What time is it/will be over there? Check with my iPhone app!
    2. Re:Wrong by Aexia · · Score: 1

      Hey baby, I've got some MP3s I'd *love* to upload to you.

  115. Re:Great! Where's the backup solution? by vovin · · Score: 1

    Yeah, when you spend 250,000 for a TB of disk your backup problems magically disappear!

    Seriously I complied a list of reasons to backup a RAID storage solution:
    -- Accidental deletes and over-writes.
    -- Total catastrophic recovery. (Fire/Flood/War, etc).

    So it *may* not be the solution for critical data that needs to be shadowed off-site, nor the only location for difficult to reproduce data.

  116. Re:Nothing special - bullsh*t by Anonymous Coward · · Score: 0

    The IDE disk arrays discussed by other posters use a single controller per disk, so the arguments about multi-disk controller screwups don't apply unless you've built a bad IDE array.

    I could build a bad SCSI array if I used wrong design principles.

  117. performance issues by john_uy · · Score: 2, Insightful

    i believe that there is more problem in performance rather than capacity.

    a typical configuration that cheap will use an ide hdd (and to make it cheaper software raid).

    the main problem (for us in this case) is the performance. how do you increase the data transfer? for the past few years, the storage space has increased tremendously but the transfer rate of the drives are out of proportion with the space.

    ide is usually placed in a 33mhz/32bit bus which will give a burst transfer of 133mbyte/sec. that is the max whatever you do. but if you will place a nic card, they will share the bandwidth unless it is placed in a different bus.

    for the interface itself, scsi can handle more i/o operations/sec and fc even more. technologies today can implement raid5 at almost no performance hit.

    so given 1tb of data, definite many people will be accessing it (unless you really plan to use it for your insane storage space). so if people will be able to store much, they can access it at a much slower rate.

    so you won't see the scsi and fc being obsolute even though the serial ata gets through. it will remain in the low end segment of the storage market.

    and besides, if you want to backup your data, the best way is to store it to tape and that will cost big (since mirroring the info in another server will not give you the reliability compared to tape)

    --
    Live your life each day as if it was your last.
  118. *Really* large files? by jblake · · Score: 1

    There is an interesting problem though...do we have the ability (assuming we can hook up however many storage devices we want to one system) to space huge files across them? (DNA, DNA+metadata references, etc...) I wonder when the first terrabyte file will be made. I'd speculate it'd be a [.wad|.mpq|.u|*] file for Uber-QuakenHalflifeTournament XP. ;)

    --
    I just found a new sig.
  119. Re:Nothing special - bullsh*t by ahodgson · · Score: 1

    http://www.3ware.com/

    No, you still can't build really big servers.
    But you can slap 8 160GB drives in a box and drive them all at full speed, or just as close to full speed as SCSI controllers manage with 8 drives on a controller (PCI bus speeds being what they are).

    I've been using 3Ware gear for a year and a half - they work. For any system that requires 8 drives or less, there really is no reason to pay SCSI prices.

  120. Defrag? by eples · · Score: 1


    I wonder how it takes that sucker to Defrag...

    Shoulda used a SCSI Controller...

    --
    I'm a 2000 man.
  121. No RAID 1 of OS by gkbarr · · Score: 1
    If you're going to drop 5 grand into a terrabyte RAID array, why the hell don't you spen an extra $80 and get a second drive to use as a mirror for your OS volume? I mean, duh! C'mon people...

    I pitty the fool!

    --
    Sapere Aude - Homer
  122. Wrong Way to go by joeblowme · · Score: 2, Insightful

    This guy totally went the wrong way for expandability and speed. You can get the Promise SuperTrakSX 6000 for $480 and that has hardware raid 5 and supports 6 drives. I'd throw one of those in with 6 drives to start and take my 800Gig and be happy. That would save me at least a $1000 up front. I wouldn't need 2 of the harddrives, the second processor or so much ram. Plus it would be faster and much more reliable. Then later on I could add another one for about $2500 and have 1.6 TB of space to store my huge collection of pornography... err rather mp3's, software and G-rated dvd movies.

    --

    If your not cheating your not trying. If your not trying your not winning and if your not winning why play?
  123. 2 cheap pcs by Anonymous Coward · · Score: 0

    Buy 2 cheap PCs £500.
    Add 6 extra ide drives £1000
    Add cat5e cable x 2 £4
    Add hub £40
    Add Linux £?
    Your network will easily be filled with data from the disks for a total cost £1544 - much less than $5000 and you get some cdrom and floppy drives to use as cup mats.

  124. L0ts 0 pr0n! by nuetron · · Score: 0, Offtopic

    L0ts 0 pr0n!

  125. Re:Great! Where's the backup solution? by eno2001 · · Score: 1

    Ummm... isn't the whole point of this for use at home? WHY THE HELL would a home user need OFF SITE backup? How many of you guys have off site backup at home? Huh? How many??!! I'm waiting...

    --
    -"...bad old ideas look confusingly fresh when they are packaged as technology" - Jaron Lanier (Digital Maoism on Edge.o
  126. The Antec case is not good for ide raid by gd · · Score: 3, Informative

    I used to build a similar kind of raid system (half a TB) using the Antec case. Their case is nice, but not for the IDE raid. The problem is that the IDE cables need to be within certain length in order to get DMA 5. The case is designed for scsi, which has a longer cable length limit. To hook up all the IDE drive in that case is really a pain in the butt.

    For IDE raid, this case is good except it's a bit expansive:

    http://www.rackmountnet.com/rackmountchassis/rac km ountchassis_4ud.htm

    It can hold up to 16 drives with hot swappable trays. There should be no cable length problem.

    On a side note, I used to plugin 5 Promise Ultra100TX2 cards in one computer. All cards are recognized but only 8 drives are recognized correctly (I plugged in 12 drives altogether). I remember seeing some where (either in linux kernel source or FreeBSD sys source) saying that Promise has a limit of 12 drives per system, with 8 of then in DMA mode, and the rest 4 in PIO mode with some tweak (burst?). So for a big raid like that, an ide raid cards (either 3ware's or high point's) are recommended. Using a hardware raid ide card also has the benefit of being able to hot swap the drives with the case mentioned above.

    --
    gd
    1. Re:The Antec case is not good for ide raid by Anonymous Coward · · Score: 0
      "I used to build a similar kind of raid system (half a TB) using the Antec case. Their case is nice, but not for the IDE raid. The problem is that the IDE cables need to be within certain length in order to get DMA 5."

      What the hell? 36 inches isn't enough for you? That's 3 feet of cable. You can get 36 inch IDE cables for about $5 each.

      Using 36 inch cables has no adverse effects on drive performance, and certainly doesn't cause them to be arbitrated of UDMA5 operation. All these drives are connected to their controllers with 36 inch cables:

      hde: 150136560 sectors (76870 MB) w/1916KiB Cache, CHS=148945/16/63, UDMA(100)
      hdg: 150136560 sectors (76870 MB) w/1916KiB Cache, CHS=148945/16/63, UDMA(100)
      hdi: 150136560 sectors (76870 MB) w/1916KiB Cache, CHS=148945/16/63, UDMA(100)
      hdm: 150136560 sectors (76870 MB) w/1916KiB Cache, CHS=148945/16/63, UDMA(100)
      hdo: 150136560 sectors (76870 MB) w/1916KiB Cache, CHS=148945/16/63, UDMA(100)

  127. Video Opened Up MP3 BitRate to 160 and 190 by vbprgrmr · · Score: 1

    Seeing video being able to go to HDs in decent quality has upped my standard as to whats acceptable for my MP3 Jukebox. Before with the high cost of harddrives, most of my saved songs were ripped with 128. But once I hooked my second PC to my stereo, to act as my sole method of listening to recorded music, I was disappointed by the quality of some songs at 128. So now, with cheaper HD prices, I will have to re-rip my collection at 160 or 192 or maybe even 256.

  128. Re:Great! Where's the backup solution? by bonzoesc · · Score: 1

    I take all my... uh... backup copies of original legal program media to friends' houses in case mine burns down.

  129. Far better solution in my book... by unicorn · · Score: 2

    Would be to replace the 4 controllers, and the monster case. Use a more "standard" chassis. Slap a regular SCSI card in it. And then for the drives themselves, use an UltraTrak100 TX8
    to hold the drives.

    It just seems like a far cleaner solution. Not to mention FAR more expandable. And works out to be about the same price.

    --
    "Politicians are interested in people. Not that this is always a virtue. Fleas are interested in dogs." P.J. O'Rourke
  130. I can do it for $2427 by tjackson · · Score: 0, Redundant
    According to PriceWatch:

    $355 - 3ware Escalade 7810 8-port RAID Controller

    $2072 - 8x Maxtor 160GB IDE Drives ($259 each)

    You could hook these up in a 7+1 RAID5 array, and you'd have a 1.018TB Array.

  131. Do it for half with Pricewatch by mangoless · · Score: 3, Interesting

    Storage solution: 1TB RAID5 storage array (Prices are from Pricewatch) Quantity Price Subtotal Intel Celeron 700 MHz w/ Socket 370 MB, UDMA 100, AGP VIDEO 8~64MB shared only, Sound, 56K AMR Modem, 10/100 Network in MidTower case w/Powersupply 1x$135.00=$135.00 Power Magic PCI IDE U/ATA100 RAID Controller w/Cable 4x$22.00=$88.00 Maxtor 4G160J8 5400/133 8x$259.00=$2,072.00 60.0GB EIDE Ultra DMA 5400 1x$85.00=$85.00 Total: $2,380.00 - Mangoless

    --
    [a mango-free monkey]
    1. Re:Do it for half with Pricewatch by spacefrog · · Score: 1

      Your going to run 9 HDD's off of a cheap-o mid-tower case and power supply? That will be interesting. Wattage aside, how much duct-tape is involved?

      As far as the wattage, can you say, "Snap-Crackle-Pop"?

  132. Re:Great! Where's the backup solution? by checkyoulater · · Score: 1

    How about an IBM Ultrium? 100GB per tape, or 200GB compressed. 10 tapes per backup. And that would be a full backup.

    In my experience, backing up 300GB of JPEG's, it took approximately 3 hours per 100GB. Connected via a Adaptec 160 card.

    --
    Is that a real poncho? I mean, is that a Mexican poncho or is that a Sears poncho?
  133. Re:Great! Where's the backup solution? by dragonfrog · · Score: 1
    Do you know if you always need dedicated machines for a SAN? Or could you, for example, spring for an extra 30 GB of disk space on twenty office/lab machines you were going to buy anyway, and have them run as a (slightly less than)600 GB SAN?

    That way, the powerpoint presentations that your 20 middle managers are making instead of doing real work would be effectively indestructible...

  134. Better performance.. by tcc · · Score: 4, Interesting

    Get a 3ware escalade card in march they'll support 48bits-LBA in the new firmware, you'll be able to hookup those 160GB monsters in raid-0 (or raid-5) with a tenfold increase in performance, without taking up all the PCI slots.

    the TX2 is a nice little card, but you can only use 2 drives per board for getting the "full speed" (else if you use master/secondary, 4 drives will give you the raid speed of 2 in stripe) and then you'd have to stripe your raid-0 drives in software. Instead of wasting PCI slots and using an underperforming card, you pay a couple of bucks more and you get the real thing with full speed and hardware raid5.

    There are a lot of raid benchmarks at storagereview.com as well. IDE raid is so damn cheap.

    --
    --- Metamoderating abusive downgraders since my 300th post.
  135. Poorly researched and unimplemented vapour by Anonymous Coward · · Score: 2, Informative
    What the hell is this article? It's obvious that they never built this thing. For one the Maxtor 160GB drives use the newer ATA133 standard which increases the bits used for addressing from 32 to 48 to overcome the 137GB limit which hinders the existing ATA100 standard. If they want to use the full 160GB of these drives, they should've speced the Promise Ultra133 TX2 controller.

    They also spec'd the motherboard as an "A7B266-D". I'm guessing this is the A7M266-D, as there is no A7B266-D (no one else is even considering manufacturing an SMP Athlon chipset besides the forthcoming Micron Scimitar)

    It seems to me like this is a rather poorly thought out spec. Why are they using 4 FastTrak100 TX2s when they could use 2 FastTrak100 TX4s? Which of course brings up another point, why are they even using FastTraks? Under Linux the FastTrak driver is quite immature, and last time I used it only worked with 2.2 kernels, which hinders tbe ability to use filesystems like XFS. Also, the FastTrak cards are essentially software RAID as they offload the work of calculating the stripe locations onto the host CPU. There's no point in using md to combine multiple FastTrak arrays.

    Many people were mentioning the 3Ware Escalade. It is a relatively good card, but for a home storage array Linux md + XFS might be a better choice. (Also note that the advantages of 64-bit PCI couldn't be had with the A7M266-D as it doesn't include any 64-bit PCI slots. Perhaps the Tyan Tiger would be a better choice for a 3Ware solution) My recommendation would be 3 Promise Ultra133 TX2 controllers. The read and write performance on an Escalade 7410/7810 is appaling. With the embedded processor on the 7450/7850 (R5Fusion Technology, as 3Ware calls it) the performance exceeds that of software RAID, but at the much more expensive price, of course. I think the goal here is bulk storage and not performance, and the ATA133 controllers are by far the cheapest solution.

    For more information on IDE RAID under Linux, check out this site It's information is a bit dated at this point, but I used it for my home storage server and haven't regretted it. With 5 7200RPM drives on Promise Ultra100 controllers and Linux md RAID-5 w\ XFS, my bonnie++ scores are 90/30MBs for sequential read and write, respectively. I couldn't be happier. This site also has benchmarks showing the superior performance of software RAID over a hardware solution with a 3Ware card.

    And there were a few other things people seemed confused about. No one in their right mind would put more than one drive per channel for the purposes of a performance RAID. That's just foolish. As for the limitation of being unable to access both the primary and secondary IDE channels simultaneously, this limitation was removed years ago with the introduction of EIDE.

    In as far as everything else goes, I'm a SCSI bigot. I have SCSI drives in my workstations and I couldn't be happier. However, IDE RAID is a very economical solution for a home user, often with performance on par with that of more expensive SCSI RAID solutions.

    To conclude, this article seems very poorly researched and documented. Had they actually attempted to build this beast and failed, perhaps I would've been more amused. However, as stands it's an overpriced specification which uses incompatible parts, and little research has been done on the optimum parts for the configuration.

    1. Re:Poorly researched and unimplemented vapour by Anonymous Coward · · Score: 0

      http://www.asus.com.tw/mb/socketa/a7m266-d/overvie w.htm

      He probably meant this motherboard

  136. The human genome in a 800mb .zip file. by autopr0n · · Score: 2

    Actualy, when the Human Genome first got online, I downloaded the thing as an 800mb zip file. Because I could. It was only a few gigs uncompressed. Unless you needed to store the whole genome for a couple people (rather then, say, diffs) current tech works fine. Hrm, a little odd knowing that the whole Human Genome is only about four or five times the size of a Divx movie.

    --
    autopr0n is like, down and stuff.
    1. Re:The human genome in a 800mb .zip file. by Lord+Bitman · · Score: 1

      what is referred to as "The Human Genome" is not the complete DNA Sequence, but a small part of it which somebody decided was the part that defines who we are (I guess it varies more than any other part they bothered to study)

      However many people are of the opinion that this is Bullshit.

      --
      -- 'The' Lord and Master Bitman On High, Master Of All
  137. coupled with FreeBSD 4.5 by Anonymous Coward · · Score: 0

    This would be great to run with FreeBSD 4.5.
    A huge fileserver on the world-famous proven
    BSD FFS filesystem.

  138. not really by autopr0n · · Score: 2

    The bandwidth is pretty good, but it's the latency that'll kill you.

    --
    autopr0n is like, down and stuff.
  139. raidweb by NetMasta10bt · · Score: 3, Informative

    Ok. This is just inane. Why build this when someone has already done it better for cheaper?

    http://www.raidweb.com

    We purchase their 8 disk IDE RAID arrays. They are hot swap, support RAID 0, 0+1, 1, 3, 5, and hot spare, have dual failover power supplies, come with 64MB cache, which can be upgraded. Configurable via the EZ front LCD display, or via serial console. They support ATA-100, and ATA-133 coming shortly. Software upgradable, and it runs Linux.

    They array (sans disks) runs us $3200. They even have versions that have dual fiber ports out the back.

    WARNING - DO NOT purchase these with IBM GXP75 (75GB) disks like we did... we have about 80 of them that failed.

  140. read this offtopic post! by kilgore_47 · · Score: 0, Offtopic



    For God's sake, do not kill us! We surrender!

    quoting from rw: Before dawn in Afghanistan last Thursday, US Green Berets launched a surprise attack on their unarmed allies, storming a disarmament depot with indiscriminate fire, then rounding up survivors only to tie their hands behind their backs with plastic bands and execute them. This according to that America-hating, propaganda-strewn leftist rag, The New York Times. God bless America.

    (yes, I'm very much abusing my 50 karma account and spamming this message all over the place with a +1 bonus. People need to read it, ok? Read the fucking nytimes link. Thanks for your time.)

    --
    ___
    The way to see by faith is to shut the eye of reason. --Ben Franklin
  141. Re:Great! Where's the backup solution? by greenfly · · Score: 2

    I could be mistaken, but I didn't know that one could hot swap IDE devices. I thought they didn't really take kindly to you pulling them out of a running system. That means that you end up having to power down your system each time you want to take a backup home.

  142. more then that. by autopr0n · · Score: 2

    Therefore 57MB required per human

    You still need indexing information. You need to spec where those diffrences occour.

    --
    autopr0n is like, down and stuff.
  143. Uh, no by autopr0n · · Score: 2

    One base always matches up with the same one. Cytozine with Guanine (CG), Atozine with the 'T' one (AT) and the reverse (GC, TA). So you only need to record half of the pair.

    --
    autopr0n is like, down and stuff.
  144. Another design by heretic · · Score: 2, Informative

    I just built a similar setup -- 500GB for less than $2,900. However, I made some different design choices.

    First of all, I wasn't too impressed with the Promise controller, so the choice for me was between the 3Ware 7850 and the Adaptec 2400A. The Adaptec had the best overall performance, but the 3Ware is close and can support 8 devices. For the hard drives, I wanted to come reasonably close to SCSI performance, so I chose the WD1000JB drive with the on-board 8MB buffer. I used a Tyan Tiger K7 with 64-bit PCI for the motherboard with dual Athlon XP (not MP) 1700+ CPU's plus 1GB ECC registered PC2100 DDR RAM. Put them all in a nice aluminum rackmount case.

    I'll probably replace the motherboard with the newer Tyan with 66MhZ PCI bus in the near future and use the current one in a workstation. I'll also drop in more RAM if/when prices drop.

    It's been pretty sweet so far with LVM + XFS. My backup solution is a 33GB tape drive, so I spend most of every Sat. backing up the array. Time and money permitting, I'll build a second one and look for a DLT tape library on ebay.

  145. The mailing list.... by Anonymous Coward · · Score: 0

    Don't forget to sign up for the linux-ide-arrays mailing list. Just send a blank email to

    linux-ide-arrays-subscribe@lists.math.uh.edu

    Andrew Klaassen

  146. Ripped Anime unreasonable? by Lord+Bitman · · Score: 1

    I have my entire anime collection ripped for instant access, and I dont intent to ever stop this practice. (maybe if I get a hundred disc DVD changer.. but then what about VHS?)
    I suspect that in coming years this will not be such an uncommon practice as to be called 'unreasonable'
    just remember that sloppy, bulky, huge code, that's the wave of the future, and the only way some tasks will ever be carried out. 1TB-per-disc can't bee too far off, the public demands otherwise, it seems.

    --
    -- 'The' Lord and Master Bitman On High, Master Of All
  147. software RAID by DavidJA · · Score: 2

    and if you use software RAID via win2k

    PLEASE do not ever used software RAID on a production file server! Esp. Win2k's implimentation of software RAID!

    We use to run a software RAID on a file server (serving only 10 macs mind you!) - Both using 4x9 gig SCSI drives (a while ago); and 4 x 30gig IDE drives

    Everything runs OK until you need to replace one of the drives; then the performance whilst rebuilding absolutly sucks!

    I've seen the system take over 12 hours of production time to rebuild a 90 gig software RAID; all time performance for network users absolutly sucked!

    The solution; good quality hardware RAID; we now run a compaq 5200 hardware RAID card; and all compaq drives: I can pull a HDD out right now; put a new one in and have the RAID re-built without any network user noticing....

  148. I could use a terrabyte by HanzoSan · · Score: 1


    How else can people store HDTV or full quality DVD movies, games, full quality music cds,

    think about it.

    --
    If you use Linux, please help development of Autopac
  149. 1.4TB at my company by glwtta · · Score: 2
    and we are using a mere fraction for our DNA info storing needs (which I am sure has been said a hundred times already). DNA sequences themselves are tiny (comparatively) it's the annotations that take up the space, but even that is under a TB for most needs.

    I can't remember exactly right now, but Celera's storage was something like 100TB, wasn't it? Of course when you are actually doing the sequencing and annotation of the whole damn thing, you need more space. (of course they weren't using nearly all of it, and it also included stuff to service their "subscription" clients, each one of which would of course get a significant chunk to store their stuff...)

    any one have more recent (or more exact) info?

    --
    sic transit gloria mundi
  150. Re:Nothing special - bullsh*t by afidel · · Score: 1

    No, you still can't build really big servers. But you can slap 8 160GB drives in a box and drive them all at full speed, or just as close to full speed as SCSI controllers manage with 8 drives on a controller (PCI bus speeds being what they are). I've been using 3Ware gear for a year and a half - they work. For any system that requires 8 drives or less, there really is no reason to pay SCSI prices.

    while 3ware doesn't advertise it, you can have more than one escalade in a case. I have seen people put as many as 3 full loaded 6800's in one case. 18*160GB=BIG storage (each controller has 8 drives, lose one for parity and one for a hotspare)

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  151. Re:Need for memory/storage [OT] by seann · · Score: 1

    I've concluded this is a giant conspiracy theroy to convince people he has never said that.
    I now want money. You all got money, wheres my money?! I'll say windows is better than linux for money.

    Windows is better than linux.

    Now wheres my money? I need to go buy a Slackware cd.

    --
    I'm a big retard who forgot to log out of Slashdot on Mike's computer! LOOK AT ME.
  152. Offsite backups to my home by Paul+Johnson · · Score: 2
    We have 2 PCs in this house and I back up both to a TR-4 tape drive. Then I take both tapes to work and bring back the previous week's backup.

    Over the years we have put so much of our lives on to the PCs that we would be seriously lost without the archive.

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
  153. Re:Great! Where's the backup solution? by Paul+Johnson · · Score: 2
    Err, me, as I mentioned in another post in this thread. My wife and I realised some years ago that we had an awful lot of our lives on disk. Since I have tape backups, taking a tape set to work to keep in my desk seems a trivial precaution.

    Paul

    --
    You are lost in a twisty maze of little standards, all different.
  154. Re:Great! Where's the backup solution? by Paul+Johnson · · Score: 2
    Check out Amanda. You give it a backup cycle of D days and a tape cycle of T tapes where T is usually 2D+3 or so. Drives to be backed up from around the network are listed in a config file. Over a period of D days the system will (if it can) schedule a level 0 (full) backup of everything and also do a level 1 (diff against last level 0) backup of every drive every day. Sometimes it has to drop to level 2 (diff against last level 1) to make it all fit.

    When you want to recover something a browser lets you traverse the directory tree and tag the files you want. Then Amanda tells you which tapes to mount to recover them. Cool!

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
  155. Fire costs by Paul+Johnson · · Score: 2
    It takes maybe a month to locate new premises, buy new equipment, wire everything up and get people moved in. Maybe less if you are a small organisation or have someone talented doing it. The costs of doing this are paid by your fire insurance (which you have, of course). This is still not free because you are not running your business at the time so your cash flow situation may get tight.

    But if you lose your data you lose your business, and no insurance is going to cover that. Years of work goes up in smoke.

    Paul.

    --
    You are lost in a twisty maze of little standards, all different.
  156. Re:Please tell me your error-correction system by Anonymous Coward · · Score: 0
    How the fuck is this offtopic you moderator-nazi's? Huh?

    I think the mentality was: "Cancer bad--make the comment that makes think of cancer go away"

    Mutation has everything to do with cancer.

  157. Re:Need for memory/storage [OT] by Anonymous Coward · · Score: 0

    Yeah, I know. Since the 640kB quote was said to be an urban legend, I thought I'd throw that in for fun. Wee!

  158. Re:Great! Where's the backup solution? by isorox · · Score: 2

    Similarly I read something about a 1GB/hour VHS backup system about 5 years ago. With packs of 5 standard 4 hour VHS tapes costing about £5, that works out 25 pence a gigabyte - about half the price of cdr's.

    However even the best video tapes will degrade very quickly compared with optical, computer tape systems, and even IDE hard drives.

  159. Re:Great! Where's the backup solution? by Chewie · · Score: 2

    As far as the 40/80 GB max on DLT, IBM and Compaq both offer larger backup solutions, LTO and Super DLT. Compaq has embraced Quantum's SDLT, which has a capacity of 110/220 GB and a transfer rate of 11 MB/sec (uncompressed). Search speed is roughly 4.5 meters/sec. IBM has embraced LTO, which uses 100/200GB tapes, has a transfer rate of 15 MB/second (uncompressed, and a >2x increase over 40/80 DLT at about 6 MB/second uncompressed), and has an on-tape chip which can hold an index of all the files on the tape for easier retrieval. The search speed on LTO is about 6 meters/sec.

    Now, all of this is useless without being "generally available", so I did a little price-checking. Below are internal single-drive units (no autoloaders), and list price from manufacturers:

    Compaq 40/80 DLT Drive (internal) - $3,499.00
    Compaq 110/220 SDLT Drive (internal) - $5,590.00
    IBM 100/200 LTO Drive (internal) - $3,999.00

    Just wanted to point out that there are other options.

    --
    49 20 68 61 76 65 20 74 6F 6F 20 6D 75 63 68 20 66 72 65 65 20 74 69 6D 65 2E
  160. What's the big deal? by cfulmer · · Score: 2, Interesting

    Sam's clum in the area recently had a WD 100GB dide hard drive on sale for $120 after rebate. 1TB at that rate is ~$1320, plus a few hundred for the Motherboard, processor memory and extra controller cards, and a TB server is within reach of an 18-year-old who saved his paper-route money.

    The real question is: how long will it take to listen to all those mp3s? At some point, extra storage just isn't practical because you can't fill it fast enough.

    1. Re:What's the big deal? by Rader · · Score: 2

      Well, that depends. I have 9,800 albums in MP3 format, taking up about 570 GB. If I could fit them all on the HD, I'd be able to trade and double it in a couple months.

      Thus, for me, having enough hard drive space causes me to trade faster, thus need more hard drive space.

  161. What about this? by seb_sikora · · Score: 1

    16 Pentium 100 base units - £160 16 100baseT nic - £160 16 60gb maxtor 7200rpm IDE ATA 100 HDD - £1920 16 port hub - £150 Base machine - Mainbord - £60 - 1.1ghz thunderbird - £70 - 1gb RAM - £180 - 20gb hdd - £50 Total base machine - £400 TOTAL COST - £2470 TOTAL STORAGE - 960gb Im a bit new to all this, but surely the above lot could work. If you made sure there was a good bit of ram in each of the doner pcs and used software raid you could access the whole lot like a huge drive couldnt you? It would allow the system to be redundant as well, as you could use 460gb storage, and have the other 460gb as an automatic backup. Take care all

  162. Some Stupid Hardware Choices by shoemakc · · Score: 1

    Yeah it's cool they had to money to do this, but they made a number of STUPID Choices on hardware.

    1 - I can see using the A7B266 because of it's 64bit PCI slots....but what use is the extra processor for? a 1Ghz Athlon is already Overkill for calculating raid5 parity information, no less two of them

    2- If you sprang for the extra cash to avoid saturation, WHY THE HELL would you use a 10/100 NIC? No matter how fast the array is now, your sitll only going to be able to move 12~13MB/sec MAX. Either Save the cash and get a cheaper mobo, or pony up to an all-gigabit backbone.

    3-2GB of RAM???? See Above. Given it's cheap...but even 512MB would be overkill.

    4-Fasttrack 100Tx2's with Maxtor 160's. Did anyone tell these guys that ata100 only supports up to 137 GB per drive? These guys are wasing 184GB even before disk slack. Their "gigabyte array" is more like 900GB. Either buy the 120's and save some cash, or pony up for ata-133 controllers.

    Slashdot actually published this? It's not particulary special, or even clever. These guys just had a bunch of cash to throw at hardware and didn't even take the time to do the basic research.Shame....Shame.

    -Chris

    --
    --an unbreakable toy is useful for breaking other toys--
  163. Um, I'm having trouble adding past 5K by arget · · Score: 1

    So, like, the total I get for his parts is 4920, but his total is 5720.

    $800 bucks buys a lot of skittles and coke...

  164. I remember when the Circut City guy said that... by Poppageorgio · · Score: 0

    I can still remember the words of the Circut City guy.... "A 40MB hard drive is HUGE! You will never be able to fill that up..." And that was only like 6-7 years ago.

    --
    Me fail English? That's unpossible!
  165. Storing DNA by Anonymous Coward · · Score: 0

    I'm no biologist but as far as I understand it to store the entire DNA data for one human being you would have to capture the current DNA in every single cell of the human body. As a lot of things damage our DNA from day to day can you say smoking, drinking, getting a sun tan. Granted 99 % or so of the DNA recorded may be identical but the other 1 % or so may be quite different. Doesn't sound like much but 1% of what, more than trillions of cells is still a lot of data. Yes your body does try to repair any damage however it doesn't always succeed just look for a mole somewhere on you, even a tiny one that's still a lot of cells. Now throw in storing positional information for each cell etc. And terabytes start to look like bytes.

    NB: As I understand it our body essentially has a one level error correction when it comes to repairing DNA I think a cool use of nanotech when we eventually get there will simply be to improve on this, imagine you get your DNA sampled from a number of different places stem cells etc. So you know what the master copy should look like, then you have little nanites like ribosomes go around your body and help out, if they find a cell with DNA that doesn't match the master copy they repair it. Would help us live healthier longer lives. As far as I'm aware the only reason our bodies haven't evolved better error checking is simply because it doesn't need to as far as your DNA is concerned your just a vessel for procreation and you already live to about 70 that's plenty of time to pass on your genes. I think I'll stop now before I really get going ;)

    You can always use more space I'll bet the transport buffer for a real life teleporter like on star trek wouldn't be much good with a few terabytes , might teleport an eyelash or two ;)

    Disclaimer: All of the above is purely as I understand it, if any of it's off by a mile or two then go easy on the flames ppl.

  166. Calculus, Clippit and 640k of RAM by BigBlockMopar · · Score: 2

    I guess the simple truth is that now that 100 gig drives are a couple hundred bucks, we now have the ability to store anything we reasonably could need (unless you define "Reasonable" as "I need to store DNA Sequences").

    Doesn't "640k ought to be enough for anybody" suggest that Bill Gates once felt the same way about RAM?

    Of course, visionary that he is [snicker!], there's no way he could have imagined desktop machines being used to edit video.

    Likewise, who knows how big and bloated Clippit The Office Paperclip can get if we have 100 gigs of hard disk space to burn... maybe, one day, he'll actually bear consultation when you need information, instead of when you need something to laugh at.

    I love calculus so much, I want to give it to everyone! Come, get some integration!

    MmMMmmm... calculus. Hours spent in the dentist's chair, with him scraping hard crusties off my teeth... And you're just giving that stuff away?

    --
    Fire and Meat. Yummy.
  167. Sorry, Taco. by Anonymous Coward · · Score: 0

    I don't see it that way. I remember when 20GB drives were out and about. No one could have ever possibly filled those. :)

    More storage space simply means two things..

    1. Bloated programs.
    2. New and better media formats.

    A pity we can't just get #2. :p

  168. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0

    Why would you settle for DLT... When there is LTO (Ultrium) Techonology.

    100 GB Tapes (Compression gives you upto 200GB)

    you'd get a Full backup in about 10 hours on 4 or 5 tapes!

  169. Storing DNA sequence by king-manic · · Score: 1

    GenBank, the US DNA sequence repository, isn't at a tera byte yet. give it a year.

    --
    "There are more things in heaven and earth, Horatio, than are dreamt of in your philosophy."
  170. Promise is Linux Unfriendly by PhotoGuy · · Score: 2

    I'm surprised no one has mentioned this, but Promise has become more and more Linux-unfriendly lately.

    There's different minor revisions of the 100Tx2 controllers; you can only tell by looking at the chip on board, I think only the last digit is different. I could not get the latest ones working with Linux at all. I ended up buying these boards under the Maxtor brand name (same units, but slightly older), which had the older chip set.

    On the latest boards, it seems Promise appears to have intentionally made certain registers read only, thwarting open source driver development.

    With that kind of behaviour, I'm staying away from Promise controllers, period. (I also had a hard time with their Raid5 controllers.)

    Back when they were Linux-friendly, their ATA100tx2 cards were nice. But with the latest incompatible chipsets and no help from the company, forget it.

    I also had some frustration with Adaptec's 2400 controller. It is *still* only supported by Adaptec under RedHat 7.0. And it has no audible alarm for drive failure, most annoying. Finally, under FreeBSD 4.3, it's performance was abysmal; there was definitely something wrong with the I2O driver working with this card. (I haven't tried 4.4 yet.)

    For now, I'm just sticking with motherboard IDE controllers; far more tried and true.

    -me

    --
    Love many, trust a few, do harm to none.
  171. Terrabyte Arrays by Anonymous Coward · · Score: 0

    Imagine a Beowolf Cluster of THESE!!!

  172. Just be very careful. by Bob_Robertson · · Score: 2

    If you do store DNA sequencing information, make sure you only use lossless compression.

    Or, for that matter, the issue for me is backup capasity again. With the advent of DVD-R (or whatever it's called today) I thought that "full backups" were going to be possible again. But now, with such vast quantities of data possible to have online and changing, backup issues again come to the fore.

    Lossless compression helps, but now I'm stuck writing not 50% of a 4-Gig tape over the weekend, I have to write two or three full tapes.

    As memory and disk space has become cheaper, bloat-ware uses more and more of it. I don't consider bloat-ware a good thing, but it cannot be fought any more than the monster shopping mall can be fought just because I happen to like mom and pop shops.

    The difference between information and data, I guess. The next great invention I think will be the personal digital secretary, like the ones detailed by Daniel Keys Moran in his wonderful "books of continuing time", designed to sift through the impossible quantities of data yet still have the personal touch to say "Gee, that bit over there looks interesting. I think Bob would like that."

    Bob-

    --
    The Ludwig von Mises Institute. The reasoning individuals economics
  173. Terra Earth by Gis_Sat_Hack · · Score: 1

    Actually, it's the land surface that's approx
    100 x 10^12 sq metres (100T sq metres).

    Using a rough radius of 6,378 Km & assuming the earth to be a sphere, the surface area of the entire earth is approx:

    511.2 x 10^12 sq metres.

    What to store ?

    let's say color+height+simple usage byte.
    color = 3 bytes
    height = 2 bytes (16 bit signed int => MeanSeaLevel +/- 32,000 metres)
    type = 1 byte

    Storage = 6 x 511.2 TB = 3067 TB

    ( Aside:
    ocean floor color ? yes - most geophysical / geological imagery uses 'psuedo-color' created from measurable surface properties.
    )

    And that's _*just*_ the 'instantaneous' surface, as humans we would be most interested in the surface +/- 10Km about sea level, projected both forwards and backwards in time.

    The current batch od low orbit earth scanning satellites have instruments that (each) deliver approx 1GB data/hour & thats round the clock for the lifetime of the package (est 12 years).

  174. Bill Gates never said that by jhamm · · Score: 1

    It's an urban legend. For more info:

    http://www.urbanlegends.com/celebrities/bill.gat es /gates_memory.html

  175. Removable Disk Drive Drawers for Backup. by billstewart · · Score: 2

    For about $20-30, you can get disk drive drawers that turn a 3.5" drive into a 5" removable drive. Nothing active; it's just a bunch of mounting hardware. (About $20 for the part that stays in your machine and $10/disk for the removable drawer parts.)
    This makes it easy to use disk drives as backup media, which is good, because they're much faster than tape. It also makes it easy to upgrade your disk capacity when you want to do that.

    --

    Bill Stewart
    New Fast-Compression-only CPR http://preview.tinyurl.com/dy575ks
  176. Re:Great! Where's the backup solution? by jandrese · · Score: 2

    Fibre Channel hardware tends to be a little too expensive for the $5k crowd. This guy is using commodity hardware, and that generally doesn't include fibre channel. Even if he bought the hardware, the driver support for something like a SAN just doesn't exist in Windows/Linux/BSD yet.

    --

    I read the internet for the articles.
  177. I think.... by BenTheDewpendent · · Score: 1

    we killed it... it worked fine this morning but now its dead.

  178. been there, done that. by Anon-Admin · · Score: 1

    I have a large home network. and I already store all my CD's in mp3 and stream them to the clients. As well as 16,000+ books on line etc. Although the story was ok I have been working on makeing mine out of the cheep parts I have picked up from the dot com's that are going under. The local side walk sale it worth it.

    If interested I have a page up about my network at http://www.xganon.com/~cl/network/

  179. Record your whole life by rednox · · Score: 1

    What do we do with this kind of storage?

    How about recording audio files of your entire life!

    The way to do it would be to carry around a small, lightweight device that would record each day's audio. The device would have 100BT ethernet and some custom software to connect it to a massive storage server. Some custom software would be written to transfer the files each night while the batteries are being recharged.

    Aqcuiring Hardware

    Pocket ePC-II System - $749
    950g, 157mm (L) x 146mm (W) x 45mm (H)
    PII 900Mhz, 128MB RAM, 10GB HD, 100BT Ethernet
    4 Li-Ion Laptop batteries - $800
    Mini Stereo Microphone - $65

    Storage Required

    128 kb per second for stereo mp3 at 128kbps
    86400 seconds per day
    1382.4 megabytes per day
    504.576 Gigabytes per year

    1TB would give you enough storage for 1 year, plus 1 complete redundant backup. Assume that every 18 months, the same amount of money will buy you a hard drive that is twice as big. Also assume that you throw away the old server every time you replace it with a new one, just so you don't end up with a huge pile of servers. I will start off with a 1.5TB server, which will last the same 18 months to make the math easier.

    Storage Hardware

    1.5 TB server
    ~ $8500

    1st purchase at birth = 1.5TB, enough for 18 months
    2nd purchase at 18 months = 3TB, enough for 3 years worth
    3rd purchase at 3 years = 6TB, enough for 6 years
    6 years = 12TB
    12 years = 24TB
    24 years = 48TB
    48 years = 96TB, enough to last until you are 96 years old

    So over your lifetime, you would have to make 7 purchases, for a total of $59,500. Add in the capture hardware of $1614, and you have a total lifetime cost $61,114. As the price of storage goes down, this will be even more affordable.

    Soon, it will be feasible to do the same thing with high-quality video!

  180. Re:Great! Where's the backup solution? by scottj · · Score: 1

    I've been backing up most of the servers that I administer onto a hard drive for at least a couple of years now. Typically, I simply scp a tar file to my backup server once a day. It's worked flawlessly, and my backup server can EASILY be offsite. This is my preferred backup strategy.

    --
    .-.--
  181. One problem... by nakens · · Score: 1

    The only thing I haven't seen anyone post about are power cables. I recently built a 1 TB server. It ended up costing about $3500. The problem I ran into was that while I had plenty of space in the Antec SX1240 Full tower case I bought, the power supply only has 6 power connectors. I'm currently in the process of figure out a solution, one of which is buy a second power supply and just use the extra connections on it. That will teach me to work without a checklist. Another issue to worry about are IRQ's. If your going to put 4 IDE add-in cards in there you'll probably get some overlapping (with windows at least). The solution I opted for was buying one of the relatively new abit boards that has 4 IDE controllers on it and one IDE add-in card.

  182. Nominal size by Tony-A · · Score: 2

    You don't say 1.024k bytes, you say 1k bytes and expect the listener to know that about 1000 is exactly 1024 due to the context. If 1k bytes were always 1024 bytes, how would you interpret 14.112k bytes?

    3/4" pipe is 1.050" Outside Diameter.
    The 3/4" refers to an Inside Diameter of a pipe with a particular wall thickness (which may or may not still be made). Regardless of how thick the walls are, and consequently what the Inside Diameter really is, 3/4" pipe is 1.050".

    IIRC there is something about a US bushel being a different volume depending on what is being measured.

  183. Gorp at the Amazing progression of technology by MjDascombe · · Score: 1

    And how much cheaper things get, 6 months ago a peice of kit like this would have cost you nearly, erm, $5000 [http://slashdot.org/article.pl?sid=01/07/19/15542 16&mode=thread]. Whoops

  184. Re:Great! Where's the backup solution? by hardc0de · · Score: 1

    Here's a thought...

    Split $DollarAmoutYouWantToSpend in half. Build two of these badass mofos. Give one to $Buddy/Work/'RentsWithBandwidth. Cron 'em to (r)sync over night.

    I don't mind if I lose a day of pr0n.. Teehee.

  185. Awww man... by gfolkert · · Score: 1

    Got a bit of info on the new 3ware product coming out in about April/May.

    12 IDE ports on the card, basically just another cascade on board. It'll be 32/64-bit and be able to run at ATA133 with >128GB devices. Best part is the current drivers will(should) work with it!!!

    At the rate HD prices are coming down and GB per device is going up... shouldn't be long before we see 10TB (with RAID5 + hot swap per 12 drives) for US $18K. That would be $1.80/GB for this kind of storage. And now that iSCSI is "working" with Linux... you could prolly use it as a SAN Storage server... Hmmm.... GB ethernet SAN.... hmmm....

    Now, bring on an IBM 3494 Tape Library and 3590K Tape Drives and media... we all set. Of course let's not forget TSM (or what used to be ADSM) and a machine to to backups only.(1 Cabinet and 2 Drives, full of media $120K)

    --
    greg, REMEMBER ED CURRY!!!
  186. Re:Great! Where's the backup solution? by Anonymous Coward · · Score: 0

    I do. I use a script to tar important directories, email, programs I working on, etc, and burn them to a CD. Periodically I leave a copy at my brothers. Cheap, easy and knowing that I will not loose access to the source to projects I've done over the years for myself and clients is "priceless".