Slashdot Mirror


Large Scale Solid State Memory Storage?

spacechicken asks: "I am doing a theoretical study of an extremely secure large scale data storage concept. Due to the nature of the (theoretical) location of the (theoretical) warehouse some of our constraints include very few (if any) service visits, complete remote administration and no moving parts. Does anyone have any experience using or information on large scale (on the order of 10^12 - 10^15 bytes) deployment of solid state storage? And to preempt those who will say it - I have Googled."

38 of 76 comments (clear)

  1. Here you are. by Henry+V+.009 · · Score: 2

    I believe that this might be what you are looking for.

    The technology is quite mature.

    1. Re:Here you are. by Henry+V+.009 · · Score: 2

      Preview, goddamn it. Here is the link.

  2. HUH?!?! by Guspaz · · Score: 2

    You want almost a THOUSAND TERRABYTES (10^15 bytes is 909.5 terrabytes)... of SOLID STATE storage?!? Are you insane?

    That's about a quarter of a billion dollars in PC133 ram. I'm sure wholesale prices would bring that down, but you're still talking many millions of dollars.

    1. Re:HUH?!?! by Guspaz · · Score: 2

      A terrabyte is 1024 gigabytes.

      The guy who submitted this story is talking about over 900000 gigabytes of storage, using only memory and no hard-drives. Using modern technology, such a thing would cost shitloads of money. I came up with the figures of over 3 million dollars to do it with hard-drives with 2x redundancy, and half a billion to do it with PC133 memory with 2x redundancy.

    2. Re:HUH?!?! by spacechicken · · Score: 2, Informative
      I am the guy who submitted this.

      Never mind the cost. Right now we are looking at the technical problems.

      As I said - there will be no visits from service staff to replace/repair failed modules. Also, the site will need to be set up in one go - hence the need for so much storage all at once.

      Hard drives can not be used - they would not survive the trip or the location.

      And I am most definitely NOT Uncle Sam.

    3. Re:HUH?!?! by .milfox · · Score: 2

      See comment later for some solutions in the Terabyte range. :P

      Sounds like a space/deep ocean shot of some sort. Curious, curious... Let us know what it is?

    4. Re:HUH?!?! by Johnny+Mnemonic · · Score: 2


      Sounds like a space gig, based on the poster's previous post in the Australians to Build Spaceport on Christmas Island thread. Also, in this thread, the poster specifically uses the example of Australian Aerospace students.

      While circumstantial, I would guess that this is a privately funded space shot. Why he needs this much storage is still an open question, but by non-serviceable I believe we can take him literally :)

      --

      --
      $tar -xvf .sig.tar
    5. Re:HUH?!?! by .milfox · · Score: 2

      Well, if it's space then radiation becomes an issue. That alters any possbilities of a cheap solution.

    6. Re:HUH?!?! by Twirlip+of+the+Mists · · Score: 2

      Have you done any estimates of total worldwide memory production? In a discussion of 64-bit filesystems, I once saw some figures indicating that in order to build an 18,000,000 terabyte filesystem, one would have to buy every hard drive produced in the world between 1993 and 2000. I don't know how accurate those numbers are, but it raises the question: how much RAM is manufactured in the world every year? How many years of production would you have to commandeer to do what you're talking about? Have you thought about those questions at all?

      --

      I write in my journal
    7. Re:HUH?!?! by Guspaz · · Score: 2

      I'd say they could easily survive the trip, if not the location. Consumer hard-drives (I'm using Maxtor DiamondMax Plus 9 as an example) are rated for 60G when operating for up to 2ms, and 300G when not operating for up to 2ms. Obviously if you have prolonged high-G environment, these are significantly lower, but should nevertheless be sufficient. And these are consumer models, there are companies that maked hardened HDs designed to take much more punishment. Hard-drives are regularly launched into space. There are several laptops abord space shuttles.

  3. Should be easy enough... by node+3 · · Score: 3, Funny

    Just go the the place where you got your 10^6 processor smp server. I'm sure they deal with 10^15 bytes of solid state storage all the time and will be happy to get you what you are looking for.

    Hope that helps.

  4. Re:Heh by Guspaz · · Score: 2

    Heh, yeah, not often, but I still think you could make significant savings by using non-solid-state. You could get good reliability using RAID 0+1.

    The biggest consumer drive I know of is Maxtor's 320GB drive. It would take a minimum of roughly 3000 drives to cover ~900TB with no redundancy. The MTTF of these drives is over a million hours according to Maxtor, though that's over a hundred years so I question this figure. I wasn't able to find a price on the 320GB drives, but assuming double the cost of the 160GB drives, we're talking 3.2 million dollars for 2x redundancy, much lower than the 250 million price for single-redundancy in memory.

    As you can see, the cost is still enormous even with non solid-state. Maybe the article submitter needs to delete a few (theoretical) things so he doesn't need so much (theoretical) space ;-)

  5. How about RAMDRIVE + SCSI + SAN? by .milfox · · Score: 2

    How about a combination of commercial IDE/SCSI RAID technologies chaining many individual SDRAM based solid state units along with some sort of SAN?

    For example

    http://www.storagesearch.com/ssd-3.html can hold up to 8gb per unit, and at 15 per SCSI card, 6 cards per system, you'll end up with 720GB per 'server'.
    Figure gigabit fiber (at the minimum) between machines and you'll end up with quite a bit of storage.

    1. Re:How about RAMDRIVE + SCSI + SAN? by .milfox · · Score: 2

      Oops, too late to edit. :P Alternately, you can go IDE. These are 12GB units, and you can hook up 12 per IDE RAID controller, 6 controllers per system, for a total of 864GB per system.

  6. Question by Thrasymachus+Online · · Score: 4, Interesting

    The interesting part about your question is the complete lack of background on why you would possibly need all this storage.

    So I'll make a guess. 10^12--10^15 bytes is a large range. And I can only think of a few ways to generate that much data. The most probable is video cameras, but I can't think of any reason why you would need it secure in that fashion.

    Secure without human intervention is interesting. I mean, if all you want is security, the easy way is distributed networks and encryption. And really, that would be more secure in the event of nuclear war or other similar events.

    So I have two guesses:

    A) Given the similarity of your numbers to the 10^11 neurons in the human brain (and each neuron has as many as 1000 connections to its neighbors) this is some sort of screwy immortality thought experiment.
    B) Given the security requirements, this is some screwy thought experiment involving the preservation of the sum of human knowledge over a vast stretch of time without human presence. That could be interstellar travel, say, or large disasters wiping out the human race.

    1. Re:Question by spacechicken · · Score: 2, Informative
      I've been waiting for someone to ask what it is about.

      Rest assured it is neitherone of the two screwy thought experiments you have postulated. It is another one entirely. Albeit one with a commercial bent.

      As I said earlier the reason for so much storage is that the site needs to be built in one go. There will probably be no ability to increase the size of the facility as requirements grow. We are not pulling numbers out of the air (or counting neurons). We are simply planning ahead

    2. Re:Question by pizza_milkshake · · Score: 2

      i was thinking genetic data storage

    3. Re:Question by Zerth · · Score: 2

      Something out of this atmosphere?(in either direction)

    4. Re:Question by Monkelectric · · Score: 2

      I think you're a meta-troll. And if so you've done a damn fine job of it

      --

      Religion is a gateway psychosis. -- Dave Foley

  7. Back of the envelope calculations by ka9dgx · · Score: 2
    You want 1 Terabyte-1Petabyte of solid state storage, so here are some "back of the envelope" type calculations.

    Assuming slow, but commodity CompactFlash cards of 1Gigabyte each (currently $800 retail, your price may vary). You'd need 1000 to 10^6 of these puppies for approximately 800,000 to 800,000,000 $US (Retail). It would be fairly compact, fairly reliable, and fairly slow.

    So, with a price of $800,000 today for the low end, in 3 years (more or less) the price for 1 Terabyte of CompactFlash will be $100,000. This drops further to the point where I can afford it, in about 2 more orders of magnitude (7 years?).

    Bottom line, it's feasable, would be $1Million to $1Billion to implement 1 of (at retail, buy the fab, prices WILL drop dramatically). I'll be able to afford the same thing 10 years later.

    --Mike--

  8. i got it! by pizza_milkshake · · Score: 2
    i got it! i've figured out all possible uses for this much storage:

    • bill gate's savings account
    • national debt ticker
    • google hit counter
  9. Re:Leg work on /. leads to fires of speculation by spacechicken · · Score: 3, Informative
    I'm being hunted :)


    There is a serious topic behind this question. However, it usually requires a lot of explanation.


    Some of your guesses are correct - this does have space applications, but not in the way you think.


    Suffice to say that the storage is not planned (theoretically) to be used as anything more than a commercial repository.


    The reason for the vast amounts (as I keep saying) is because there will probably be only one visit to the site to install it all - meaning no upgrades.

  10. Ok, what about other problems? by Klaruz · · Score: 5, Informative

    Using solid state drives you could probobly do it on the order of about a half billion dollars. Possible indeed. But there are other things you haven't thought of. You didn't say if you need to have this thing on 24/7, but I'm going to assume you do.

    1: Power. Solid state drives tend to forget stuff when shut off, so you'll need a UPS in the data center to handle it. No biggie, except when you realize that the batteries are going to need maintence. They do go bad after a while, I know of no batteries that don't. In theory you could do flash memory instead of volitile for about 3 times the cost (512meg ATA flash storage is $300 on pricewatch, add raid, san, etc, pricey but possible)

    2: Cooling. Massive amounts of solid state chips are going to generate massive amounts of heat. This means water chillers (most likely) and fans. Both involve moters. Moters go bad. You can build redundent, but eventually both cooling systems will go out.

    3: The hardware it's self. CPUs go bad, controllers blow up, ram chips go out, power supplies blow, etc. You can only leave a redundant system alone for so long until it's no longer redundant.

    4: Acts of god. Floods, fire, lack of fuel for power, emi. These things happen. You can build two data centers in seperate locations and write one off when something bad happens, but then you're back to no redundancy.

    5: Murphy's law. Don't forget, Murphy always wins.

    Having worked on a LARGE scale redundent system (Think uncle sam), I can tell you these things do require maintence. Building a system that large without bugs that creep up in a few years time is going to be next to impossible.

    That said, it sounds neat. Let me know when you guys need an engineer to build it, it'd be fun.

  11. Not much to say by photon317 · · Score: 2


    Remotely administering solid-state storage is no different than remotely adminstering spinning storage, just less chance of failure, so less visits. The data size you require can easily fit in a single rackmounted disk array. Vendors like E-Disk sell highly reliable solid state drives in IDE and SCSI, up to at least the high 10's of gigs in size per disk, 2.5 and 3.5 in form factor.

    --
    11*43+456^2
    1. Re:Not much to say by Twirlip+of+the+Mists · · Score: 2

      The data size you require can easily fit in a single rackmounted disk array.

      I think your calculation may be broken. Spacechicken is asking about storage in the range of one terabyte (lower bound, very easy to achieve) to one thousand terabytes (definitely possible, but bigger than damn near everything). You can squeeze about 1.8 TB into a rackmounted disk array using 160 GB drives; the new 320 GB drives will double this to about 3.6 TB, at least in theory. You're still 996.2 TB short of Spacechicken's high number.

      --

      I write in my journal
    2. Re:Not much to say by photon317 · · Score: 2


      Oops, I was off by 10^3 when I read his post :)

      Still, that just means you need several racks, and the problem against isn't solid-state specific. Check out EMC and Hitachi for better storage density in a normal array, but they may not offer a stock solid-state option (although I bet you could retrofit it with drives yourself). In any case, managing a large amount of storage, and managing solid-state storage, are two different things, both of which are pretty standard question with standard answers.

      I will say one thing - if you're thinking of SANing all of this storage to a cluster of servers - don't buy the existing market leaders' products. Cisco is on the verge of coming out with their second-gen SAN switches that are in catalyst chassis and support virtualization (read: veritas software-raid) inside the switch, making for a very nice solution.

      --
      11*43+456^2
  12. Space by TheOnlyCoolTim · · Score: 2

    If you're planning to shoot this into space, I'd think again. You're going to have trouble finding solid state storage that both holds a petabyte AND is radiation-hardened.

    The only other practical place I can think of right now where hard drives wouldn't survive the trip is some sort of undersea base. Sealand already has an offshore data haven, and they can visit theirs.

    Tim

    --
    Omnia vestra castrorum habetur nobis.
  13. Re:Leg work on /. leads to fires of speculation by Zerth · · Score: 2

    So it is sort of a data haven in space kinda thing?

    Heh, screw HavenCo, let's see the feds get a search warrant for an orbital datadump.

    On second thought, one would have less social protections against getting the crap nuked out of it...

  14. Re:Leg work on /. leads to fires of speculation by .milfox · · Score: 2

    *LAUGH* Don't forget the latency to Geosynch, unless you're talking LEO, in which case you'd be in range of ASAT systems.

  15. Look At Me! by Llama+Keeper · · Score: 4, Funny

    SARCASM

    Look at me Mom, I got my question posted on Slashdot by makin up some bogus and totally illogical question. Then I made tons of vauge references and quasi-logical posts to make people wonder. Ooh ooh ooh look at me!

    /SARCASM

    --


    Rule of Life Number 2: Remember, it can all go to hell at any minute. --Jimmy Buffet
  16. Re:Leg work on /. leads to fires of speculation by TheOnlyCoolTim · · Score: 2

    That means:

    A)You are sending it up in a big lead box.

    B)You are burying it on the moon or another spacebone body.

    C)If ISS has proper shielding, it's going into ISS.

    A would probably be insanely expensive. Costs a lot to shoot lead into space. That's why NASA buys radiation-hardened equipment instead of buying some P4s and shooting them up in a big lead box.

    B would be huge news. It would be extremely likely to fail if you send a robotic digger to the moon or an asteroid, so you'd have to do it with people, which would be a historic event.

    C makes the most sense, but I don't know if ISS has enough shielding. Presumably there's enough to keep the people from overdosing on bad particles, but I'd imagine the computers would still have problems.

    Tim

    --
    Omnia vestra castrorum habetur nobis.
  17. Re:Heh by Twirlip+of+the+Mists · · Score: 2

    Remember that MTBF stands for mean time between failures. That's mean as in average. Half of the drives will fail in less than the mean time. The more drives you have, the greater the change that one of them will fail at any given time. If the MTBF is one million hours, and you have 1,000 drives, then unscientific but useful arithmetic says that the MTBF for your entire system drops to 1,000 hours, which is just over six weeks. As I said, the math isn't entirely sound, but it is very useful for estimating the overall reliability of a system.

    --

    I write in my journal
  18. Re:Geez. by Twirlip+of+the+Mists · · Score: 2

    Wrong. One terabyte (what's your fascination with InterCaps?) is 2^40 bytes. One petabyte is 2^50 bytes. The difference between the decimal expansions and the binary expansions become very significant at higher powers. For example, your statement that one terabyte is 10^12 bytes is wrong by 11%, or more than 99 billion bytes. That's quite a rounding error you've got there.

    --

    I write in my journal
  19. Re:Geez. by dfn5 · · Score: 2
    WHATEVER FOR?!

    Ummmm hello?!? alt.binaries

    --
    -- Thou hast strayed far from the path of the Avatar.
  20. Talk to IBM by afidel · · Score: 2

    I assume this is for a space mission from previous comments, so talk to the people with the most space experience. For this type of application the only system I would use would be a raid 1 utilizing IBM's shipkill technology. Basically you would have RAID 5 at the dimm level and raid 1 at the system level. Would it be cheap, hell no. Would it be the most likely to survive radiation hazards, yep. IBM has nearly a half century of studying the effects of radiation on computer components, so if you want to put a complex computer system into space talk to them. For some information to digest before talking to them see the chipkill whitepaper here.

    --
    There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
  21. An actual answer by pla · · Score: 2

    Did anyone think in terms of *real* solutions, rather than just pricing out parts and assuming this guy could magically make them work together?

    Anyway, at least one solution to this request appeared on Slashdot just a few days ago: The solid-state RocketDrive.

    Perhaps not the ideal solution (I honestly can't say if OEM solid-state storage exists on a much bigger scale), but something that you can concretely say "this would work". Proof-of-concept, if nothing else.

    Granted, for the size you want, at $5k/4G, This would cost USD $1.3 billion just for the storage itself (not counting the array of 32k+ PCI slots you'd need to hold all these and the hellacious network to RAID them), but this sounds like a gub'mint project anyway, so cost presumeably forms the *last* of your concerns. If cost *does* matter, you can get the unpopulated controller boards for $800 each, and certainly a *much* better bulk deal on RAM then what Cenatek offers (basically they charge $1k/1G? Perhaps 10 years ago!).

    Checking Pricewatch, the average non-volume-buyer can get 1G of PC133 for around $100). That would lower the storage-only cost to only USD $315 million, before considering volume discounts.

  22. this whole thing is a massive troll by tomhudson · · Score: 2
    This whole thing is a massive troll. Anyone who was really considering such a thing would be able to do the back-of-the-envelope calcs that would show that:
    1. Power and heat: either you don't have enough juice, or you melt down - ram also produces heat;
    2. Errors: the more parts, the more errors, so a project this size would be the worlds' biggest random error/number/data generator;
    3. MTBF: you have parts, you have failures. For example, Maxtor drives may have a million hour MTBF, but the company recently reduced their warranty period from 3 years to 1.
    If this was real, and your boss was even half-way competent, you'd be out a job immediately.

    The only storage that would begin to meet SOME of the requirements would be molecular/biological. Try again in a decade.

  23. Re:Geez. by joib · · Score: 2

    No, the hard drive makers got it right. It was everyone else in the semiconductor industry who got it wrong by using SI prefixes (ie. the mega, giga etc. things) to mean various powers of two, when they really mean powers of ten. Nowadays there is a NIST standard which specifies prefixes for powers of two (the mebi, gibi, etc. things), but few people use those prefixes.