Slashdot Mirror


The Ultimate All-In-One Storage Solution

karnifex writes "Filled up your LaCie Bigger Disk already, and looking for a little more storage space? Good news! The Petabox is ready! 'The petabox by the Internet Archive is a machine designed to safely store and process one petabyte of information (a petabyte is a million gigabytes).' And luckily, as the Internet Archive notes, it's shipping-container friendly (20' x 8' x 8'). So save on delivery costs and order two!"

91 of 387 comments (clear)

  1. Finally by foidulus · · Score: 4, Funny

    My million monkeys at a million terminals will have somewher to save all their potential Shakespeare works.
    But the question is, do my monkeys use VI or Emacs? That shall remain a mystery.

    1. Re:Finally by Anonymous Coward · · Score: 5, Funny

      Well, seeing as how monkeys type random gibberish,
      vi would seem like a perfect fit, yes?

    2. Re:Finally by foidulus · · Score: 5, Funny

      A British researcher actually did trap some monkeys inside a room with a computer for a couple days. They flung large amounts of feces at the monitor and keyboard and beat the living crap out of the box, and typed the letter s a lot. Sounds like the response of a typical user upon their first vi experience.

    3. Re:Finally by MaxiCat_42 · · Score: 4, Funny

      Sounds like the beginnings of Titus Andronicus.

    4. Re:Finally by TTK+Ciar · · Score: 2, Funny

      Argh .. well, I made sure vi and emacs were installed, and jove is installed (my personally preferred editor), and pico is installed, but there is no teco.

      Guess your monkeys will have to use something else :-)

      -- TTK

    5. Re:Finally by cfl · · Score: 5, Interesting

      I just use this:

      Monkey Shakespeare Simulator

      Maybe not as much fun, but without the faeces
      I've noticed that Mozilla Firefox seems to give better results than IE

  2. colossal... by ambienceman · · Score: 2, Funny

    Imagine all the Spice Girls' songs you could save on that thing...wow...

    1. Re:colossal... by foidulus · · Score: 4, Funny

      Don't just stop at songs! Store the DNA sequences for the spice girls on it, such that years from now our decendants may know the joys of Scary Spice.

    2. Re:colossal... by ambienceman · · Score: 2, Interesting

      Yeah and maybe we could clone a few of them (a couple scary and sporty spices) for our own conveniences.....hehe.for singing of course

    3. Re:colossal... by Cyno01 · · Score: 3, Funny

      They sing?

      --
      "Sic Semper Tyrannosaurus Rex."
    4. Re:colossal... by glwtta · · Score: 4, Informative
      Store the DNA sequences for the spice girls on it, such that years from now our decendants may know the joys of Scary Spice.

      The current genome build has a size of 3,020,300,000 bp, at 2 bits per bp and 5(?) spice girls, that's about 3.5 GB (uncompressed).

      Of course with a mostly static database like that you only want to store the diffs, not the whole thing. The bulk of the diff would be SNPs, roughly 1 per 1000 bp: 3,020,300,000 / 1000 / 4 / 1048576 that's about 0.72MB per spice girl. An if you only store the ones actually different from wildtype you probably don't need more than 20% of that.

      You can fit a Spice Girl on a floppy.

      --
      sic transit gloria mundi
    5. Re:colossal... by iminplaya · · Score: 4, Funny

      You can fit a Spice Girl on a floppy.

      Or, you can fit two floppies on a Spice Girl
      "Sock it to me."

      --
      What?
  3. I'd buy one.... by david_reese · · Score: 5, Funny
    but I heard that all you can store on there are
    ...drumroll

    Peta-files

    1. Re:I'd buy one.... by Smidge204 · · Score: 5, Funny

      I heard the FBI managed to squeeze 1000 of these into their annual budget...

      Exa-Files.
      =simdge=

  4. Can you get one... by hemp · · Score: 2, Funny

    if you still enjoy wearing furs??

    --
    Skip ------ See the latest from http://www.anArchyFortWorth.com
  5. Business idea by j_hirny · · Score: 2, Interesting

    Maybe someone should try to sell these boxes to GMail? They will surely need a lot of storage space.

    1. Re:Business idea by timeOday · · Score: 2, Insightful
      I suspect google is more interested in building a platform that would be a competitor to this product. For the device in this article they estimate 1 FTE (full-time-employee) for each petabyte of storage. That doesn't sound so good. Google's system will apparently replicate and migrate data betwen units as necessary so you never need to replace drives at all; the maximum capacity just degrades slowly with time. Perhaps when it gets to 80% original capacity you just roll in a newer unit (which is probably much bigger in capacity as well), hook them together for a day, then throw out the old one.

      The power requirements are also quite hefty. It shouldn't be necessary to run all those drives (and the computers behind them) unless the unit is near capacity and access is random (which I'm sure would rarely be the case). Instead, they should be dynamically powering drives and computers up and down, and migrating data to a reasonably small 'working set' of drives.

      On the hardware front, the device in this article also incorporates 800 "low-end PCs." IOW it's a big cluster that happens to be heavy on storage. If all you want is the storage, surely there is some way to get rid of all those motherboards and CPUs with their fault-prone, power-hungry fans. They need to develop a controller that can directly handle, say, 64 hard drives, analogous to a big network switch.

      Anyways, it sounds like a fun project!

    2. Re:Business idea by AlecC · · Score: 2, Informative

      They need to develop a controller that can directly handle, say, 64 hard drives, analogous to a big network switch.

      It's called a Fibre Channel controller. Fibre Channel loop (which disks use) offers a total of 255 addresses - which has to include the controller. Disks now available in the 300Gbyte region, so 80 Tbyte/loop seems reasonable (and, according to the article, they seem only to have 100Tbyte up so far). 12 of these loops will give you your petabyte. Mind you, you will waste the disk bandwidth; this will gicve you capacity but not throughput.

      --
      Consciousness is an illusion caused by an excess of self consciousness.
  6. One question... by Cyno01 · · Score: 2, Funny

    How many LOCs/VW Bug?

    --
    "Sic Semper Tyrannosaurus Rex."
  7. Petabox is ready! by KevinKnSC · · Score: 5, Informative
    Good news! The Petabox is ready!

    From the article:
    PILOT STATUS 5/2004
    * The first 100TB Rack is up and running!
    * The second 100TB Rack will be up by the end of May

    Apparently this is some new use of the word "ready" with which I am not familiar. Neat technology, no doubt, but it doesn't really look like it's ready for prime time just yet.

  8. In 10 years ... by Bob+Loblaw · · Score: 5, Insightful

    Will we find one of these things in eBay in 10 years selling for $10 and feel all nostalgic about those days when that amount of storage media was the size of a room?

    1. Re:In 10 years ... by DrEldarion · · Score: 4, Insightful

      It does make you nostalgic thinking about things like that. I have more storage on a tiny card in my digital camera right now than every computer in the computer lab at my Jr. High had combined. My cell phone is more powerful than my first desktop computer. I can download in 5 minutes what would have taken me a month to download back then. Ah, technology.

    2. Re:In 10 years ... by blancolioni · · Score: 2, Insightful

      Every hard drive I've ever bought has been larger than all my previous hard drives combined. And this is without even trying.

      The storage problems I have these days are almost entirely organisational.

  9. Don't get too excited by 42forty-two42 · · Score: 3, Informative

    They only have one rack, which is 100 TB.

    1. Re:Don't get too excited by danormsby · · Score: 4, Funny

      Almost a Petabit though.

      --
      Omnis amans amens
    2. Re:Don't get too excited by 0racle · · Score: 2, Funny

      Ya but what a rack it is.

      --
      "I use a Mac because I'm just better than you are."
  10. To give you an idea of how much that truly is: by Three+Headed+Man · · Score: 5, Interesting

    If you gave me a 100 mbit line, it would take me over 92 days to fill it up with porn. More if I slept.

    --
    I'm probably at the karma cap. Mod up a funny troll instead, it lightens the mood :)
    1. Re:To give you an idea of how much that truly is: by ecampbel · · Score: 4, Informative

      You're off by a factor of ten. Let google do the math: 1 petabyte / 100 megabits / second in days = 994.205393 days

      --

      Sig goes here
  11. Potential customers by thedillybar · · Score: 4, Funny
    "After extensive market analysis, we have found 1 (one) organization that is interested in purchasing this device."

    Can we say, Goooooooooooooooogle?

    1. Re:Potential customers by moosesocks · · Score: 4, Interesting

      I believe that back in the 50s, the president of IBM enthusastically proclaimed that there was potentially a worldwide market for four, possibly even five computers. And this was good news.

      So don't laugh!

      (I'm sure there are PLENTY of organizations which could use this type of storage. The IRS and NASA being among them)

      --
      -- If you try to fail and succeed, which have you done? - Uli's moose
    2. Re:Potential customers by lewp · · Score: 4, Funny

      NASA just uses an alien with a really good memory to store all their data. The IRS doesn't store any data at all. They just make shit up on the fly.

      --
      Game... blouses.
  12. Re:Price? by Anonymous Coward · · Score: 3, Insightful

    If you have to ask, you can't afford it. Just remember that. It might come in handy again someday. :)

  13. I am NEVER going to get caught up... by ackthpt · · Score: 2
    I swear, I can't keep up. I just got myself up to 1/2 terrabyte and could easily get to a full terrabyte, then this petabyte sh!t comes along.

    Ah well, all I'd do is fill it up with Simpsons episodes....

    --

    A feeling of having made the same mistake before: Deja Foobar
  14. Ummmm by akira69 · · Score: 2, Funny

    20 feet by 8 feet by 8 feet eh.... I thought they had that in the 1970s

    1. Re:Ummmm by Anonymous Coward · · Score: 3, Funny

      "20 feet by 8 feet by 8 feet eh"

      In 2010 it better be 1 by 4 by 9.

    2. Re:Ummmm by shigelojoe · · Score: 2, Funny

      Obscure? This is Slashdot we're talking about here. ;D

  15. Re:wrong by Vihai · · Score: 3, Funny


    Wrong :)

    One PetaByte is 1,000 TeraBytes which are 1,000,000 GigaBytes wich are 1,000,000,000,000,000 Bytes wich are 3.35 LotsOfPr0n.

  16. Sooooo.... by Boogaroo · · Score: 2, Interesting

    Think someone like the government is going to keep track of who buys these things?

  17. Ahhh... by 10101001011 · · Score: 2, Funny

    I just updated my old stone disk (it was erroding) with the latest gerbil in a wheel (I couldn't afford the guinea pig) with awesome seek times (he can seek food in less than 30 nanoseconds) and at least double-digit RPMs.

  18. Ohh Good, now I can buy Longhorn by l0ungeb0y · · Score: 4, Funny

    Though it says Linux is the standard OS, I'm hoping they plan on optimizing for Longhorn... so far this is the only system out that can meet Longhorns recommended disk capacity and RAM requirements. ...now if they could only find a way to fit all that into a mini-ATX tower.

  19. two words by Loconut1389 · · Score: 4, Interesting

    Good God.

    or alternatively

    What for?

    At least as far as the next year or two is concerned. RIAA has all but outlawed music on the computer and even so, a petabyte of $1.25 songs would cost you more than bill gates makes in a year. If you have a petabyte of home movies, you must be making porno films.. If you have a petabyte of DVD's ripped, you have several life sentences coming, even if you own all the dvd's somehow (more bill gates salary multiples). And if you have text files, then holy grapes batman, youll never read all that in 10 lifetimes.

    I can see uses in the comercial realm, buying multiple units in order to backup. But if this is in anyway marketed toward the consumer, only the biggest 'mine has to be bigger than yours' geek would buy something like that right now. I'll probably have one of those on my desk/floor about 5 to 7 years from now when its affordable/realisitic for me.

    1. Re:two words by pbox · · Score: 5, Insightful

      Assuming 2 layered disks that is 10 GB per disk (feeling generous).
      100 disk -> 1 TB
      15000 disks -> 150 TB.

      Netflix has a "mere" collection of 15000 disks. Your patebyte disk is only 1/6th full.

      You upload all music CDs: 1 GB per disk (feeling generous).

      How many CDs can be in print? Maybe a 500,000?

      That is only 500 TB. Now your disk is 2/3rd full.

      Lets upload all printed material. May or may not fit in the rest.

      Then again, if you want to archive the internet: ~6G pages. 10kB each. 60 TB. each run. Store the last 16 versions -> 1TB.

      --
      Code poet, espresso fiend, starter upper.
    2. Re:two words by maczmail · · Score: 2, Funny

      Good luck trying to create a taxonomy to organize a PB worth of objects much less a file system! /dev/FrigginHUGE

  20. Not really a Petabyte...yet by Berylium · · Score: 4, Insightful

    From the site:

    PILOT STATUS 5/2004
    * The first 100TB Rack is up and running!
    * The second 100TB Rack will be up by the end of May
    * Thermal Targets have been met
    * Systems Booted from USB Dongle
    * Reiser FS running
    * PC-based Router running


    Maybe I'm missing something but this looks to me like they don't really have a Petabyte of storage working but plans to incorporate a Petabyte of storage with only 100 TB up and running now. Not that 100 TB is anything to brush off.

  21. one petabyte? by circletimessquare · · Score: 4, Funny

    one petabyte ought to be good enough for anybody

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  22. To bad it won't last... by gremlins · · Score: 5, Insightful

    I know the pull is to get these things as big as you can get but i would love to see hard drives that will work for ever. Now I know everything breaks but I mean in 400 years how is anyone going to know what we were like if all the data on us slowly goes away because the hard drives or the cds don't really last very long

    --
    just because your a schizophrenic doesn't mean people arn't really out to get you
    1. Re:To bad it won't last... by Tailhook · · Score: 4, Funny

      ...but I mean in 400 years how is anyone going to know what we were like...

      What tiny fraction of our history is actually preserved in a useful manner will be misinterpreted and spun in ways you can't possibly imagine. 400 years from now you will be known as an ignorant fool guilty of untold crimes against lord knows whom. This will be true regardless of the quality of the archive used to figure you out.

      So don't worry about it.

      --
      Maw! Fire up the karma burner!
  23. Re:wrong by Anonymous Coward · · Score: 4, Funny

    After extensive market research, it has been determined that a "buttload" is roughly 3.5 gigabytes.

    In case you were wondering.

    It's my favorite unit of measure.

    Coincidentally, I have a buttload of porn.

  24. Re:wrong by Loconut1389 · · Score: 2, Informative

    Depends on whether you're talking Pb or PBi.. If i recall with the big HD size debate several months ago, the Gb/Mb are multiples of 1000 whereas GBi/MBi are multiples of 1024.. maybe i have the abbreviations wrong.. but there are separate units for 1024 multiples due to some whacky issue with SI units or something.. does anybody remember the link to that thread?

  25. Aha! by Dark+Lord+Seth · · Score: 3, Funny

    See? Those Longhorn specs are quite easy to achieve... Now let's sit back and wait for Intel or AMD to come up with a 1x1x1m slab of silicon that can melt graphite and run Longhorn at the same time!

  26. Re:"a million gigabytes"... by I+confirm+I'm+not+a · · Score: 2, Informative

    As always, wikipedia has the answer(s):

    Because of irregularities in definition and usage of the kilobyte, the exact number could be any one of the following:
    1. 1,125,899,906,842,624 bytes - 10245, or 250. This is 1024 times a terabyte. This is the definition used in computer science and computer programming
    2. 1,000,000,000,000,000 bytes - or 10 15.

    Damn! Ambiguity!

    --
    This is where the serious fun begins.
  27. Useless Statistics! by INMCM · · Score: 5, Funny

    1 MILLION GIGS! BAH! That isn't news unless they convert it to some entirely inappropiate metric. How many Library of Congresses is this? How many 128kbps MP3s can you store on it. And most importantly, how many floppy disks is this equivalent too?!

    --
    Caffeine Good
    1. Re:Useless Statistics! by isorox · · Score: 5, Informative

      How many Library of Congresses is this?

      50

      How many 128kbps MP3s can you store on it.

      250-300 million depending on song length

      And most importantly, how many floppy disks is this equivalent too?!

      700 Million - nearly 40,000 miles when laid end on end, or about 1500 miles when stacked on top of each other.

    2. Re:Useless Statistics! by IntelliTubbie · · Score: 4, Informative

      How many 128kbps MP3s can you store on it.

      Glad you asked. Assuming that we have a 10^15 byte disk (which is how those decimal-loving hard disk manufacturers would define it), and your MP3s are encoded at 128kbps (where 1 kb = 1024 b = 128 B, as the binary folks would have it), then you could listen to MP3s nonstop for:

      10^15B/(128kb/s * 128B/kb) = 61035156250 seconds ... which works out to just about 1934 years without hearing the same song twice.

      Cheers,
      IT

      --

      Power corrupts. PowerPoint corrupts absolutely.

    3. Re:Useless Statistics! by Combuchan · · Score: 3, Informative

      > >How many Library of Congresses is this?

      > 50


      According to this article, a Library of Congress is approximately 10 TB (who knew--this obtuse metric actually has a measurement!!!)

      http://articles.findarticles.com/p/articles/mi_m 0B MD/is_39_9/ai_98189690

      So the device actually can contain 100 Libraries of Congress.

      --
      "[T]he single essential element on which all discoveries will be dependent is human freedom." -- Barry Goldwater
    4. Re:Useless Statistics! by aboyko · · Score: 4, Interesting

      How many Library of Congresses is this?

      50

      Oh, it isn't, either. Will you people knock it off already with the Library of Congress == 20TB comparison? It's some sort of inane computation made as if the collection were only books, and all the books were represented as ASCII text only. Well, guess what? It's not, and they're not.

      American Memory alone is a good bunch of terabytes, and that's just a wee digitized slice, just several million objects, of all the stuff in the Library. There's a lot. Of Stuff. A lot a lot a lot. Pictures. Maps. Movies. Big ol' stuff.

      Well, I feel better. Thanks!

    5. Re:Useless Statistics! by carlislematthew · · Score: 3, Funny
      "40,000 miles when laid end on end"

      What the hell does that mean? How many times around the earth is that? How many times to the moon and back?

      START MAKING SENSE!

  28. Re:Price? by RollingThunder · · Score: 4, Informative

    In the "discussion" blocks down below there's a price link.

    Rack materials cost is currently estimated to be $121K for 96TB. Node materials are a just under $1450. This price does not include markup, assembly or burn-in from the system integrator and thus will increase by another 5-7% to approximately $130K/rack.

    The weight of a fully-loaded rack is estimated to be 1500 lbs. That figure may rise depending on what hardware is required for rack cooling.

    Power is estimated to be 5500 watts. This too will depend on rack level cooling equipment.

    These figures assume no external 1G Ethernet NICs.

    For a breakdown of all the above, see the attached spreadsheet.

  29. I can see the office manager now... by stienman · · Score: 4, Funny

    Office manager: "Hey, Adam, do you know why our power usage might have gone up this last month?"

    I surreptitiously conceal the firewire cable going out the side door.

    Adam: "No, John, I haven't the foggiest."

    OM: "Ok, well I'll ask Kim when I talk to her about the strange shipping container outside. Thanks."

    -Adam

  30. A petabyte here, a petabyte there... by rikomatic · · Score: 2, Funny

    ... and pretty soon you're talking REAL storage.

  31. Immortality? by Sir+Nimrod · · Score: 5, Funny

    In his novel 3001 Arthur C. Clarke asserted/speculated that one petabyte would be sufficient space to store a lifetime's memories. (He didn't say if this was compressed.)

    So, assuming you can handle the trivial exercise of transferring your memories (the implementation of which is left as an exercise for the reader), immortality is yours for the buying!

    1. Transfer memories to Petabox. Sign with your public key, so everyone knows it's you. Don't encrypt!
    2. Put Petabox in shipping container, along with retrieval instructions in English, Esperanto, and Chinese (to cover your bases).
    3. Bury shipping container in Yucca Mountain. (It's unlikely to ever see any nuclear waste, and it'd be a shame to waste the space.)
    4. Kill yourself.
    5. Wait for a society (a) advanced enough to restore you and (b) rich enough to bother.
    --
    The United States of America: We mean well.
    1. Re:Immortality? by theLOUDroom · · Score: 4, Funny

      Transfer memories to Petabox. Sign with your public key, so everyone knows it's you. Don't encrypt!

      Don't encrypt! Ye gods man!
      I don't know about you, but I have a few things I wouldn't like people to know about, even long after I'm dead.
      I mean, let's face it, not all our memories are that flattering.

      And anyways, I'm pretty sure some of the memories from my college years are already "encrypted".

      --
      Life is too short to proofread.
  32. The cost... by brokenspark · · Score: 2, Informative

    "Rack materials cost is currently estimated to be $121K for 96TB"

    http://www.archive.org/iathreads/post-view.php?id= 13509

  33. read the docs by curator_thew · · Score: 2, Interesting

    If you look down in the message list, you see a reference to pdf + ppt docs. Here's another related project Planet Ten Modular Data Centers.

    Yes, it's a petabyte once you fill the shipping container. Honestly, I thought of this idea last year (using stock shipping containers), and now I'm fascinated that they've made it happen.

    My only suggestion is that this is prototype: the eventual production systems (say, a couple of years time) should have custom shipping containers for:
    * any of the side panels can open to access a rack and hot swap failing racks, so there is no need for a middle entry aisle
    * the cooling system should be built into the structure, like existing refigerant containers
    * not just data storage, but also computing facilities

    1. Re:read the docs by TTK+Ciar · · Score: 2, Interesting

      > * the cooling system should be built into the structure, like
      > existing refigerant containers

      There's been discussion/research at the Archive regarding exactly this :-)

      > * not just data storage, but also computing facilities

      Each petabox node is a completely functional ITX PC, running Debian Linux, so it is already also a computing cluster. It's not a very good one, unfortunately, because in the interest of keeping the heat and power down a very low-power processor was used, an 800MHz Via C3, which is marginal as an integer-intensive processor.

      Also, the storage nodes (which comprise 796 of the 800 nodes) only have 100bT ethernet, which would limit the petabox's ability to efficiently make use of what processing power it does have (in the data distribution phase).

      Nonetheless, 796 800MHz processors is nothing to sneeze at. When the PetaBox is being a PetaBox, its processors are mostly idle. I'm sure folks will figure out something to do with those spare cycles (or perhaps not, considering that exercising the processors noticeably increases the power and cooling burden).

      -- TTK

  34. Re:Uber-huge! by DrLZRDMN · · Score: 2

    I think your confusing square footage with cupic footage, if the average house had ten foot high ceilings, it would only take up 16 feet of floor space

  35. In other news... by nacturation · · Score: 2, Funny

    Duke Nukem Forever is "ready"!

    --
    Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
  36. Re:Uber-huge! by vlag · · Score: 2

    I think you're confused. A normal 2 bedroom house (that doesn't have wheels) will have a floor space around 1600-1700 square feet. With 8 foot ceilings, the interior (3D) space would be about 12,000 cubic feet at least.

    --
    Do you want to remove linux?
  37. Hard drive lifespans by MikeBabcock · · Score: 2, Interesting

    I'd love to see an equivalent to all the benchmarking websites out there for telling me what hardware is reliable, and not just fast. I already know what the fastest drives, fastest video card, quietest fans, etc. are, but which ones last longest? Which drives *never* have failures that affect real data? Which cables are properly certified and insulated for high-volume transfer in a confined space rubbing up against other cables? Etc.

    If you know of such a site, tell me.

    --
    - Michael T. Babcock (Yes, I blog)
  38. Interesting link... by Noryungi · · Score: 4, Interesting

    It's right there under the pictures :

    http://capricorn-tech.com/

    The site is rather empty right now, but it seems this is the company that will market this petabyte machine... er... box... er... whatever the name is.

    --
    The right to offend is far more important than the right not to be offended. (Rowan Atkinson)
  39. themselves, not google... by SuperBanana · · Score: 3, Funny
    Can we say, Goooooooooooooooogle?

    I'm guessing they were referring to themselves, not google.

  40. Comment removed by account_deleted · · Score: 2, Interesting

    Comment removed based on user account deletion

  41. Re:Finally - monkeys by Splork · · Score: 2, Interesting

    monkeys eventually write both vi and emacs while working on shakespeare

  42. There's an easier way to do this... by exp(pi*sqrt(163)) · · Score: 2, Insightful

    ...just mount /dev/random as a petabyte drive. Admittedly it might be hard to find your data in there - but chances are it is in there somewhere.

    --
    Doesn't it make you feel good to know that our freedoms are protected by politicans, lawyers and journalists.
  43. Re:wrong by Elwood+P+Dowd · · Score: 4, Interesting

    Everyone I know says giga with a hard G. The only exception I know of is Christopher Lloyd's character in Back To The Future.

    --

    There are no trails. There are no trees out here.
  44. Makes life easier by esvoboda · · Score: 2, Funny

    Good, now I don't need to delete my spam.

  45. actually... by MarcoAtWork · · Score: 4, Funny

    given that monkeys have basically four hands I think they'd be more suited for emacs (alt-meta-control-super-hyper-shift-q) than vi ;-)

    I have been using emacs for nearly 10 years now and I swear sometimes I have been seriously considering adding a foot pedal or 2 to my setup (besides control, shift and meta I also routinely use Super and Hyper, xmodmap is great!)

    --
    -- the cake is a lie
  46. Scientific Data by Scott+Ransom · · Score: 4, Interesting

    I'm heavily involved in a 5-6 year project to use the Arecibo telescope to search for new pulsars. The project uses a new 7-beam receiver system, each of which takes data from up to 1024 nearby frequency channels. The data is 16-bit sampled over 15000 times per second from each frequency channel. We need the time and frequency resolution to find exotic millisecond pulsars.

    Over the couse of the survey we expect to take about 1 PB of data. We're still trying to figure out exactly how we will process and store it all.

    For more info, you can poke around here.

  47. Re:Price? by cliveholloway · · Score: 2, Informative

    From the forum:

    Rack materials cost is currently estimated to be $121K for 96TB. Node materials are a just under $1450. This price does not include markup, assembly or burn-in from the system integrator and thus will increase by another 5-7% to approximately $130K/rack.

    So, about $1.3M (10 racks)

    cLive ;-)

    --
    -- Trinity in high heels carrying a whip: The donimatrix - there is no spoonerism
  48. Re:Price? by theLOUDroom · · Score: 2, Insightful

    So, about $1.3M (10 racks)

    What would be interesting is to know the estimated maintenance costs as well. With than many drives, I imagine you'd be changing them like light bulbs, especially as time passes and the probability of each drive failing get's higher and higher.

    If one was really clever, they could use the failure rate of a typical hard disk and Moore's Law to estimate monthly replacement costs for the next 100 years or so. I would expect them to rise in the short term as the drives age, but fall in the long term as moore's law catches up.

    --
    Life is too short to proofread.
  49. WTF? by imag0 · · Score: 4, Funny

    If you gave me a 100 mbit line, it would take me over 92 days to fill it up with porn. More if I slept.

    yeah, but if you looked closer, it's the same 6 gigs over and over again.

  50. Petabox? BAH - GMAIL! by Compulawyer · · Score: 4, Funny

    I'm just gonna get 1,000,000 free Gmail addresses and email all my data to myself 1 Gb at a time.

    --

    Laws affecting technology will always be bad until enough techies become lawyers.

  51. If you're a big fan of this much storage? by stox · · Score: 4, Funny

    Does that make you a petaphile?

    [massive karma burn detected]

    --
    "To those who are overly cautious, everything is impossible. "
  52. Ozymandias by Boglin · · Score: 3, Insightful
    Okay, I've heard this too many times and I'm just starting to get sick of it. It's not the computers that are killing your quest for digital immortality; it's just the way that history works.

    You're complaining that these hard drives won't run forever and you're right. Neither will CD's. However, I would also like to point out that the vast majority of ancient egyptian papyrus isn't around today. Also, don't start goign off on using clay or stone tablets, because they break (even the Rosetta stone is broken).

    Honestly, computers are still far superior to what we were using before. It's not like we've got Homer's original version of the Illiad sitting in a museum somewhere; we just have many duplicated copies that have been reproduced over the years. You're right that hard drives fail and CDs break, but we can keep updating onto new media. Besides, when a monk drops an iota when transcribing the Bible, Jesus goes from being God to godlike. When a computer adds an iota, the checkbit fails and the data is resent.

    Somebody is also going to point out that, as systems change, data can become unreadable. Heck, I had a professor who couldn't update his lab instructions because the software that read the lab printouts wouldn't run on new machines and the fileformat wasn't understood by any other software. So, want to stop our data from becoming unreadable? Well, let's just do what the Etruscans did! Of course, we don't have a clue what they did because nobody can read Etruscan. For a more familiar example, think of heiroglyphics before the Rosetta stone. It's pretty common for data to become lost and unreadable. Also, this bring us back to the solution. Along with the data, include the source code for the software that can read it. If you really want to be anal, you could even include the source to an emulator for the machien it was designed to run on.

    Still, you might point out, 400 years from now, we'll still lose 99% of that do to failures of whatever nature. Once again, you would be be right. However, do you honestly believe that we have 1% of all the data that was collected in 1604? Hell, most of the people couldn't even right, so we don't know ANYTHING about their lives. I'm sorry that we can't digitally preserve our wonderous society for all of eternity, but it's completely blind to believe that this makes us in ANY way different to any other culture. Read Percy Shelley's Ozymandias before complaining about how people in the future won't know what our lives were like.

  53. Replacing HDs could be a pain by gumbi+west · · Score: 3, Insightful

    If you expect a hard drive to fail after three years (I'm guessing) but these occurances are randomly distributed (an assumption that will be true after running this thing for a year or two) you can then expect that the 4000 hard drives in this array would have about 3 failures per day. This thing would never be at full speed! it would be constantly restructuring its RAID. Also, it would cost about $300 just in hard drives (not to mention controllers, power supplies, et cetera).

    1. Re:Replacing HDs could be a pain by vidarh · · Score: 2, Insightful
      Yes, but you aren't seriously suggesting it would be one RAID over all the disks are you?

      So assuming 3 failures a day, at most 3 RAID's would be running slower a day. Assuming 4 disks per RAID that's at most 12 disks at reduced performance, or 0.3% of the total data set that isn't available at full speed. If that is an issue, you duplicate any data that MUST be available on multiple nodes.

  54. That's Nice But... by dbretton · · Score: 3, Funny


    Is there enough disk space left to do anything useful after installing WinXP on it?

  55. The Ultimate Storage Solution? I think not... by ning · · Score: 2, Interesting

    10^15 bytes? Each human on Earth has 3 billion (3x10^9) base pairs of DNA. Assuming 2 bits to encode a base pair of DNA, that means a PetaBox(tm) can only store the DNA of 1.3 million people. So you'd need getting on for 5000 of these (assuming no compression) to store the entire population.

  56. Just nerds seem to reply... by Chreo · · Score: 2, Funny

    ... but honestly, that's a lotta pr0n!

    --

    Life is what happened when Good Intentions met Harsh Reality (the brother of the more infamous Chaos).
  57. Re:wrong by grahamlee · · Score: 2, Informative
    bytes are not SI units, those would be octets

    Actually the SI defines the prefixes irrelevant of units used. Think of the mil ('milli-inch'); how many do you think there are in the inch? If I had a thousand cats I could refer to the set as one kilocat, and hence if I had 1024 cats I could refer to it as a kibicat, Tweety-pie style; note that a cat is not an SI metrological term. Try playing around with the units(1) command sometime; to get a feel for these SI prefixes.

  58. Re:You think they could spare a boot disk. by vidarh · · Score: 2, Insightful
    Laugh all you want but what they are doing makes sense: Recovering a node in case of a crash or messed up filesystem is easy - you replace the dongle and hit the reset button. No need to have space wasting CD drives or floppy drives, and the rest of the OS can be pulled down over the network.

    The last thing you want with a setup like this is having to haul hardware around or disconnect stuff if you for any reason can't boot of the disks anymore. And you certainly don't want to reduce density by wasting space that could be filled with disks with other stuff.