Slashdot Mirror


Digital Big Bang — 161 Exabytes In 2006

An anonymous reader tips us to an AP story on a recent study of how much data we are producing. IDC estimates that in 2006 we created, captured, and replicated 161 exabytes of digital information. The last time anyone tried to estimate global information volume, in 2003, researchers at UC Berkeley came up with 5 exabytes. (The current study tries to account for duplicating data — on the same assumptions as the 2003 study it would have come out at 40 exabytes.) By 2010, according to IDC, we will be producing far more data than we will have room to store, closing in on a zettabyte.

176 comments

  1. XXX by daddyrief · · Score: 5, Funny

    And half of that is porn...

    --
    "Banking establishments are more dangerous than standing armies." -Thomas Jefferson
    1. Re:XXX by the-amazing-blob · · Score: 4, Funny

      Way to clog the tubes up, guys. Seriously. :P

    2. Re:XXX by harp2812 · · Score: 1

      Only half? Hmm, the porn industry must be slowing down.

      --
      I've found that nurturing one's Zen nature is vital to dealing with technology. Violence is pretty damn useful too.
    3. Re:XXX by LighterShadeOfBlack · · Score: 1

      And half of that is porn... Only half? Honestly?
      --
      Spelling mistakes, grammatical errors, and stupid comments are intentional.
    4. Re:XXX by daddyrief · · Score: 2, Funny

      That's just what happens after a 'digital big bang.'

      --
      "Banking establishments are more dangerous than standing armies." -Thomas Jefferson
    5. Re:XXX by Quantam · · Score: 0, Redundant

      And half of that is porn...

      I'd imagine more in the 95%-99% range...

      --
      You have tried to support your argument with faulty reasoning! Go directly to jail; do not pass Go, do not collect $200!
    6. Re:XXX by iamstretchypanda · · Score: 1

      I don't know about you, but my tubes aren't clogged at all. Seriously. :p

    7. Re:XXX by Coucho · · Score: 0

      In other news, colleges around the country are requesting their male students to clog the tubes-- at home.

      --
      *pSig = NULL;
    8. Re:XXX by maxume · · Score: 4, Funny

      They upgraded to mpeg4.

      --
      Nerd rage is the funniest rage.
    9. Re:XXX by Gunslinger47 · · Score: 1

      And half of that is porn...

      ...and in another five years, the other half will be Wikipedia.

    10. Re:XXX by sukotto · · Score: 1

      and most of the rest of it is crappy YouTube flicks

      --
      Come play free flash games on Kongregate!
    11. Re:XXX by Anonymous Coward · · Score: 0

      isn't the point of porn to unclog the tubes?

    12. Re:XXX by Benaiah · · Score: 1

      So after the digital big bang,
      Do us nerds get paid for a money shot?

    13. Re:XXX by bursch-X · · Score: 1

      Nah, the other half is spam, I'd guess.

      --
      There are two rules for success:
      1. Never tell everything you know.
    14. Re:XXX by Anonymous Coward · · Score: 0

      Wikipedia is actually pretty small. Every past revision of every page (since all revisions are retained) as well as all images (and all past revisions of those too), of every language version of the project adds up to only a couple of terabytes. There are individuals with more data than Wikipedia stored on their hard drives (think downloaded movies).

      If you just take the text of the current revisions of all the articles in the English version, and compress it, you have around 2GB. Fits easily on a USB memory stick.

    15. Re:XXX by Anonymous Coward · · Score: 0

      I like computer tech and all, but could I get an estimate as to how much information the human brain processed in the last year including not just simple external information processing but all that involved in every function of the human body.

    16. Re:XXX by mikael · · Score: 1

      And the other half is spam...

      --
      Vintage computer adverts: http://www.vintageadbrowser.com/computers-and-software-ads
    17. Re:XXX by CoolCat · · Score: 1

      don´t you think half is a little low estimate?

  2. It was only 9 megs by noewun · · Score: 5, Funny

    Without Slashdot dupes.

    --
    I am a believer of momentum and curves.
    1. Re:It was only 9 megs by cmacb · · Score: 4, Insightful

      HA!

      But seriously, I wonder what percentage of this data is text. I'd guess it is a very very small amount. When I had a film camera, in twenty years I bet I took less than 100 rolls of film. With digital cameras I've take thousands of pictures, sometimes taking a dozen or more of the same subject, just because the cost to me is practically zero. Now there are vendors that will let me upload large numbers of these amateurish photos for free, and let's pretend that there are enough people interested in seeing my pictures that these companies can pay for this storage with advertising. That's scary.

      Excluding attachments I think it would be practically impossible for anyone to use up Googles 2 gig of storage, but I've heard of people using it up in little more than a week by mailling large attachments back and forth (oh yeah, I HAVE to have every single iteration of that Word document, sure I do!)

      But what's scarier is that for some nominal fee (like $20 a year) they place no limit at all on my ability to hog a disk drive somewhere. I know people who are messed up in the head enough to want to test these claims. Give them 5 gig for photos and they've filled it up in a week, give them "unlimited" and they upload pure junk to see if they can break the thing.

      Like any house of cards, this thing is gonna come down sooner or later. I just hope that people who are making sensible use of these online services don't lose everything along with the abusers.

    2. Re:It was only 9 megs by neax · · Score: 1

      the earth is going to become one giant hard disk. perhaps we should outsource to the moon...or mars.

      --
      Hard work is just an accumulation of the easy things that you didn't do when you should have.
    3. Re:It was only 9 megs by ET_Fleshy · · Score: 1

      There is an extension for Firefox that lets you use your gmail account as an online storage database. This makes it quite realistic to use up all of your space. Enjoy ;)

    4. Re:It was only 9 megs by Fweeky · · Score: 1
      "Excluding attachments I think it would be practically impossible for anyone to use up Googles 2 gig of storage"

      -% du -sh Mail
        27G Mail
      Only about 5 years worth, and I don't deal much with attachments. Or do you not class "I don't bother deleting email" as practical? Works well enough for me :)
  3. Finally, an excuse... by bigforearms · · Score: 5, Funny

    The furry porn gets deleted first.

    1. Re:Finally, an excuse... by Anonymous Coward · · Score: 1, Funny

      The furry porn gets deleted first.

      I'd mod you down, but there's no -1, Fursecution option.

      Damn you Slashdot! When will it ever stop!?!?!!~

    2. Re:Finally, an excuse... by PitaBred · · Score: 1

      You really don't need an excuse to delete that stuff. I mean it.

  4. How many... by Looce · · Score: 3, Funny

    ... times does the Library of Congress fit in that? Exabytes simply don't speak to me.

    Alternatively, you can also answer in anime episodes, or mp3 files.

    1. Re:How many... by LighterShadeOfBlack · · Score: 5, Funny

      That'd be 1,191,400 Libraries of Congress.

      Honestly, I don't know why the /. editors allow these "scientific articles" that only provide data in these obscure and archaic "byte" measurements. Absurd!

      --
      Spelling mistakes, grammatical errors, and stupid comments are intentional.
    2. Re:How many... by bluemonq · · Score: 1

      Assuming the "standard" fansub size of 233MB for an ED release, it would be about 760 billion episodes of anime. To put that in perspective, Naruto is currently 224 eps long (85 too many).

    3. Re:How many... by franksands · · Score: 5, Informative
      Since you asked:

      Oh, the equivalents! That's like 12 stacks of books that each reach from the Earth to the sun. Or you might think of it as 3 million times the information in all the books ever written, according to IDC. You'd need more than 2 billion of the most capacious iPods on the market to get 161 exabytes.

      I don't have anime estimates, but I can make a Heroes analogy.a hi-def episode is more or less 700mb. Considering the first season has 23 episodes, that would make 16.1gb. So 161 exabytes would be 10,000,000,000 (ten billion) seasons of Heroes. Since the earth currenlty has around 6.6 billion people, this would mean that you would have 1 episode for each person on the planet, and all the people of China, India and the US would have a second episode. That's how big it is.

      Regarding the storage space, I call shenanigans. We already have HDD that stores terabytes. A couple years from now, MS office will require that space to be installed.

    4. Re:How many... by Anonymous Coward · · Score: 5, Funny

      760 billion episodes of anime.In other words, about half the length of a typical Dragonball Z fight scene.

    5. Re:How many... by susano_otter · · Score: 1

      Um, I already have several Hero-episode equivalents stored here at home. If all this data adds up to everybody gets a DVD, and some people get two, it doesn't seem like something anybody would really even notice, amongst all the DVDs they have already.

      --

      Any sufficiently well-organized community is indistinguishable from Government.

    6. Re:How many... by franksands · · Score: 1

      Small mistake, where it reads "one episode for each person" it should read "one season for each person", and the same for the second sentence. Quite a lot, huh?

    7. Re:How many... by PitaBred · · Score: 1

      Yeah, well... I'll see your Heroes reference, and raise you a Firefly quote (that is vaguely, but not really at all, related): Dr. Simon Tam: Uh, her... her medications are erratic. There's-there's not one that her system can eventually break down, and... Mal: When want a lot of medical jargon, I'll talk to a doctor. Dr. Simon Tam: You are talking to a doctor.

    8. Re:How many... by JazzLad · · Score: 1

      Ya, but the standard DVD-rip of Firefly's first (and so far only) season is only about 6.5GB... ;)

      --
      "If you have nothing to hide, you have nothing to fear." - Every fascist, ever
    9. Re:How many... by Anonymous Coward · · Score: 0

      Reminds me of point-and-click game "Leisure suit Larry 7". You are in a computer room which has two tape rolls for data. Clicking on one of these and choosing Info (or whatever it was you had to choose), Larry says something like this: "Amazing. Each of these tapes can hold one terabyte - but pretty soon it will take two of these just to hold Microsoft Office". It's a 1996 game!

    10. Re:How many... by franksands · · Score: 1

      Yeah...good old days of LSL. I must have taken the phrase from there :P

    11. Re:How many... by kv9 · · Score: 1

      are they still on Namek?

    12. Re:How many... by ioshhdflwuegfh · · Score: 1

      1,191,400 Libraries of Congress ought to be enough for anyone. and would, like exabytes above, contain lots of redundant information.
    13. Re:How many... by pafrusurewa · · Score: 1

      Since the earth currenlty has around 6.6 billion people, this would mean that you would have 1 episode for each person on the planet
      Oh, I love nonsensical comparisons like that.

      "The population of the UK has reached 60 million for the first time. If they were to stand on top of each other they would reach about 20 feet before toppling over." -- paraphrased from BBC Radio 4 News last August or September.
    14. Re:How many... by Bugs42 · · Score: 1

      A couple years from now, MS office will require that space to be installed. Sir, that is preposterous!
      Suggesting the MicroSoft would make a product requiring such massive amounts of space is unthinkable! Why, we all know that 161 exabytes should be enough for anyone. MS produces nothing less than the cleanest, leanest, most efficient code on the planet.

      Now, if you'll excuse me, I've got to get back to work - I'm installing Office 2007 on a new machine here, and I'm currently on HD-DVD 342 out of 5000.
      --
      Programmer: an ingenious device that converts caffeine into code.
    15. Re:How many... by Anonymous Coward · · Score: 0

      We are just hungry for more. We can't help ourselves. My lunch alone contained 16 sandwichbites.

  5. Sorry, my fault... by slobber · · Score: 5, Funny

    I left cat /dev/urandom running

    --
    "You mortals are so obtuse." -Q
    1. Re:Sorry, my fault... by iPaul · · Score: 1

      You might as well. I imagine 99.999% of that is useless gibberish that people are retaining the way they retain random loose screws and bolts in coffee cans in their garage.

      --
      Leave the gun, take the cannoli -- Clemenza, The Godfather
    2. Re:Sorry, my fault... by product+byproduct · · Score: 4, Interesting

      Amazingly it would take 1,600,000 years for /dev/urandom to produce 161 exabytes (assuming 3.2 MB/s, YMMV)

    3. Re:Sorry, my fault... by DarkAxi0m · · Score: 1

      not unless we use a beowulf cluster or some kinda boinc project...

    4. Re:Sorry, my fault... by iggymanz · · Score: 0, Offtopic

      not to mention the random loose screw we're retaining in the white house that's full of gibberish

    5. Re:Sorry, my fault... by Dirtside · · Score: 4, Funny

      Yes, but what he didn't say is that he left it running on every computer on earth.

      --
      "Destroy science and religion. Science would re-emerge exactly the same; but not religion." - Penn Jillette, paraphrased
    6. Re:Sorry, my fault... by T-Ranger · · Score: 2, Funny

      Checking quickly, your comment just showed up in my /dev/urandom at 74629629165936 blocks of 1k. It may be in there again.

    7. Re:Sorry, my fault... by Anonymous Coward · · Score: 0

      Others call it SETI@Home.

    8. Re:Sorry, my fault... by Anonymous Coward · · Score: 0

      technically that could be defined as producing data, so good luck topping it when the next statistic comes out.

  6. And here I thought Malthus was dead by Anonymous Coward · · Score: 5, Insightful

    We won't be running out of space just like we didn't run out of food. New technology will allow us to store ever more data.

    1. Re:And here I thought Malthus was dead by LighterShadeOfBlack · · Score: 3, Informative

      As the article notes, the amount we produce is not the same as the amount we would actually want to store. Since that 161EB includes duplications such as broadcasting, phone calls, and all manner of temporary or real-time data it's not really relevant to compare that number with storage capabilities as the summary implies.

      --
      Spelling mistakes, grammatical errors, and stupid comments are intentional.
    2. Re:And here I thought Malthus was dead by Anonymous Coward · · Score: 1, Insightful

      The Earth has a finite mass, which at the very least means we can't have infinite humans on Earth. So there IS a carrying capacity, Malthus just guessed the wrong number.

    3. Re:And here I thought Malthus was dead by Spunkee · · Score: 2, Insightful

      The mass of the Earth would increase as the number of humans increase up to a theoretical limit approaching infiniti.

      They wouldn't fit comfortably, and you'd certainly have to stack them. The acceleration of gravity would increase as more humans were added to the mass of the Earth.

      Possibly, you'd have to import food from throughout the universe... I'm not sure if conservation of mass applies to a planet and all living (or not living) entities on it... Debris from space that enters Earth's atmosphere may or may not be useful in reproducing humans.

      Of course, as the mass approaches infiniti the universe would begin to be pulled toward Earth eventually ending in one big-assed mass. Maybe. Smoke some weed, drop some acid, and figure it out for yourself. Most of you are so left-brained a little mental exploration would probably be good for you.

    4. Re:And here I thought Malthus was dead by Patrik_AKA_RedX · · Score: 2, Funny

      Exactly. My company is developing a new storage medium based on penistechnology. If you don't have enough space, just play with it and it gets bigger. We're close to commercial release, just one more critical bug to iron out: it tends to burst out data if you try to enlarge it for too long.

    5. Re:And here I thought Malthus was dead by pipatron · · Score: 1

      The acceleration of gravity would increase as more humans were added to the mass of the Earth.

      Only if the humans are added from another source, say, some sort of space-humans. Humans grown on earth would retain the current mass.

      --
      c++; /* this makes c bigger but returns the old value */
    6. Re:And here I thought Malthus was dead by poot_rootbeer · · Score: 1

      We won't be running out of space just like we didn't run out of food.

      Correction: we haven't run out of food, yet.

      But we animals have a self-correcting system as far as that goes; if the food supply isn't sufficient to support a large population, then the population simply grows at a slower rate. The same will happen with our data; if we reach a point where we can't store all the non-ephemeral data we generate, we'll reflexively limit the amount of non-ephemeral data that we generate.

  7. What's an exabyte? by Anonymous Coward · · Score: 2, Informative

    Simply put, a lot

    10^18 bytes, or One million terabytes

    1. Re:What's an exabyte? by springbox · · Score: 2, Informative

      Did they measure in exabytes or exbibytes (2^60 bytes)? The difference between 161 exabytes and 161 exbibytes are 24,620,362,241,702,363,136 bytes - about 21.36 exbibytes. Kind of important since the margin of error will only increase as the measured data grows. (Lets stop using the SI units when we don't actually mean it.)

    2. Re:What's an exabyte? by HappyEngineer · · Score: 1
      I've got a much more thorough page at: http://g42.org/tiki/tiki-index.php?page=BigNumbers

      Yotta is the largest metric prefix and it's the next one after Zetta, so it looks like the standards people are going to have to get together to name some more prefixes.

    3. Re:What's an exabyte? by Anonymous Coward · · Score: 0

      It's not terribly important in this case because in the course of their calculation they multiplied by the pulled-out-of-their-ass single-significant-digit number "3". Also a quick look at TFA will reveal that they used "metric" exabytes.

    4. Re:What's an exabyte? by TranscendentalAnarch · · Score: 1

      When you realize that the data created at a single personal computer in your home accounts for One 161-millionth of the entire world's data production, you gotta ask yourself: do I seriously have time to watch all this pron?

    5. Re:What's an exabyte? by FiloEleven · · Score: 1

      (Lets stop using the SI units when we don't actually mean it.) Fine, but can we please come up with less stupid-sounding names for units than "gibibytes," "tebibits," "exbibytes," and the like?
    6. Re:What's an exabyte? by Anonymous Coward · · Score: 0

      One MILLION terabytes!! (pinky in corner of mouth)

    7. Re:What's an exabyte? by springbox · · Score: 1

      Well do you have a better idea? Because these have been floating around for a while and I haven't seen anyone try to actually propose alternatives. I think the abbreviations sound cool at any rate (KiB, MiB, GiB, etc.)

  8. What if ISP's are forced to retain data? by cryfreedomlove · · Score: 4, Interesting

    I imagine that a lot of this is web traffic logs. What if the US government really does force ISP's to keep records detailing the sites visited by their customers? Will my ISP rates increase to pay for all of that disk space?

    1. Re:What if ISP's are forced to retain data? by daeg · · Score: 1

      Just wait until the government hears that URLs change and they try to force ISPs to maintain a cache of pages along with the history.

    2. Re:What if ISP's are forced to retain data? by garcia · · Score: 2, Funny

      Will my ISP rates increase to pay for all of that disk space?

      No, of course not. Any law or regulation that the government comes up with doesn't have any hidden costs.

    3. Re:What if ISP's are forced to retain data? by daeg · · Score: 2, Funny

      Costs be damned when you're The Decider and, much to the dismay of IT budgets everywhere, can change time itself on a whim!

    4. Re:What if ISP's are forced to retain data? by jez9999 · · Score: 1

      Someone's already done it.

    5. Re:What if ISP's are forced to retain data? by eMbry00s · · Score: 1

      Yes. No free dinner.

  9. Must be the space donuts by Anonymous Coward · · Score: 5, Funny

    So the sum total of data has increased by a factor of more than 30 since 2003? I knew Brent Spiner was putting on weight, but damn.

  10. The awesome information we retain by iPaul · · Score: 5, Insightful

    Web server log files with the history of people clicking around. My address stored by everybody I ever bought anything on line from. It's more an information land-fill than an information warehouse.

    --
    Leave the gun, take the cannoli -- Clemenza, The Godfather
  11. And there used to be so little on-line data by Animats · · Score: 5, Interesting

    What's really striking is how little data was available in machine-readable form well into the computer era. In the 1970s, the Stanford AI lab got a feed from the Associated Press wire, simply to get a source of machine-readable text for test purposes. There wasn't much out there.

    In 1971, I visited Western Union's installation in Mawah, NH, which was mostly UNIVAC gear. (I worked at a UNIVAC site a few miles away, so I was over there to see how they did some things.) I was shown the primary Western Union international gateway, driven by a pair of real-time UNIVAC 494 computers. All Western Union message traffic between the US and Europe went through there. And the traffic volume was so small that the logging tape was just writing a block every few seconds. Of course, each message cost a few dollars to send; these were "international telegrams".

    Sitting at a CRT terminal was a woman whose job it was to deal with mail bounces. About once a minute, a message would appear on her screen, and she'd correct the address if possible, using some directories she had handy, or return the message to the sender. Think about it. One person was manually handling all the e-mail bounces for all commercial US-Europe traffic. One person.

    1. Re:And there used to be so little on-line data by Anonymous Coward · · Score: 0

      One person was manually handling all the e-mail bounces for all commercial US-Europe traffic. One person. Unbelievable! I'd pay good money to see footage of that!!

      Perhaps it looks something like this.
    2. Re:And there used to be so little on-line data by Anonymous Coward · · Score: 0

      Sitting at a CRT terminal was a woman whose job it was to deal with mail bounces. About once a minute, a message would appear on her screen, and she'd correct the address if possible, using some directories she had handy, or return the message to the sender. Think about it. One person was manually handling all the e-mail bounces for all commercial US-Europe traffic. One person.

      That made my jaw drop.

  12. "closing in on a zettabyte" by Supreme+Dragon · · Score: 3, Funny

    Is that the size of the next MS OS?

    1. Re:"closing in on a zettabyte" by josemayor1 · · Score: 0

      What if the US government really does force ISP's to keep records detailing the sites visited by their customers? Your are sure of this? http://www.dovoyeur.com/

    2. Re:"closing in on a zettabyte" by Anonymous Coward · · Score: 0

      Shouldn't the oceans have boiled by then?

    3. Re:"closing in on a zettabyte" by Patrik_AKA_RedX · · Score: 1

      Makes me wonder what the critical mass is of junkware. One of these releases is either going supercritical and vaporize the MS HQ or collapse on itself and suck the whole planet in.

  13. It doesn't seem like that much by gelfling · · Score: 1

    I'm just one person and I have 20GB just of OS and applications code. Plus another 20GB of MP3's. 161 billion /40 is about 4 billion 'gelfling people units'. Doesn't seem like a lot.

    1. Re:It doesn't seem like that much by Umbrae · · Score: 1

      You forget that this removes duplicates.

      Every OS file you have, application file you have, mp3 file you have, is only counted once. So 10000 gelflings is still only 40GB.

    2. Re:It doesn't seem like that much by gelfling · · Score: 1

      No it isn't I have 4 other people in my house, 4 other computers and they have even more per machine. I work for a company that does outsourcing. I don't think there is a reasonable estimate for the number of physical servers we manage. It's easily in the hundred thousand plus. How much DASD? Who knows. Figure one 100GB per server @ 40% utilization per x 100,000 = 400,000GB. Double that for offline and nearline storage. That's 800,000GB, easily.

    3. Re:It doesn't seem like that much by Anonymous Coward · · Score: 0

      I'm just one person and I have 20GB just of OS and applications code. Plus another 20GB of MP3's.

      How many music CDs do you own? Each CD has about 600-700MB. How many movie DVDs? 4.5-9GB each. VHS tapes? Maybe 1-2GB (wild guess).

  14. Supply and demand by rufty_tufty · · Score: 4, Insightful

    I'm sorry, how stupid is this?
    "producing far more data than we will have room to store"

    That's like saying, for the last 2 months, my profit has increased by 10%. If my profit keeps increasing at 10% per month, then pretty soon I'll own all the money in the world, and then I'll own more money than exists! Damn I must stop making money now before I destroy the world economy!!!

    Who are these people who draw straight lines on growth curves? Why do people print the garbage they write and why weren't they the first against the wall after the dot com bust?
    The only things that seem certain are death, taxes, entropy and stupid people...

    --
    "The weirdest thing about a mind, is that every answer that you find, is the basis of a brand new cliche" -
    1. Re:Supply and demand by Looce · · Score: 4, Insightful

      Actually, you're spending some of the money you earn, in investments. You are neither a sink nor a source of money.

      Though with data, some people, or even companies, are merely sinks. They store huge amounts of data, mostly for auditing purposes. Access logs for webservers. Windows NT event logs. Setup logs for Windows Installer apps. For ISPs, a track record of people who got assigned an IP address, in case they get a subpoena. Change logs for DoD documents. Even CVS for developers, to keep track of umpteen old versions of software. Even the casual Web browsing session replicates information in your browser cache. Many more of these examples could be given.

      We also need to produce more and more hardware to store these archived data, the most obiquitous of which is the common hard drive. In the end, we'll need more metal and magnetic matter than the Earth can provide.

      Martian space missions, anyone?

    2. Re:Supply and demand by ni1s · · Score: 1
      You forgot to assume the summary was sensational.

      "If everybody stored every digital bit, there wouldn't be enough room." Well, Duh!
    3. Re:Supply and demand by maxume · · Score: 1

      Stupid is relative. If stupid is a certainty, it implies something else is.

      --
      Nerd rage is the funniest rage.
    4. Re:Supply and demand by Rodness · · Score: 1

      Forget Mars, we can just tow asteroids into orbit and mine them. Hell, there's already one on the way! :)

    5. Re:Supply and demand by fafalone · · Score: 1

      That's assuming other storage mediums don't step up to replace todays magnetic storage. There's numerous other storage mediums on the horizon that don't use much metal or magnetic materials, that will far surpass traditional hard drives in data density. By the time the resources to make magnetic drives become prohibitively scarce, I just can't imagine it not being irrelevant because most new data is stored in some sort of crystal or organic material with such a high density that using up all the resources on the planet just to build enough storage devices will be just as far off as running out of magnetic material is today.
      New technology will keep up.

    6. Re:Supply and demand by Thuktun · · Score: 1

      We also need to produce more and more hardware to store these archived data, the most obiquitous [sic] of which is the common hard drive. In the end, we'll need more metal and magnetic matter than the Earth can provide. Right, and we ran out of wood because we used it all up heating our stone houses and all our land is taken up by pasture to feed the horses we use for transportation.

      Extrapolating our future needs based on the most common /current/ technology is a bit shortsighted.
  15. We won't produce more data than can be stored. by ProfessionalCookie · · Score: 4, Funny

    Data that cannot be stored will not be produced because all data that is produced must be stored. Data that is not stored (for however short a time) is not really produced.

    Then again the past no longer exists anyway, the future doesn't exist yet and the present has no duration- so maybe the data never existed anyway. Maybe you don't exist?!?! Awe man maybe I *~/ disappears in a puff of logic*
    ----
    Kudos to Augustine and Adams

    1. Re:We won't produce more data than can be stored. by xenophrak · · Score: 1

      This is of course, not true.

      I routinely have to compile static versions of my company's web stores in order to archive them and they are about 1GB each of HTML once compiled.

      Each store, however is about 100 megs of assets and then the data in the DB makes up another 50M or so. All of this is then generated dynamically and sent to client browsers that will just cache them temporarily. So, the data transmitted may be huge, but what people are storing would appear to be less.

      --
      Contrary to popular belief, life is not a bitch. It is far far worse.
    2. Re:We won't produce more data than can be stored. by Fordiman · · Score: 1

      May want to start compressing that shit. Use 7z; it's really good at noticing redundancies in logs and backups.

      --
      110100 1101000 1101000 1100110 0 1101111 1101000 1100011 1
    3. Re:We won't produce more data than can be stored. by istartedi · · Score: 4, Funny

      disappears in a puff of logic

      Great. Now we're all going to be inhaling second-hand logic. There ought to be a law...

      --
      For all intensive purposes, "whom" is no longer a word. That begs the question, "who cares"?
    4. Re:We won't produce more data than can be stored. by ioshhdflwuegfh · · Score: 1

      ...but supports only up to approx 16 exabytes long files.

    5. Re:We won't produce more data than can be stored. by Fordiman · · Score: 1

      Well, GP was talking about mere gigs of site backups. If he's backing it up monthly, it'll be untold moons before he hits the limit of 7z.

      --
      110100 1101000 1101000 1100110 0 1101111 1101000 1100011 1
    6. Re:We won't produce more data than can be stored. by szembek · · Score: 1

      It is stored, albeit temporarily. If there was no room to store it, it could not be created!

      --
      nothing
  16. 2010 by ni1s · · Score: 1

    An anonymous reader tips us to an AP story on a recent study of how much data we are producing. IDC estimates that in 2010 we created, captured, and replicated close to a zettabyte of digital information. The last time anyone tried to estimate global information volume, in 2006, researchers at IDC came up with 161 exabytes. (The current study tries to account for duplicating data -- on the same assumptions as the 2006 study it would have come out at 250 exabytes.) By 2012, according to IDC, we will be producing far more data than we will have room to store, closing in on 6 zettabyte.

  17. Internet | uniq by Duncan3 · · Score: 2, Insightful

    The problem is, everything is duplicated, a LOT. All those copies needs to be stored tho, so here we are swimming in data.

    My work machine that I backed up a couple weeks ago, was a 30MB zip file, and 3/4 of that was my local CVS tree. So out of a 30GB, less then 1/3000th was not OS, software, or just copied locally from a data store.

    At home, I've saved every email, every picture, everything from my Windows, Linux, OSX and every other box I've every had since ~1992, and that's barely a few GB uncompressed.

    The amount of non-duplicate useful material is far far smaller then your would think.

    --
    - Adam L. Beberg - The Cosm Project - http://www.mithral.com/
    1. Re:Internet | uniq by Firehed · · Score: 1

      Only a few gigs? You clearly don't have a camera that shoots in RAW... I've burned through well over ten gigs of storage just from mine, and I've owned it for all of six weeks (averaging to just under 300MB per DAY of new content). Sure, email takes next to nothing and I have plenty of duplicate content, but I have over a terabyte of storage and after doing my best to trim out redundancy, I still have a very sizable chunk of it used. I suppose it's really down to usage habits, but with 10+MP cameras and HD camcorders being available at consumer-level pricing, the amount of non-duplicated original content is going to shoot through the roof.

      --
      How are sites slashdotted when nobody reads TFAs?
    2. Re:Internet | uniq by Anonymous Coward · · Score: 0

      Congratulations on not being addicted to pr0n.

    3. Re:Internet | uniq by maxume · · Score: 1

      Are you keeping everything or stuff that is 'worth it'? I have the keep-everything mindset, but I take steps to help myself notice how rarely I use a lot of stuff. I figure if I ever end up with a raw capable camera and get in the habit of actually shooting raw, I would at least try to render lots of it to 'good enough' jpegs and not worry about it and only keep the raw for the shots where it was interesting(and hope that my ideas of interesting didn't change too much).

      Sd cards are getting pretty cheap. For a consumer camera, treating 1 GB cards as write only media isn't all that crazy at this point, if you figure ~2000 photos for $12(or even 1000). I sort of hope that 10 MP cameras don't make much progress, at least not as far out of lockstep as they are with the lenses and sensor size already.

      --
      Nerd rage is the funniest rage.
    4. Re:Internet | uniq by Duncan3 · · Score: 1

      I am an anti-packrat, I purge all the junk every time I transfer to a new hard drive (they never last long do they) and so I keep it pretty trim, having "stuff" is generally just more of a headache. Deleting email attachments you don't need also goes a long way, those 10MB .doc files that say the same thing as the paragraph of ASCII.

      I will admit the digital camera turned it from a CD into a DVD of backups, but I just need to get a really good 3-4MP camera, instead of a rather bad 8MP one. Also just deleting all the duds right away is a good 20x savings.

      --
      - Adam L. Beberg - The Cosm Project - http://www.mithral.com/
  18. Internet a product of biology? by blubadger · · Score: 2, Interesting

    In River Out of Eden Richard Dawkins traces the data explosion of the information age right back to the big bang.

    "The genetic code is not a binary code as in computers, nor an eight-level code as in some telephone systems, but a quaternary code with four symbols. The machine code of the genes is uncannily computerlike."
  19. Dumbest comment ever... by Anonymous Coward · · Score: 0

    "By 2010, according to IDC, we will be producing far more data than we will have room to store, closing in on a zettabyte."

    So I guess in the future you can't buy more hard drives or something...

  20. stupid by Lord+Ender · · Score: 0, Redundant

    By 2010, according to IDC, we will be producing far more data than we will have room to store
    Does anyone else find that statement to be utterly moronic?
    --
    A slashdotter who didn't build his own computer is like a Jedi who didn't build his own lightsaber.
    1. Re:stupid by Anonymous Coward · · Score: 0

      Thank god I'm not the only one.

      How the hell can you produce data but unable to store it, sounds like "Contradictio in terminis"? is it like writing all the data to /dev/null?

    2. Re:stupid by Anonymous Coward · · Score: 0

      What do you think happens every single time you pick up a telephone and call someone? Praytell, where is that data stored? Or are you perhaps saying that it isn't produced in the first place?

      Before throwing a temper tantrum and whining about stupid journalists, take a moment to think things through. There just might be an interpretation that makes sense.

    3. Re:stupid by OneoFamillion · · Score: 2, Funny

      What do you think happens every single time you pick up a telephone and call someone? Praytell, where is that data stored? Uhh... Department of Homeland Security?
  21. How much is actually used? by basic0 · · Score: 4, Interesting

    Ok, so we generate some staggering amount of computerized data every year. This is one of those stories where I can't remember hearing about it before, but it really doesn't feel like "news".

    My question is how much of this data is actually being used? I'm horrible for constantly downloading e-books, movies, software, OSes, and other stuff that I'm *intending* to do something with, but often don't get around to. I end up with gigabytes of "stuff" just sucking up disc space or wasting CDs. I burned a DivX copy of Matt Stone and Trey Parker's popular pre-South Park indie film "Orgazmo" in about 2001. I've since seen the film 2 or 3 times on TV. I STILL haven't watched the DivX version I have, and now I can't find the CD I put it on. I know I'm not the only one who does this either, as many of my friends are using up loads of storage space on files they've just been too busy to have a look at.

    Right now I'm on a project digitizing patient files for a neurologist. We're going up to 10 years deep with files for over 18,000 patients. Most of this is *just* for legal purposes and nobody is EVER going to open and read the majority of these files. The doctor does electronic clinics where he consults the patient and adds new pages to their file, which will probably sit there undisturbed until the Ethernet Disk fails someday.

    I think a more interesting story (although probably MUCH more difficult to research) would be "How much computerized data is never used beyond it's original creation on a given storage medium?"

    1. Re:How much is actually used? by Anonymous Coward · · Score: 0

      -I know I'm not the only one who does this either-

      Actually, I'd bet you ARE the only one searching for a CD with a DivX rip of "Orgazmo".

  22. Think tank bias by RealGrouchy · · Score: 1

    I'd take this study's fearmongering with a grain of salt. It probably came from one of those deletionist Think-Tanks.

    - RG>

    --
    Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!
  23. Comment removed by account_deleted · · Score: 5, Funny

    Comment removed based on user account deletion

  24. Of course we will by PIPBoy3000 · · Score: 2, Interesting

    Think about scientific instruments that gather gigabytes of data per second. They hold on to that for as long as they have to, pulling out interesting data, summarizing it, and throwing out the rest. I track all the web hits for our corporate Intranet. The volume is so huge that the SQL administrators come and have a little heart-to-heart chat with me if I let it build up over a few months. I don't really care about the raw information past a month or so. Instead, I want to see running counts of which pages are being viewed, which people are big utilizers of our network, and so on.

    A good analogy is the human brain. We gather in huge amounts of information per second via touch, sight, and so on, but throw out the vast majority of the information. The key is to have good filtering systems so that things that are interesting and relevant are held onto.

  25. Suzumiya Haruhi by alexjohnc3 · · Score: 0, Offtopic

    Make sure you watch out for giant crickets, especially if you visit Superior Japan.

    1. Re:Suzumiya Haruhi by Anonymous Coward · · Score: 0
      How is the above off-topic? Obviously the moderator doesn't know about Suzumiya Haruhi no Yuuutsu and therefore shouldn't have modded the post off-topic.

      The links the poster provided aren't very informative. I would have used this, from Anime on My Mind:

      Yuki activates something and transports the Haruhi-less SOS Brigade to the alternative desert reality, and the computer club fondler is trapped there in the form of a huge cricket. Uh huh. Yuki and Itsuki (who can now conveniently manifest his power, though this 1/10th squaggle-waggle reeks of Bleach) then show off their powers in defeating the cricket and returning the computer club fondler back to the normal world. Only Kyon and Mikuru were absolutely useless during this scene, and I think Kyon was only around to provide Itsuki an excuse to explain whats going on, and Mikuru was only around to give us more of cowering Mikuru.

      They then return to the club room and give a post mortum on the event, which basically was that the cricket is a manisfestation of a data entry entity (sorta like Yuki, maybe) that was dormant until it saw the SOS Brigades logo on 2chan, which supposedly contains 436 petabyes of data.
  26. Exabyte tapes by Roger+W+Moore · · Score: 2, Funny

    So at this rate it won't be long before we will need real Exabyte tapes. I always thought the original ones should qualify for the award of world's most misleading name since their capacity was 500 million times less what their name suggested.

    1. Re:Exabyte tapes by aybiss · · Score: 1

      Will I need a real Gigabyte motherboard? :-)

      --
      It's OK Bender, there's no such thing as 2.
  27. What are we supposed to do with it all??! by AaronPSU777 · · Score: 1

    Seriously, forget about storing all this data, what exactly are we going to do with it?? How are we going to process and manage zettabytes worth of data? What tools are we going to use to sift through that much data and get what we need? Should we even be keeping it?? Hell %90 of it may well be porn. The more data we produce the more urgent it will become to ask these sorts of questions, and find the answers to them.

  28. Google Says: by nbritton · · Score: 2, Interesting

    (161 exabytes) / 6,525,170,264 people = 26.4931682 gigabytes per person.

    1. Re:Google Says: by Anonymous Coward · · Score: 0

      Unfortunately the Google calculator doesn't handle exabytes and gigabytes correctly, so you have to redo those calculations manually.

    2. Re:Google Says: by DragonTHC · · Score: 1

      yes, yes, in the U.S. Government's citizen dossiers. 26 GB per person.

      --
      They're using their grammar skills there.
  29. zetta--watt iss that? by 454_Casull · · Score: 1

    ....and in the year of our lord 2012 our data became self aware.....

  30. Google Says: by nbritton · · Score: 2, Interesting

    (161 exabytes) / 1,093,529,692 people[1] = 158.086639 gigabytes per person and 19.6380918 gigabytes per person if you don't count the duplicate data.

    [1] Total est. of people on the Internet:
    http://www.internetworldstats.com/stats.htm

  31. 2nd the motion of Firehed by NotQuiteReal · · Score: 1
    I too am a programmer, and I have almost every scrap of code I ever wrote, including z80 assembly code to play "pong" on an analog oscilloscope. Why do I have it? I dunno, because I can. I don't even know where it is at this moment, but I THINK I have it on a cd-rom somewhere. And as long as that "archive bit" is set in my mind, it is ok (but if I couldn't find it, I'd just shrug and say, oh well...)

    Text (code, misc letters) IS very small. Up until just a couple of years ago, all the "good stuff" would fit on a CD-R or two.

    Now, I have several full DVD-Rs with copies of digital photos, and I just finished making 30+ more DVD's of (compressed data) that hold 60+ hours of old home video before the tapes rot.

    By-and-large, there is a lot of crap that I personally don't feel a need to save (because I can always get it from somewhere else, if need be) but even "personal" stuff is adding up to 100's of Gigabytes.

    Still, data is smaller than boxes of pictures and video tapes.

    I didn't RTFA, but I don't see what the big deal is. From where I sit, I have computer power and data storage equivalent to what cost millions and millions of dollars at one time, in my own lifetime.

    And it keeps getting cheaper...

    I suspect, just like in the physical realm, "important" digital items will survive, thru shear duplication and media updates, far more often than "unimportant" items... like my family photos, but at least they have a shot.

    Upload all your snapshots to a royalty free photo site and gain digital immortality via file hoarders. The only rub is that you can't let your pix be personal enough to trace back to you or the stalkers will get you.

    --
    This issue is a bit more complicated than you think.
  32. The number is way too low! by EmbeddedJanitor · · Score: 2, Insightful
    OK,ok, I didn't RTFA and did not really RTFSummary (that's not the point of /.).

    If we consider all digital data, not just the stuff that flows over the internet, then this is way too low. Consider the data in all the DTVs, GPS receivers etc.

    A top-end GPS is grinding over 10^9 bits per second in its correlators (about 50 correlator channels x 20Mbps or so sampling rate). That ends up being approx 3x10^15 bytes per year per GPS... or 40,000-odd top-end GPSs would be grinding 1.61x10^20 bytes per year. There are far more than 40k high end GPSs in the world, so the budget is already blown...

    --
    Engineering is the art of compromise.
    1. Re:The number is way too low! by Simon80 · · Score: 2, Informative

      RTFS - it's not about bandwidth, it's about unique data, knowledge, ideas, information.

    2. Re:The number is way too low! by Anonymous Coward · · Score: 0

      RTFP - He wasn't talking about bandwidth, he was talking about "unique data" being produced by devices on other networks.

    3. Re:The number is way too low! by Dun+Malg · · Score: 3, Insightful

      Again, RTFS: "IDC estimates that in 2006 we created, captured, and replicated 161 exabytes of digital information."

      If a NMEA lat-lon string gets spit out of the serial port of a GPS and there's nothing there to capture it, it is not part of their count. They're not counting bitrate on data generators and multiplying times bandwidth. They're counting discrete blocks of saved data. You cannot arrive at the latter from the former, just like you can't tell how much water is behind Hoover Dam on average during the year by measuring the average daily flow rate of the Colorado river and multiplying by 365.

      --
      If a job's not worth doing, it's not worth doing right.
    4. Re:The number is way too low! by bobcote · · Score: 1

      All stored on drives with a one year warranty

  33. Low SNR by Jekler · · Score: 5, Insightful

    As interesting as the sheer volume is, most of it is garbage. I'd rather have 50 terabytes of organized and accurate information than 500 exabytes of data that isn't organized, and even if it were, it's accuracy is questionable at best. In essence, even if you manage to find what you want, the correctness of that information is likely to be very low.

    I've long said we are not in the information age, we are in the data age. The information age will be when we've successfully organized all this crap we're storing/transmitting.

    1. Re:Low SNR by Anonymous Coward · · Score: 0

      Clearly you have not learned to properly misinterpret the text.

    2. Re:Low SNR by General+Wesc · · Score: 1

      I'd rather have 50 terabytes of organized and accurate information than 500 exabytes of data that isn't organized
      Everybody stand back!
    3. Re:Low SNR by ioshhdflwuegfh · · Score: 1

      I've long said we are not in the information age, we are in the data age. The information age will be when we've successfully organized all this crap we're storing/transmitting. Or, to paraphrase Travis: "One of these days, I'm gonna get my data organizized"
  34. The Singularity is coming! by Ikoma+Andy · · Score: 1

    Run!

    1. Re:The Singularity is coming! by Patrik_AKA_RedX · · Score: 1

      The singularity is still a far way off, it's not like, hold on I got an email, we can't keep up with, just a second got to answer the phone, the information provided, damn a sms, to us or something. We still can, just a minute got to answer my cellphone, easily keep up with, shit an instant message popped up, our daily communications, isn't it?

  35. Malthus has just gone down to the shops by roesti · · Score: 2, Insightful

    We won't be running out of space just like we didn't run out of food. New technology will allow us to store ever more data.

    I remember when software came on cassettes and when food came from close to where you live.

    When floppy disks were too small, we made higher-density floppy disks, and we still needed a whole box of them.
    When there wasn't enough of a particular food, we got it shipped from further away.

    When CD-ROMs came out, we still ended up not only filling them but spreading things over multiple CDs.
    When the imported food got too expensive, we started using chemical fertilisers to grow more of them closer to home and more cheaply.

    We had to invent bigger CDs. DVD became HD-DVD and Blu-Ray. People are already complaining that they're not big enough.
    We got bigger trucks and bigger boats to cover food with more preservatives and ship it here from further away, and the more of this we bought, the cheaper it got.

    You got that bigger hard disk, so you could amass data and store it forever. Remember how you said you'd never fill it up? Then broadband happened, and P2P happened, and fill it up you did.
    You didn't worry about it, though, the same way you didn't worry about not having enough food, either. Your supermarket is awash with thousands of varieties of food, from wherever it's cheap, and you can eat as much as you want of whatever you want.

    Because everything is more available, more quickly and more easily, you now have more stuff than you could ever use. Nowadays, people don't think twice about Tivo-ing or downloading something that they're never even going to watch. As the technology gets better - as disks get bigger, and as networking gets faster - this is only going to become more prevalent.

    But there is a physical limit to what can be done. Do you need a new hard drive, or a new router? What metals and chemicals are required to make them? How much energy is required? Where are they built, and how do they get to you? There's only a finite amount of this stuff in the ground, and none of this is invincible to exponential growth. The people who think this can go on forever, or even for the rest of their natural lives, are kidding themselves.

    Eventually, these materials will be harder to get, things will start to become more difficult to make and more expensive, and everyone will be complaining about how expensive their last computer was. Really, though, I don't even want to know these people. They've gotten their priorities all wrong.

    The parent poster says we won't be running out of anything. All that's really happened is that we haven't run out yet. The planet simply can't sustain the 6.5 billion of us there are now, let alone the billions more to be born in the next few decades. The problem is that when there isn't enough to go around, some of us will be lining up for new video games and iPods, and some of us will be lining up for food, water and fuel.

    I should warn you to choose wisely, but really, what do I care? Choose unwisely, and leave more for the rest of us.

    1. Re:Malthus has just gone down to the shops by Anonymous Coward · · Score: 0

      Mod parent down -1 Useless fluff

    2. Re:Malthus has just gone down to the shops by LighterShadeOfBlack · · Score: 1

      I think you're pushing the scarcity doomwatch angle a bit hard here. Yeah, after 40 odd years we've probably pushed the basic HDD design about as far as it'll go, we'll probably never get more than 1-2TB on a 3.5" drive. But then there are numerous other technologies being developed that seek to improve on the performance and data density that HDDs provide. While they might be a way behind right now, the fact is that many of these are technologies in their infancy that will, with enough R&D, increase in performance and density just as HDDs have for the past 30-40 years. While there are inherit limits on all things, to think that we are remotely close to them right now in computing strikes me as a little premature - kind of like a guy flying around in his bi-plane in 1920 at 80mph thinking "man, nobody will ever fly faster this". Of course he was wrong, the technology he was using at that moment might not have allowed faster travel, but there were new designs and new technologies that would take its place and provide so much more.

      Kinda offtopic here but the Earth can sustain 6.5 billion people, in fact I'm sure I've read various studies that have come closer to 8 or even 10bn people being sustainable with the proper agriculture and social framework*. The reality of our situation right now isn't that people are dying because the Earth can't sustain them, it's because they don't have the necessary framework to survive - it's not an issue of scarcity but one of inequality.

      *: Whether 8-10bn people on Earth is practical or even pleasant to live with is another matter, I'm just saying that with the right setup people could breathe and eat and do other all the things that people do without the world collapsing.

      --
      Spelling mistakes, grammatical errors, and stupid comments are intentional.
    3. Re:Malthus has just gone down to the shops by babyrat · · Score: 1

      The planet simply can't sustain the 6.5 billion of us there are now, let alone the billions more to be born in the next few decades

      I think you underestimate the size of the planet - it's pretty big.

      There are 6.5 billion people on the earth (well, roughly - last time I counted, I think I lost count at about 4 billion, but I'm pretty sure I was more than halfway at that point).

      Assuming everyone lived in households of 3, and each household had it's own acre of land, you would be able to fit the entire population of the world in the US. That would leave the rest of the world empty - like empty - with no people.

      Of course you are somewhat correct in that any finite resource (no matter how big) can not stand up to infinite growth however to imply that it will happen in our lifetime (or even the lifetime of a child born tomorrow) is absolutely absurd!

  36. Re:dude by maxume · · Score: 1

    Packrats are awesome:

    http://en.wikipedia.org/wiki/Packrat_midden
    http://www.google.com/search?q=packrat+middens

    Watch what you say about them!

    I try not to keep junk around; keyword is try. I set my 5MP camera to 3MP and don't feel like I am losing anything. Stuff I consider important fits on a DVD; stuff I will bother moving to a new drive is more like 120GB.

    --
    Nerd rage is the funniest rage.
  37. Oblig Seinfeld? by iminplaya · · Score: 1

    Yotta, yotta, yotta...

    --
    What?
    1. Re:Oblig Seinfeld? by Anonymous Coward · · Score: 0

      It's Yadda Yadda Yadda, moron.

    2. Re:Oblig Seinfeld? by iminplaya · · Score: 1

      Not with MY wife, you don't!

      --
      What?
  38. Re: exactly right ... by enselsharon · · Score: 1

    A lot of people are going to get burned by offsite file storage services - and I place this in two categories:

    - people who get burned by AOL/xdrive/Amazon/S3 when the privacy of their data is ruined - either by sharing information to enable advertisements or by rolling over like cowards to any government agency that even picks up the phone

    - people who get burned because the wacko distributed custom database-driven filesystem in the sky that they trust their data to has a glitch and goes down for days or weeks if there is ever any significant disruption in Internet connectivity and/or routing.

    This is of special importance to me because regulatory concerns _require_ that I store critical data offsite, so I had to bite the bullet ... I'm pleased so far with my choice (rsync.net) for offsite backup of my linux data, but it took a lot of research and a lot of reassurances before I would take the plunge. I concluded that I was far better off with a provider that put my data on real, rational unix filesystems (like exavault, strongspace, rsync.net, etc.) and then the decision came down to who had the most reassuring privacy and search warrant policy.

    We'll see...

  39. Surbanes Oxley by Billly+Gates · · Score: 1

    I wonder if it would explain the amount going up as paranoid companies are heavily auditing everything they do and all financial data

    1. Re:Surbanes Oxley by Anonymous Coward · · Score: 0

      the worst that SOX auditing is resulting in is better document/log retention policies... requiring 12 months of access log retention instead of 3, etc.

  40. Yes...but is it useful by stoicio · · Score: 4, Interesting

    It's well and fine to have a statistic like 161 exabytes
    of data, but what's the point. Is that data any more useful
    to people than the selective data that was used to run the world
    50, 60 or 100 years ago?

    We as individuals are only capable of assimilating a limited amount
    of information so most of those exabytes are just rolling around
    like so many gears in an old machine. If they are minimally used or
    never used they simply become a storage liability.

    As an example, the internet has not made *better* doctors.
    Even with all the latest information at thier finger tips
    professionals are still only the sum of what they can
    mentally absorb. Too much data, or wrong data (ie: wikipedia)
    can lead to the same levels of inefficiency seen prior to
    the 'information age'. What would a single doctor do with
    160 exabytes of reading material, schedule it into the work day?

    Also, if the amount of information is rated purely on bytes
    but not in *useful content* the stats get skewed. Things like
    movies and music should be ranked by the length of script
    and/or notation. That would make the numbers much less than
    160 exabytes.

    Saying that the whole world produced 160 exabytes of information
    is like saying the whole world used 50 billion tonnes of water. ...was that water just running down the pipe into the sewar or
    did somebody actually drink it to sustain life?

    Mechanistic stats are stupid.

    1. Re:Yes...but is it useful by jdschulteis · · Score: 1

      As an example, the internet has not made *better* doctors. Even with all the latest information at thier finger tips professionals are still only the sum of what they can mentally absorb. Too much data, or wrong data (ie: wikipedia) can lead to the same levels of inefficiency seen prior to the 'information age'. What would a single doctor do with 160 exabytes of reading material, schedule it into the work day?

      I agree with your premise that too much information, unorganized and of unknown quality, won't help. However, here are two examples of how the internet can make doctors more efficient:

      Amirsys' STATdx provides fast access to authoritative reference information.

      Vocada's Veriphy automates the process of notifying the doctor who ordered a test when a patient has a critical test result.

    2. Re:Yes...but is it useful by stoicio · · Score: 1

      I agree that tools may be able to provide some solutions, however, it
      has not been shown that these tools will provide better medical treatment
      or follow up than having well trained diagnositicians, a current medical
      reference set, and a good staff.

      In fact such tools can make diagnosis worse since a medical database
      does not and cannot have the same level of diagnostic intelligence
      as a human doctor.

      Another issue is 'Who produces the system'.
      We currently have a problem with doctors who have been trained with textbooks
      produced by drug companies. There are obvious biases in many texts to deferr
      the doctor to the use of a product rather than possible better treatments.

    3. Re:Yes...but is it useful by khallow · · Score: 1

      Let's us not forget obvious tools like Google which do a pretty good job of delivering search results on queries including obscure error messages.

  41. Dr Evil by steveoc · · Score: 4, Funny

    So DR Evil, after emerging from his suspended animation, would demand a computer big enough to store 100 Megabytes of evil data.

  42. Data or simply format shift? by Technician · · Score: 1

    The article implies there is a bunch of new data.. The fact is much of the data is simply format shifted into the new medium. Examples of this are;

    1 Photography
    2 Letters and corrospondance
    3 Fileing and records
    4 Music
    5 Telephone calls & faxes
    6 Newspapers and magazines
    7 Novels and books
    8 Board games and puzzles
    9 Movies
    10 Radio and TV broadcasts
    11 ??

    All these form of data existed before. None of them was digital before. The numbers represent a format shift, not new content. Not many people archived every newspaper, phongraph record, photo, magazine, telephone call, TV show, etc. Even in the 1950's there was not enough space to archive all the data.

    --
    The truth shall set you free!
    1. Re:Data or simply format shift? by Anonymous Coward · · Score: 0

      12 Profit

  43. Obvious Joke: by nick_davison · · Score: 1

    What's the difference between the Library of Congress and the House of Representatives?

    In the Library of Congress, you're not allowed to lick the pages.

  44. Identity Theory / Pythagorean philosophy by Todamont · · Score: 0, Offtopic

    I belong to a philosophical group who assert that existence IS identity, that for something to exist it must have physical characteristics which are defined and obey causality, because this is the very essence of what it means to exist. I personally follow the classic pythagorean belief that "everything is number", and that all that truly exists is information. I this sense we not only are producing information but are in turn made up from it. If you believe that the universe is finite, then there may be a total sum of all the information, but I doubt it is in the exabit range. Nature uses qbits, so there is presumably some type of wave-like uncertainty to all information, which may mean that nature itself could have a signal-to-noise ratio approaching 50%.

    --
    Kharma is like a boomerang. Mine is broken.
  45. Poor estimation, poor predictions, poor conclusion by suv4x4 · · Score: 1
    • Noone asked me or my family, or my friends, or my colleagues, how much data we store. How do they even come up with those estimates? Do they just count the more "visible" corporate data warehouses and ignore the millions of individial users?
    • You can't "produce more data than is stored", this is idiotic to claim. How do they imagine this happening? Millions of people stubbornly trying to save files on their full hard drives, leading to a global crisis? This should join the FUD that we're about to "clog" the Internet any minute now, because we're watching funny videos on YouTube.
    • The first prediction didn't account for data duplication, this does. Let's ignore for a minute how wildly inaccurate those "assumptions" are if you can change the methodology every time. They drew a straight line between those to and extrapolate a bit, to arrive at "catastrophic" predictions anyway. Never mind that statistically it doesn't make sense. If I ate 10 grams of food yesterday since someone didn't account for bananas yesterday, but I ate 1 kilo of food today since they did... extrapolating this to tommorow says I'm about to eat 100 kilos of food.
  46. 30 years go, the EXABYTE STRINGY FLOPPY by Anonymous Coward · · Score: 0

    30 years ago, the Exabyte Stringy Floppy (see http://en.wikipedia.org/wiki/Exabyte_Corporation ) was invented to solve our storage problems http://en.wikipedia.org/wiki/Stringy_floppy

  47. This is a complete lie by patio11 · · Score: 1

    As no one has actually seen a Dragonball Z fight conclude, we have no way of knowing how long a typical fight seen would go on for. Personally my bet is that after a few trillion episodes the vibration from the copious yelling of the same three words over and over causes the scene to collapse in on itself like a Moebius strip. This suggests the unpleasant possibility that the typical Dragonball Z fight scene is infinite in length. The truth of this conjecture is a hotly debated unsolved problem in mathematics, and unraveling it has already cost seventeen grad students their lives, and burned out the department's DVD player. The DVD player was deeply mourned.

  48. Re:Poor estimation, poor predictions, poor conclus by MooUK · · Score: 1

    It's amazingly simple to produce and duplicate more data than you have room to store. You simply don't permanently store it all.

    Is this a concept that is so hard to understand? Many replies above don't seem to grasp the concept of data not actually being kept.

  49. 50 Exabytes for $30.5 Billion by nbritton · · Score: 4, Informative

    50 Exabytes = (50)1024 petabytes = (50)1048576 terabytes:

    RAID6 (24 Drives -2{Parity} -1{Hot Spare} = 21) 750GB, 13.48TB ZFS/Solaris:
      93,345,048 750GB Hard Drives:     $17,735,559,120
       3,889,377 Areca ARC-1280ML:       $4,317,208,470
       1,944,689 Motherboards/Mem/CPU:     $766,207,466
       1,944,689 5U Rackmount Chassis's: $4,546,682,882
         194,469 4 Post 50U Racks:          $45,700,215
           3,684 528-port 1Gbps Switches:  $374,294,400
              40 96-port 10Gbps Switches:   $11,424,000
       1,948,935 Network Cables:             $2,020,812
               ? Assembly Robots/Misc.     $111,000,000

    Sub Total:                          $27,910,097,365
    Tax/Shipping:                        $2,645,915,779
    Grand Total:                        $30,556,013,144

    $470 billion cheaper then the IRAQ war.

    1. Re:50 Exabytes for $30.5 Billion by Anonymous Coward · · Score: 0

      Great post. And I liked the last line - shows that forecasting doom just isn't appropriate for those numbers. Hell, I have a terabyte of hard drive storage myself (plus about half a terabyte of various optical disks, plus a few shelves of analog magnetic tape) and it's not made any noticable dent in my roughly-minimum-wage salary.

  50. Did we boil the oceans 10 times as well? by michaelz · · Score: 1

    As everyone can read on wikipedia; Bonwick wrote: ...Thus, fully populating a 128-bit storage pool would, literally, require more energy than boiling the oceans. Zfs can "only" store 16 exabyte. This 161 exabyte would need to be on at least 17 zfs storagepools in order to hold this and most of it would be full. Sounds like, bye bye fishies to me. Frying them in their ocean 10 times would surely kill most of them. Besides; 3.4x10^27 J would be needed to boil the oceans. "Just to propel a ship to Mars, requires 1.4e8 joules per kilogram. This includes leaving Earth, making the transfer orbit insertion, and matching velocities with Mars at the end of the Trip." We could transport 2.43*10^19 kilogram to mars on this energy. Would this be enough to transport the Netherlands there?

    1. Re:Did we boil the oceans 10 times as well? by unablepostAC · · Score: 1

      Why do you want to transport the Netherlands to mars????

  51. dev/urandom vs. dev/random by bobv-pillars-net · · Score: 1

    Actually, when /dev/urandom is read at a higher rate than the Entropy Gathering Device (EGD) can deliver randomness, it reverts to a pseudorandom number generator, which will NEVER generate Slashdot text, as its periodicity is too low. (Although it may be argued that Slashdot's periodicity is even lower.)

    Reading from /dev/random would theoretically generate such text (eventually), but its data rate is severely limited, absent a dedicated hardware random-number generator.

    Even a hardware random-number generator may not generate TRULY random numbers (i.e. sufficiently random that they are mathematically guaranteed to eventually include all possible bit patterns, including inane Slashdot chatter). The proof is left as an exercise for the reader.

    --
    The Web is like Usenet, but
    the elephants are untrained.
  52. gotta be careful with units by Anonymous Coward · · Score: 0

    Actually ZFS can store 16 exbibytes, so you only need 9 of those (ceil(8.72782749)) to store 161 exabytes.

  53. how much does google have? by CoolCat · · Score: 1

    Seriously, they are one of the few who have insanely large amount of data with meaning and powerful tools. Wounder how many megabytes that disk is.

  54. Re:Poor estimation, poor predictions, poor conclus by DimGeo · · Score: 1

    Like the previous poster said, we all produce tons of data daily without storing it. Mobile phone calls. VOIP. Video conferencing. IM without history. etc. That's countless gigabytes daily worldwide...

  55. Wow that's massive! by nbritton · · Score: 1

    To get a feel for the size and scale of 50 exabytes using today's technology,

    * Two copy's of the entire Library of Congress, 6000 TB[1], can be stored in the collective cache buffers of the RAID controllers.
    * It would need a 1,712 MW (peak) power source, a typical PWR nuclear power station produces 2,000 MW. Tack on another $5 billion for the construction of a nuclear power station.
    * You would likely need to employ an entire team (in 3 shifts) to replace defective drives every day.
    * You would need 1,684,804 sq. feet to house all the racks, a building the size of the John Hancock Center would be needed. Add another $385 million to the bill.

    [1] http://www.lesk.com/mlesk/ksg97/ksg.html

  56. Ah 1.1MLocs by Douglas+Goodall · · Score: 1

    1.1 Million Libraries of Congress

  57. Re:Poor estimation, poor predictions, poor conclus by suv4x4 · · Score: 1

    Like the previous poster said, we all produce tons of data daily without storing it. Mobile phone calls. VOIP. Video conferencing. IM without history. etc. That's countless gigabytes daily worldwide...

    I'm sorry that I have to clarify myself, but neither I or the survey include temporary data in the discussion. We're talking data that is stored and represents archived information..

    Otherwise where do we stop? Do we count copies of the programs in RAM, swap files, temporary caches and so on? It'll become pointless pretty soon... That said, the whole study is pointless anyway.

  58. Guaranteed by bobcote · · Score: 1

    99% of which will be cell phone videos of college students holding beer bottles and yelling "Woooo!"

  59. Re:Poor estimation, poor predictions, poor conclus by DimGeo · · Score: 1

    Good point. I was thinking in terms of new data, not maintained data, but anyway.