Slashdot Mirror


"Digital Universe" Enters the Zettabyte Era

miller60 writes "In 2010 the volume of digital information created and duplicated in a year will reach 1.2 zettabytes, according to new data from IDC and EMC. The annual Digital Universe report is an effort to visualize the enormous amount of data being generated by our increasingly digital lives. The report's big numbers — a zettabyte is roughly a million petabytes — pose interesting questions about how the IT community will store and manage this firehose of data. Perhaps the biggest challenge isn't how much data we're creating — it's all the copies of it. Seventy-five percent of all the data in the Digital Universe is a copy, according to IDC. See additional analysis from TG Daily, The Guardian, and Search Storage."

137 comments

  1. Who cares? by Anonymous Coward · · Score: 0

    A zettabyte is more data than you generate during your whole lifetime. It's pointless to have so much space.

    1. Re:Who cares? by Stooshie · · Score: 4, Insightful

      Maybe you won't but then you are not CERN or the Hadron Collider

      --
      America, Home of the Brave. ... .and the Squaw.
    2. Re:Who cares? by wallyhall · · Score: 1

      A zettabyte is more data than you generate during your whole lifetime. It's pointless to have so much space.

      For those wondering: 1,000,000,000,000,000,000,000-ish, or some 10^21

      --
      I think therefore I am... a Linux geek.
    3. Re:Who cares? by FreeUser · · Score: 1

      A zettabyte is more data than you generate during your whole lifetime. It's pointless to have so much space.

      Speak for yourself.

      --
      The Future of Human Evolution: Autonomy
    4. Re:Who cares? by natehoy · · Score: 4, Funny

      Yes, 640 petabytes should to be enough for anybody.

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    5. Re:Who cares? by insnprsn · · Score: 1

      Didnt it say that they are talking about created data (and assorted copies of it)... not how much data we can store

    6. Re:Who cares? by plover · · Score: 1

      A zettabyte is more data than you generate during your whole lifetime. It's pointless to have so much space.

      Ooooh, it sounds like somebody's pr0n collection seems a bit inadequate today.

      --
      John
    7. Re:Who cares? by Dthief · · Score: 1

      I have 10^21 + 1 desktop shortcuts

      --
      www.RacquetUp.org - Helping Detroit Youth
    8. Re:Who cares? by Monkeedude1212 · · Score: 1

      My Hot Fresh Brewed Coffee shot out my nose. It still burns a little, but that joke was totally worth it.

    9. Re:Who cares? by natehoy · · Score: 1

      Glad I could assist in the steam cleaning of your nasal cavities.

      Just another service we offer in addition to sarcasm.

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    10. Re:Who cares? by ByOhTek · · Score: 1

      That's everyone, not per person.

      That's ~1/7th TB per person

      Man am I over quota.

      --
      Self proclaimed typo king, and inventor of the bear destroying coffee table (patent not pending).
    11. Re:Who cares? by Anonymous Coward · · Score: 1, Informative

      Yes, or roughly a million petabyte, where a million is roughly 10^6 and peta is roughly 10^15.

      I'll make roughly one post about this matter.

    12. Re:Who cares? by CODiNE · · Score: 4, Funny

      Don't you mean

      Yes, 640 petabytes should to be enough for everybody.

      --
      Cwm, fjord-bank glyphs vext quiz
    13. Re:Who cares? by SmackTheIgnorant · · Score: 1

      OK, I laughed when I read that, and then I wondered: How much of that would be porn? NOBODY needs a porn collection that big. And I considered making a Library of Congress type comparison, but saying the LoC would be a single hair...... it got creepy. Can we just, as a collective, hit the delete key a couple dozen more times a day? And keep the porn down to under a terabyte?

    14. Re:Who cares? by Anonymous Coward · · Score: 0

      NOBODY needs a porn collection that big... And keep the porn down to under a terabyte?

      Blasphemy!

    15. Re:Who cares? by Anonymous Coward · · Score: 0

      NOBODY needs a porn collection that big.

      Welcome. You must be new here.

    16. Re:Who cares? by natehoy · · Score: 1

      No, what I meant specifically was "(Score:5, Funny)".

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    17. Re:Who cares? by natehoy · · Score: 1

      Ah, so you've finally taken my advice and started cleaning those up. Thanks.

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    18. Re:Who cares? by natehoy · · Score: 1

      NOBODY needs a porn collection that big.

      You're forgetting backups.

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    19. Re:Who cares? by religious+freak · · Score: 1

      lol - now that's funny

      --
      If you can read this... 01110101 01110010 00100000 01100001 00100000 01100111 01100101 01100101 01101011
    20. Re:Who cares? by networkBoy · · Score: 1

      Funny, me too, I'm at 7/1 TB rather than 1/7

      --
      whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
    21. Re:Who cares? by highacnumber · · Score: 1

      It's not that much really. Only about a millionth of a hellabyte.

    22. Re:Who cares? by ikeman32 · · Score: 1

      Zettabyte, pettabyte...pish pah, When they come up with the Teraquad then I will be impressed.

    23. Re:Who cares? by Meski · · Score: 1

      NOBODY needs a porn collection that big.

      You're forgetting backups.

      Just another kind of porn, I guess.

    24. Re:Who cares? by Rusty+KB · · Score: 1

      Hell, 640 petabytes isn't enough for me. I started burning the porn on DVDs!

    25. Re:Who cares? by plan10 · · Score: 1

      Umm, the article is pretty much about the fact that we did generate it in our lifetime

  2. Where do I get a Zettabyte Drive? by filesiteguy · · Score: 1

    Hmm - thinking that I'd like to pop over to cnet or tigerdirect or fry's and pick up a zettabyte drive. I'm sure that's "more than enough storage" for all my digital files...

    Are they on sale for $149.00 yet?

    1. Re:Where do I get a Zettabyte Drive? by plover · · Score: 1

      I'm sure that's "more than enough storage" for all my digital files...

      <oblig>640 zettabytes ought to be enough for anybody.</oblig>

      --
      John
    2. Re:Where do I get a Zettabyte Drive? by filesiteguy · · Score: 1

      LOL!

    3. Re:Where do I get a Zettabyte Drive? by IronDragon · · Score: 1

      Not yet, but soon.

  3. Hardware: "Digital Universe" Enters the Zettabyte by Thanshin · · Score: 2, Interesting

    "In 2010 the volume of digital information created and duplicated in a year will reach 1.2 zettabytes, according to new data from IDC and EMC. The annual Digital Universe report is an effort to visualize the enormous amount of data being generated by our increasingly digital lives. The report's big numbers -- a zettabyte is roughly a million petabytes -- pose interesting questions about how the IT community will store and manage this firehose of data. Perhaps the biggest challenge isn't how much data we're creating -- it's all the copies of it. Seventy-five percent of all the data in the Digital Universe is a copy, according to IDC."

  4. ZFS? by CuriousKumar · · Score: 1

    With Z already in place and with [not so] recent inline deduplication feature, I think ZFS should do it.

    1. Re:ZFS? by Stooshie · · Score: 1

      Like Yotta

      --
      America, Home of the Brave. ... .and the Squaw.
  5. Kids know what KiB is just fine by Anonymous Coward · · Score: 0

    "You talk to a kid these days and they have no idea what a kilobyte is. The speed things progress, we are going to need many words beyond zettabyte."

    Sure they do. When their download speed drops under 100 kilobytes, they start sending "seed pls" comments to the Pirate Bay.

  6. Annoying audio ad at the 'digital universe' link by Anonymous Coward · · Score: 0

    i know some people here frown down on those

  7. How do we have copies of all this data? by HockeyPuck · · Score: 2, Insightful

    Since this is EMC, let me tell you...

    EMC loves to tell you to use RAID1. - 2 copies of your data
    If it's important, you should use timefinder (snapshots), 1 more copy of the data.
    If you want DR, then you should implement SRDF, 1 more copy of the data (this one is remote)
    If you want to do data warehousing on what you just replicated, you run timefinder on the remote copy, 1 more copy.

    So that makes it 5 copies of my data on disk.

    Oh, and to protect myself from data corruption (or a deleted file) being replicated to all these copies, it's still recommended that I backup to tape/VTL/MAID.

    Total of 6 copies of data. That is if I'm using dedup on my VTL or TSM (which stores versions of a given file). If i'm using a traditional (daily incrementals plus weekly fulls) I could have lots of duplications within my tape infrastructure.

    Ever wonder why EMC stands for Endless Mirroring Company.

    1. Re:How do we have copies of all this data? by Thanshin · · Score: 1

      Ever wonder why EMC stands for Endless Mirroring Company.

      And if the endless mirroring company starts moving really fast, until they couldn't possibly move faster, they'd represent the relationship between mass and energy.

    2. Re:How do we have copies of all this data? by ArsonSmith · · Score: 1

      i thought they were the Excessive Margins Company.

      --
      Paying taxes to buy civilization is like paying a hooker to buy love.
  8. Re:Hardware: "Digital Universe" Enters the Zettaby by cgenman · · Score: 2, Insightful

    Only 75%? Considering that all DVD's are copies, all local caches are copies, I wouldn't be surprised if that number was much larger.

    Also, cutting out all the copies would only reduce the problem to .3 zettabytes. For day-to-day IT purposes, that's about the same number.

  9. I'm happy to see by ltning · · Score: 2, Funny

    That we have all become good citizens, backing up all our data. I presume the data recovery firms are all panicking now that all their potebtial customers have backups of everything, and thus no longer need their services.

    Not bad to have a global backup ratio of >1:1

    Personally I use RAIM (Redundant Array of Instant Messages) to back up all my important notes and communications. It only works as long as all my friends log everything too, of course.

    --
    Love over Gold.
    1. Re:I'm happy to see by natehoy · · Score: 2, Funny

      Dude, that's so old-school. I use RAT (Redundant Array of Tweets). My data is backed up... 140 characters at a time.

      I'm thinking of upgrading to a system with a larger packet size. RASC (Redundant Array of Slashdot Comments) might work, but I'm afraid of having my pr0n collection marked "insightful".

      --
      "This post contains words, known to the State of California to cause thought. Wash brain thoroughly after reading."
    2. Re:I'm happy to see by Ollabelle · · Score: 1

      Well sure, now that the Library of Congress is involved, I'll let them be my backup.

      --
      Ibid.
  10. What is the data? by sadtrev · · Score: 2, Interesting

    I was told about 10 years ago that "70% of the world's digital data is stored under MVS" which surprised me a bit, even then.
    After some thought when you consider that almost all commercial transactions (banks, telcos etc) whould have been running MVS then it may have been true.
    SETI and CERN and other large scientific endeavours are small fry in comparison.

    1. Re:What is the data? by Yvan256 · · Score: 1

      10 years ago MVS carts were considered huge compared to the other gaming systems of the era.

  11. Challenge? by O('_')O_Bush · · Score: 2, Insightful

    " Perhaps the biggest challenge isn't how much data we're creating — it's all the copies of it. "

    Why is that a challenge? Digital media is somewhat unique in that you can carefully craft media or information (reports, programs, videos much in the same way you'd carve a chair) but risk instantly and nearly irrecoverably lose it (much unlike a chair).

    Copies of data are a safeguard by redundancy. A website gets taken offline, well good thing there is a mirror. My camera breaks or my hard drive disk fails, well good thing I have an external backup or copies on my DVDs.

    --
    while(1) attack(People.Sandy);
    1. Re:Challenge? by Menkhaf · · Score: 1

      " Perhaps the biggest challenge isn't how much data we're creating &mdash; it's all the copies of it. "

      Slashdot apparently even manages to create new data while still backing up the old data...

      --
      A proud member of the Onion-in-Hand alliance
  12. Re:Hardware: "Digital Universe" Enters the Zettaby by Rockoon · · Score: 2, Insightful

    In the world of home storage, 75% is definitely way too low. The average personal desktop probably has 20 to 40 gigabytes of used storage, with far less than 1 gigabyte being original content. If they also back up this data, the fraction grows even lower.

    Everything on their DVR is also not original.

    Now, in the business world things are a bit different. Here you can expect the same 20 to 40 gigabytes of used storage on the median machine, but backed by a massive networked database of original uptime-critical content with at least a couple mirrors.

    It is this second category that is clearly driving their estimate.

    --
    "His name was James Damore."
  13. a kb is a lot of data by jDeepbeep · · Score: 1

    640K ought to be enough for anybody

    --
    Reply to That ||
    1. Re:a kb is a lot of data by CecilPL · · Score: 1

      I need that much every second, at least!

  14. But it's pretty by hierofalcon · · Score: 1
    When I first started, information was hard to produce... punched cards and all that.
    Information storage was expensive.
    At some point we started word processing on the desktop.
    Information storage was still expensive.
    Files were still small and the majority of the bytes in each file was information.

    As time progressed and Microsoft Office has permeated the work area, the information content of each file hasn't changed much.
    Each release seemed to take more space to store the same information.
    Today, the portion of the file consumed in making it pretty though has gone through the roof.

    We could always go back to just plain text files that were easy to search and cheap to store. Project Gutenberg has taken that approach for saving books. Keep it simple. Of course if we did that the productivity might go through the roof and layoffs might be high.

    Guess pretty isn't so bad, even if it is part of the zettabyte problem.

    1. Re:But it's pretty by maxume · · Score: 1

      That's a silly analysis of it, text markup and layout is some tiny fraction of it. 150 pages of text layout information takes up about the same amount of space as 2 crappy snapshots, or a few seconds of high quality video.

      --
      Nerd rage is the funniest rage.
    2. Re:But it's pretty by tendrousbeastie · · Score: 1

      I think Hierofalcon was probably referring more the huge inefficiency of MS Word and co to store even a simple text based document.

      I have seen 70MB+ Word files, which you can open, Ctrl-A, Ctrl-C and then paste into a new empty doc - save that doc and you have a 50kb Word file.

      Plain text doesn't really have the capability of being inefficient (unless I suppose you fill the file with crap, but then it is simply efficiently storing a load of crap).

    3. Re:But it's pretty by maxume · · Score: 1

      The part at the end where he talks about pretty favors my interpretation.

      I don't know what causing what you describe, but there is probably something tracking changes to the document. And maybe somebody posted in a large bitmap (from what I have seen, people think that is a great idea), or perhaps of series of them, and then deleted them.

      --
      Nerd rage is the funniest rage.
    4. Re:But it's pretty by tendrousbeastie · · Score: 1

      Yeah, Word has a lovely feature whereby when you remove sections from a document it doesn't so much delete that content from a file as just delete the references to it. So, if someone changes one image in a doc for another it will keep a copy of both images in the file but only show the new one.

      It would be a good feature if it was actually made use of in some sort of revision history system, but as far as I can tell the only effect of it is the increased file size of some docs.

      I agree with you that the talk about making it pretty must refer to markup code of some kind, so you're right, my talk of plain text doesn't really address the issue.

    5. Re:But it's pretty by Smauler · · Score: 1

      Gamefaqs still hosts all its faqs in text, which is one of the reasons I use it pretty exclusively. That and it being easily the most authorative faq site out there.

  15. Re:Hardware: "Digital Universe" Enters the Zettaby by PlusFiveTroll · · Score: 3, Insightful

    If every piece of digital data doesn't have a copy made of it, it is one hardware failure away from non-existence. Most of the storage space used in businesses that I administrate is not for the original data, but for multiple backup copies. Copies are not a bad thing, in the business we call them redundancy.

    # Only wimps use tape backup: real men just upload their important stuff on ftp, and let the rest of the world mirror it ;)
            * Torvalds, Linus (1996-07-20).

  16. Library of congress by nlann · · Score: 1

    How many libraries of congress is that?

    1. Re:Library of congress by Anonymous Coward · · Score: 0

      About 100,000

    2. Re:Library of congress by iamhassi · · Score: 1

      And if it were in phonebooks stacked on top of each other how far would it reach?

      --
      my karma will be here long after I'm gone
    3. Re:Library of congress by OctaviusIII · · Score: 2, Interesting

      According to Wikipedia, it's about 10^9 Libraries of Congress, not including images.

      --
      What's this? Another weblog? On transit?
  17. Of COURSE most of it consists of copies! by King_TJ · · Score: 2, Insightful

    A typical individual wouldn't have a whole lot of unique information to store in the first place.... Basically, a collection of photos and some video from a few vacation trips or holidays, and some handwritten notes .... Maybe some artistic works (a few original songs or paintings, or ?) if he/she was interested in such endeavors. Oh, and your tax records and resume. But let's face it. Most of us are FAR more of content consumers than creators. Content creation usually results in mass re-distribution of the original work, as others want to enjoy a copy of it.

    I don't see any harm with this either, since duplication is the best way to protect against data loss. (When my parents were trying to trace their family history, they reached a dead-end because a library had burnt up in a fire that contained the only known records of some of the people they needed to research. With so much data going digital, on media that's practically EXPECTED to fail after less than 10 years of regular use? You better believe we need lots of duplicates out there!)

  18. I have often said.... by hesaigo999ca · · Score: 1

    I have often spoken to a many engineers from gmail and hotmail....pertaining to the data they store and how they could improve their
    systems by having pointers to emails instead of actual copies per storage account. if someone sends a joke email from one gmail account to all his friends which have 80% gmail accounts (so let's say, 25 in 30) you would only still have one copy of that joke email sitting on their server accessible by all who have that pointer reference, but in fact looks like they all have their own copy, unless of course someone modifies that email, which then becomes its own version , so splitting into many copies....

    If you were to add that to many more things in life like cloud computing for storage purposes, across the board, you would have a lot less bandwidth being used, imagine being able to reference first by using lookup pointer tables, and instead of downloading, you already know someone on your own network has that same service pack for xxx application, so the download supposedly goes quicker because you are not streaming the packets from outside the network to get the info. etc...etc...

    Anyways, I can't wait for our first zetta drive being sold at a futureshop near you, i will be the first in line to buy one!
    In a few years i think.....lol

    1. Re:I have often said.... by iamhassi · · Score: 1

      Or you could just have pointers to words in a dictionary. All you need is a few million words and you could recreate any possible email written in English

      --
      my karma will be here long after I'm gone
    2. Re:I have often said.... by CecilPL · · Score: 1

      Or you could just have pointers to letters in the English alphabet! Then you can store all your emails in only 26 bytes (plus some overhead for the pointers).

  19. Roughly? by Anonymous Coward · · Score: 0

    "a zettabyte is roughly a million petabytes"
    Surely that should read *exactly* ...?
    /me ducks

  20. 1.21 zettabytes? by nonregistered · · Score: 5, Funny

    1.21 zettabytes? Great Scott!

    1. Re:1.21 zettabytes? by nebaz · · Score: 1

      To aptly apply the mispronounced "jigawatt" paradigm: <Doc Brown>1.31 settabytes? Great Scott!</DocBrown>

      --
      Rhymes that keep their secrets will unfold behind the clouds.There upon the rainbow is the answer to a neverending story
    2. Re:1.21 zettabytes? by Anonymous Coward · · Score: 0

      Actually, "jigawatt" is the correct pronunciation.

      http://www.merriam-webster.com/dictionary/gigawatt

    3. Re:1.21 zettabytes? by Anonymous Coward · · Score: 0

      And yet, "jigawatt", i.e. soft-g, would not be a mispronunciation by most references one might care about.

  21. Retarded IP by static416 · · Score: 5, Insightful

    This beautifully illustrates how idiotic the concept of "copy right" and IP in general is in the digital universe. When 75% of 1.2 zettabytes is mostly untracked copies of other information, just storing the licenses alone would be an impossible task.

    How do you maintain a business model built on the exclusive right to copy information in world where everything is a infinitely copied and copyable? It's like trying to legislate and sell access to saltwater while floating on a raft in the middle of the pacific.

  22. Re:Hardware: "Digital Universe" Enters the Zettaby by iamhassi · · Score: 2, Insightful

    HD home movies and photographs are far more than 1gb

    --
    my karma will be here long after I'm gone
  23. Re:Hardware: "Digital Universe" Enters the Zettaby by basso · · Score: 1

    The comment about all the duplication of storage makes me think of the current pop culture obsession with hoarding.

    I'd guess that all slashdotters have known someone who obsessively downloads music - to the point that they've got more music stored than they could possibly listen to.

  24. Headline... by Anonymous Coward · · Score: 0

    kinda makes me wonder what it would take to store the non-digital universe.

    It hurts my head.

  25. Of course... by Anonymous Coward · · Score: 0

    1.1ZB of that is porn.

    1. Re:Of course... by gmuslera · · Score: 1
  26. Re:Hardware: "Digital Universe" Enters the Zettaby by Anonymous Coward · · Score: 0

    I can't believe none of you caught this... mod funny please, not interesting. Too bad there's no +1 Redundant.

  27. Space Program by Ukab+the+Great · · Score: 5, Funny

    - 1 zettabyte / 1.44MB floppy disk = approx 694,444,444,444,444 floppy disks.

    - 694,444,444,444,444 * 3.5 inches per disk = 2,430,555,555,555,550 inches if you laid the floppies end to end.

    - 2,430,555,555,555,550 inches / 63360 inches per mile = 38,361,040,965 miles

    - 38,361,040,965 miles / 2.7 billion miles to pluto = approx 7 round trips to Pluto via floppy disk.

    In conclusion: Don't kill NASA yet, President Obama. We've found a way to get to Pluto!.

    1. Re:Space Program by RickyG · · Score: 1

      Dang! I was wishing someone wouldn't fall back on the old saw ("That is x number of floppy disk!) Why didn't you use the even older one, that is x number of copies of the Britannic Encylcopdia? We would have reached the nearest star with that example!!

    2. Re:Space Program by Anonymous Coward · · Score: 0

      Fine. Let's use a 64 GB SD card instead.

      - 1 ZB / 64 GB SD card ~= 14551915228 SD cards
      - 14551915228 * 2.1 mm per card = 30559021 m
      - That's roughly 31 000 km, or 3/4 of the distance around the surface of the Earth. Can't even climb a stack of 'em to the Moon.

    3. Re:Space Program by gmhowell · · Score: 1

      If only we'd saved all of those AOL floppies...

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
  28. Too many duplicates consuming disk space? by RhapsodyGuru · · Score: 2, Insightful

    No problem...

    zfs set dedup=on tank

    there... that should do the trick.

  29. 75% sounds about right by SlippyToad · · Score: 1

    75% of everything I have on disk is a copy of something else, but unfortunately I usually have lost the copy somewhere in the process of moving, moving from one machine to the next, or trying to clear up disk space so I can download more stuff to leave on my disk.

    --
    One day I feel I'm ahead of the wheel / the next it's rolling over me / I can get back on / I can get back on
  30. A zettabyte is EXACTLY a million petabytes by swillden · · Score: 0, Troll

    By definition. And since EMC is a storage company, they're almost certainly using the SI prefixes properly.

    The author of the summary is, I think, confusing zettabytes and petabytes with the base-2 units, zebibytes and pebibytes. For all of the binary prefix haters, when you get up into these sizes the difference between base 2 and base 10 units is more than big enough to justify the effort to use the correct terms. The difference between one zebibyte and one zettabyte is over 180 exabytes.

    --
    Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    1. Re:A zettabyte is EXACTLY a million petabytes by Anonymous Coward · · Score: 0

      And when you get to those kinds of numbers, they are the same order of magnitude so the difference is fairly irrelevant.

    2. Re:A zettabyte is EXACTLY a million petabytes by swillden · · Score: 0

      And when you get to those kinds of numbers, they are the same order of magnitude so the difference is fairly irrelevant.

      Order of magnitude, yes, but a power of 10 is a pretty wide range.

      Put it this way: A zebibyte is almost 20% larger than a zettabyte. That's a pretty big difference.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    3. Re:A zettabyte is EXACTLY a million petabytes by Anonymous Coward · · Score: 0

      Most amusing to me is that 1,048,576 (if that's what he meant) is fewer characters than "roughly a million", so there was really no point in being vague.

    4. Re:A zettabyte is EXACTLY a million petabytes by swillden · · Score: 1

      Most amusing to me is that 1,048,576 (if that's what he meant) is fewer characters than "roughly a million"

      And 2^20 is even fewer!

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
    5. Re:A zettabyte is EXACTLY a million petabytes by gmhowell · · Score: 0, Offtopic

      Do you have Asperger's?

      --
      Jesus was all right but his disciples were thick and ordinary. -John Lennon
    6. Re:A zettabyte is EXACTLY a million petabytes by swillden · · Score: 1

      Do you have Asperger's?

      No. Do you have Alzheimer's?

      I'm assuming that's how we play this game. I haven't seen it before, though, so I'm just guessing.

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  31. I would have expected 100% duplication by Anonymous Coward · · Score: 0

    Everything should be backed up ...... Right?!?!

    Then I would expect 100% duplication, or higher should be the norm.

    1. Re:I would have expected 100% duplication by Anonymous Coward · · Score: 0

      If 75% of the data out there is a copy of something, then 25% of the data is not a copy of something. That is 3 to 1, on average.

  32. Damn pirates by Issarlk · · Score: 1

    75% of 1.2ZB = 1E14 megabytes = 150 GigaCDs. At arround 10 tracks per CD costing each 22000USD, that makes it 34 petadollars in lost sales for the music industry !

  33. Re:Hardware: "Digital Universe" Enters the Zettaby by Rockoon · · Score: 1

    The average person doesnt have the ability to take HD home movies because they dont even own the equipment necessary.

    I've seen you project your geek lifestyle onto the world before.

    --
    "His name was James Damore."
  34. 75% of all percent of all the data in the Digital by Anonymous Coward · · Score: 0

    "75% percent of all the data in the Digital Universe is a copy"

    Slashdot Editors unavailable for comment.

  35. Re:Hardware: "Digital Universe" Enters the Zettaby by masshuu · · Score: 0

    The average personal desktop probably has 20 to 40 gigabytes of used storage
    This is slashdot, i doubt many here qualify as average.
    I myself fill my drives with 680GB of stuff, with 40-50GB easily being original.

    --
    O.o
  36. How do you know which one is the copy? by Sir_Dill · · Score: 1
    If a digital copy is identical to the source file, then how do you know which one is the copy?

    Identical meaning everything down to the create date and last modified date.

    1. Re:How do you know which one is the copy? by Anonymous Coward · · Score: 0

      How do you know that one of them is a copy in the first place? The same file could have been independently created.

    2. Re:How do you know which one is the copy? by Anonymous Coward · · Score: 0

      Both are?

      But really, for the definition of "copy" meaning "duplicates" (yeah, I know "replicates" sounds more general). There are N-1 "copies" if there are N identical stored instances of the data, defined for N>=1. You don't have to identify "copies", just count them. Because the whole point was to get a general measure on the *redundancy* of data, not come up with a scheme for identifying "copies".
        And if they are true replicates, it doesn't really matter if none of them were "first generation" (yet that is another way to define away your problem), it is just that for replicates the "generation" really doesn't matter.

      This definition is usually only meaningful when the archiver and content creator are the same or a related entity. If I have purchased "School of Cock 6" on DVD and made a backup, I would probably say I have 2 copies.

      Obviously only the content (and possibly file record data) is identical, the actual storage locations are not, so the whole philosophical issue of "identical" is not really a problem.

  37. Re:Hardware: "Digital Universe" Enters the Zettaby by Anonymous Coward · · Score: 0

    a zettabyte isn't ROUGHLY a million petabytes - it is EXACTLY a million petabytes, that being its definition, and all.

  38. Volume of digital data, not information by rcamans · · Score: 1

    Actually, information is useful stuff. The internet, and the world is saturated with useless stuff (data, noise). Also, the world's stuff is considerably smaller when de-duplicated. And then if you remove redundancies, and different ways of saying the same thing...
    I am pretty sure that all the world's Information can be contained on a single petabyte. That would include all the world's literature, and all the newspapers, magazines, etc. If you include pictures, maybe significantly more.
    Part of the Data problem is tons of data which has not been analyzed. Of the analysis could keep up with the data collection rates, then no problem. And no zetabytes needed, either.

    --
    wake up and hold your nose
  39. questions about how to store and manage the data by Kopretinka · · Score: 1

    "the big numbers pose interesting questions about how the IT community will store and manage this firehose of data" - just like the construction community will house and manage the firehose of over 6 billion people, so to speak.

    Everybody takes care of their own bit(s) & backups; there is no single entity dealing with managing 1.2ZB.

    Questions not so interesting. Move on.

    --
    Yesterday was the time to do it right. Are we having a REVOLUTION yet?
  40. Depends on whether you're SI & IEC or JEDEC by Ungrounded+Lightning · · Score: 1

    a zettabyte isn't ROUGHLY a million petabytes - it is EXACTLY a million petabytes, that being its definition, and all.

    Depends on whether you're using SI & IEC units (Z/zeta- and Zi/zebi- for 1000^7 / 1024^7) or extrapolating JEDEC (which would come up with Z/zeta- for 1024^7).

    --
    Bantam Dominique roosters crow a four-note song. Once you've heard it as "Happy BIRTHday" you can't NOT hear it that way
  41. Re:Hardware: "Digital Universe" Enters the Zettaby by Anonymous Coward · · Score: 0

    Are you kidding? I bought, at retail, a Sony cam, with 60GB internal drive and HD resolution 2 years ago for $900 from Costco. Most people buying new camcorders have ready access to HD quality cameras. Given the purchasing behavior of my cousin and brother-in-law; I think far more people cycle their personal tech more frequently than you suspect.

  42. The scary part by jgreco · · Score: 1

    Is what percentage of it ISN'T backed up AND should be (which will be something less than 25% but much greater than 0%).

  43. Re:Hardware: "Digital Universe" Enters the Zettaby by Anonymous Coward · · Score: 0

    Seriously, what is up with the mods on this one?!?

  44. Re:Hardware: "Digital Universe" Enters the Zettaby by kybur · · Score: 0, Redundant

    "In 2010 the volume of digital information created and duplicated in a year will reach 1.2 zettabytes, according to new data from IDC and EMC. The annual Digital Universe report is an effort to visualize the enormous amount of data being generated by our increasingly digital lives. The report's big numbers -- a zettabyte is roughly a million petabytes -- pose interesting questions about how the IT community will store and manage this firehose of data. Perhaps the biggest challenge isn't how much data we're creating -- it's all the copies of it. Seventy-five percent of all the data in the Digital Universe is a copy, according to IDC."

  45. Re:Hardware: "Digital Universe" Enters the Zettaby by Jenming · · Score: 1

    HD home movies and photographs are copies, even if only one digital copy exists.

    --
    Morpheus, God of Dreams.
  46. Hardware: "Digital Universe" Enters the Zettabyte by Darric · · Score: 0, Redundant

    "In 2010 the volume of digital information created and duplicated in a year will reach 1.2 zettabytes, according to new data from IDC and EMC. The annual Digital Universe report is an effort to visualize the enormous amount of data being generated by our increasingly digital lives. The report's big numbers -- a zettabyte is roughly a million petabytes -- pose interesting questions about how the IT community will store and manage this firehose of data. Perhaps the biggest challenge isn't how much data we're creating -- it's all the copies of it. Seventy-five percent of all the data in the Digital Universe is a copy, according to IDC. "

  47. Re:Hardware: "Digital Universe" Enters the Zettaby by ArsonSmith · · Score: 1

    So when this is duped in a few hours will that be irony or just funny?

    --
    Paying taxes to buy civilization is like paying a hooker to buy love.
  48. Re: EMC's take by storagedude · · Score: 1
    FWIW, here's EMC's take on all this - the future is clouds, SSDs and Ethernet storage:

    http://www.enterprisestorageforum.com/ipstorage/news/article.php/3879726

  49. Roughly? by Anonymous Coward · · Score: 0

    a zettabyte is roughly a million petabytes

    No, a zettabyte is exactly a million petabytes.

    A zebibyte is roughly a million pebibytes.

    Sheesh.

  50. Need some smart software by Anonymous Coward · · Score: 0

    I have just a new terabytes sitting around the house plus back up on various computers, USB sticks here and there. I have also tried to find a nice unified storage device, say RAID 5, that can collect it all and remove all duplications (but keep a kinda of symbolic link to the one true copy and only fork it if it is changed). So if I have two files A and B in two different directories but they where the exact same file then they could be just stored a one file on the disk. If I decide to change A then I should be ask if I should update it or fork a copy of it, leaving B alone.

  51. Re:Hardware: "Digital Universe" Enters the Zettaby by archshade · · Score: 1

    My dads probably got more music (excluding the stuff he downloads) than he will ever get round to listening to. He is almost 50 and I seriously doubt he will double that and his entire basement manages to fit 2x chair + HI-FI + 90% record collection + 75% CD collection. If I go stay the rest of the records and some CDs make it impossible to get into my bed without knocking a stack over.

    This is all after getting rid of tapes/MD and quite a lot of the vinyl/CD sometime I think he hold up the entire UK music scene on his own.

    My point is you don't need to download to have to much music just spend all your time in record shops.

    --
    Most Damage is done by people who are AWAKE
  52. Seriously? They included duplication? by pclminion · · Score: 1

    The number is meaningless, because "duplication" is arbitrary. Where do you draw the line? If duplication means "copying data from one place to another" then data is duplicated every time function parameters are pushed on the stack, every time memcpy() is called, every time something is loaded from disk into RAM. I could write a simple loop that copies a 32-bit quantity from EAX to EBX three billion times per second. If you include all that shit going on, I bet their number would be higher by a factor of a thousand or more. What the fuck is the use of this metric? How do you justify the arbitrary stopping point?

  53. Re:Hardware: "Digital Universe" Enters the Zettaby by iamhassi · · Score: 1

    Where have you been, stuck in 2000? I said HD home movies and photographs. First, average joe can't hardly find a camera that's not digital since walmart only sells 2 cameras that still use film.

    Second, you can't buy a camcorder that's not flash or hard disk. Yep, you heard me: Walmart only sells 2 camcorders that record directly to DVD, the other 150+ are all flash and hard drive. The camcorder offering the smallest hard drive capacity is still 80gb for a paltry sum of $350 and HD camcorders start at only $89.

    So it is not I that is projecting my geek lifestyle on the world, it is you who is out of touch with modern consumer electronics.

    --
    my karma will be here long after I'm gone
  54. Re:Hardware: "Digital Universe" Enters the Zettaby by d474 · · Score: 1

    Someone please Mod parent +Funny - Did no one get the irony?

    --
    Authority questions you. Return the favor.
  55. Re:Hardware: "Digital Universe" Enters the Zettaby by Anonymous Coward · · Score: 0

    In the world of home storage, 75% is definitely way too low. The average personal desktop probably has 20 to 40 gigabytes of used storage, with far less than 1 gigabyte being original content. If they also back up this data, the fraction grows even lower.

    Also, it implies that there a lot of un-backed up data out there.

  56. Re:Hardware: "Digital Universe" Enters the Zettaby by Rockoon · · Score: 1

    Are you kidding? I bought, at retail, a Sony cam, with 60GB internal drive and HD resolution 2 years ago for $900 from Costco.

    Yeah.. thats means everyone has a Sony cam with a 60GB hard drive.. oh wait.. NO IT DOESNT

    It means that you are a geek. Most people do not own *ANY* camcorder.

    You heard me. Most people do not own a camcorder.

    Let me repeat that one more time. Most people do not own any camcorder.

    --
    "His name was James Damore."
  57. Re:Hardware: "Digital Universe" Enters the Zettaby by Rockoon · · Score: 1

    Average Joe does not even LOOK for a camera, let alone buy one.

    You are projecting your own lifestyle onto others. The average person does not own a digital camera, and most of the ones that do are sporting one integrated into their pay-as-you-go $30 cell phone.

    The average person does not have a smart phone. The average person does not have a camcorder. The average person does not have a digital camera. The average person doesnt even have a game console. They have a laptop which they send email with. Thats it. A laptop for email.

    --
    "His name was James Damore."
  58. Re:Hardware: "Digital Universe" Enters the Zettaby by grrrl · · Score: 1

    You know people who don't own a digital camera? Really? I don't know anyone who _doesn't_!

  59. Anybody have a link to the torrent? (n/t) by gmhowell · · Score: 1

    Anybody have a link to the torrent?

    --
    Jesus was all right but his disciples were thick and ordinary. -John Lennon
  60. Re:Hardware: "Digital Universe" Enters the Zettaby by iamhassi · · Score: 1

    LOL he's joking, obviously. Average Joe does not have a digital camera, smartphone, camcorder, game console... all they own is a laptop for email

    --
    my karma will be here long after I'm gone
  61. Re:Hardware: "Digital Universe" Enters the Zettaby by grrrl · · Score: 1

    ha yeah it was pretty dead-pan humour (not that that's a bad thing...)

  62. Do YOU have a degree in CSC or CIS, gmhowell by Anonymous Coward · · Score: 0

    gmhowell, do you even have a CSC or CIS degree? That's a better question. Answer = No. What exactly then qualifies you as somekind of expert on the subject material of computing on this forums then?? Your talking??? LMAO, not.

  63. If only you had a CSC or CIS degree, talker by Anonymous Coward · · Score: 0

    gmhowell, do you even have a CSC or CIS degree? No. What exactly then qualifies you as somekind of expert on the subject of computers around here then?? Your talking??? LMAO, not. You're just another dime-a-dozen slashdot ne'er do well.

    1. Re:If only you had a CSC or CIS degree, talker by Anonymous Coward · · Score: 0

      Oh look, it's the troll that's been stalking Clone53421, Squiggleslash, GM Howell, and Tom Hudson.

      For those new to this, the troll is someone claiming to be a defender of a certain Alexander Peter Kowalski, the author of a tool, apkapp2backgrounddaemonprocessengine, generally considered malware by a large number of anti-malware companies and organizations.

      CA
      PestPatrol
      SpywareDB ("Dangerous!")
      Freedom Anti-Spyware
      Spycheck (Spanish-language) - "Recomendacion: DESACTIVAR Y ELIMINAR"
      Spyware No-More ("Threat risk: High risk", "Advice: Remove This is a very high risk threat and should be removed immediately as to prevent harm to your computer and / or to protect your privacy")

      Mr Kowalski, or his admirer, got upset because someone had the audacity to link to a threat describing Kowalski's attempts to remove some embarrassing comments posted under his name. Rather than deal with it maturely, this person has been attempting to stalk said poster and those who pointed out Kowalski wasn't doing himself any favors.

      So if you see these comments posted as replies to clone, squiggleslash, gmhowell, or Tom, now you know why they're appearing. And if you feel like joining in, making it clear to Mr Kowalski that spamming, sliming, and stalking are unacceptable, well, come on in, the water's lovely!

  64. 1.2 ZBy is about 1/8th of a gram mol of bits by Richard+Kirk · · Score: 1

    1.2 ZBy is about 1/8th of a gram mol of bits. With molecular memory, it all ought to fit on a few grammes of carbon.

  65. Re:Hardware: "Digital Universe" Enters the Zettaby by plan10 · · Score: 1

    No it means he bought a camcorder sometime in the last 5 years.

  66. Zeta is one with 21 zeros by bigsexyjoe · · Score: 1

    In plainer language it is one with 21 zeros. I personally didn't know petabyte off the top of my head.

  67. Do you have a degree in CSC or CIS, loser? by Anonymous Coward · · Score: 0

    See subject and of course not. gmhowell = yet another "slashdot wannabe expert", albeit with no degrees in computer science or computer information systems.

    1. Re:Do you have a degree in CSC or CIS, loser? by squiggleslash · · Score: 1
      Oh look, it's the troll that's been stalking Clone53421, Squiggleslash, and Tom Hudson, and now GMHowell.

      For those new to this, the troll is someone claiming to be a defender of a certain Alexander Peter Kowalski, the author of a tool, apkapp2backgrounddaemonprocessengine, generally considered malware by a large number of anti-malware companies and organizations.

      CA
      PestPatrol
      SpywareDB ("Dangerous!")
      Freedom Anti-Spyware
      Spycheck (Spanish-language) - "Recomendacion: DESACTIVAR Y ELIMINAR"
      Spyware No-More("Threat risk: High risk", "Advice: Remove This is a very high risk threat and should be removed immediately as to prevent harm to your computer and / or to protect your privacy")

      Mr Kowalski, or his admirer, got upset because someone had the audacity to link to a threat describing Kowalski's attempts to remove some embarrassing comments posted under his name. Rather than deal with it maturely, this person has been attempting to stalk said poster and those who pointed out Kowalski wasn't doing himself any favors.

      So if you see these comments posted as replies to clone, squiggleslash, Tom Hudson, or GM Howell, now you know why they're appearing. And if you feel like joining in, well, come on in, the water's lovely!

      --
      You are not alone. This is not normal. None of this is normal.
  68. Re:Hardware: "Digital Universe" Enters the Zettaby by Mycroft_VIII · · Score: 1

    I dunno, my anecdote agrees with the other guy. My dad had a digital camcorder and was looking into upgrading. This is the man who screw up a mostly locked down laptop he only used for web browsing and email (o.k. mostly instead of entirely locked down was my mistake.)
          I have a 20 year old sub $10/hour employee with a mid-range blackberry, and the rest of them probably average $200 phones (a couple of geezers like me and two single moms bring that average way down).
          My thinking is look at the prices and availability of them, that suggests a fairly high number of people have them, I'd guess (just that a guess) well over 20%, but probably not as high as 60%, say 1/4 to 1/3.

    Mycroft

    --
    https://signup.leagueoflegends.com/?ref=4c3ed6600b6ea
  69. Do you have a CSC or CIS degree, squigglenobody? by Anonymous Coward · · Score: 0

    Do you have a degree in CSC or CIS squiggleslash? No?? We thought not. You're by no means an expert to comment on anything in the art and sciences of computing then. The same things you note have happened to others that write good softwares, such as Dr. Mark Russinovich of Microsoft even and also Nir Sofer of Nirsoft, to name only a couple. Your lack of expertise is showing itself squiggleslash. Go get a degree and then get back to us, because until then? Your credibility as an expert is non-existent.

  70. gmhowell's now replying as AC to avoid answering by Anonymous Coward · · Score: 0

    Do you have a degree in CSC or CIS gmhowell? No?? We thought not. You're by no means an expert to comment on anything in the art and sciences of computing then. The same things you note have happened to others that write good softwares, such as Dr. Mark Russinovich of Microsoft even and also Nir Sofer of Nirsoft, to name only a couple. Your lack of expertise is showing itself gmhowell. Go get a degree and then get back to us, because until then? Your credibility as an expert is non-existent!

  71. Re:Do you have a CSC or CIS degree, squigglenobody by Anonymous Coward · · Score: 0

    The same things you note have happened to others that write good softwares, such as Dr. Mark Russinovich of Microsoft even and also Nir Sofer of Nirsoft

    This has been debunked multiple times already. Alex Peter Kowalski's code wasn't fleetingly described as malware, it is currently classified this way by multiple anti-malware organizations, who have placed it under that category for several years.

    You want to talk about libel? What about pretending that Russinovich's software is comparable to that written by an author of software widely considered malware who promotes snake-oil anti-virus solutions.

    And continue pretending that nobody's qualified to slap you down. Until you can show that any of us also writes tools considered malware, and "anti-virus guides" that, by your own admission, do not work (USE IT AND YOU STILL GET TWO VIRUSES A MONTH?!), you don't have a leg to stand on.