'Digital Universe' To Add 1.8 Zettabyte In 2011
1sockchuck writes "More than 1.8 zettabytes of information will be created and stored in 2011, according to the fifth IDC Digital Universe study. A key challenge is managing this data deluge (typified by the Large Hadron Collider at CERN, which generates 1 petabyte of data per second)."
the experiments may generate the PB per second but most of the data is rejected before it hits any storage system...
Indeed, only about 25PB are stored every year from the LHC.
Can we get that in a proper measurement like Libraries of Congress.
Time to offend someone
I wonder how much of that data is redundant. I know that for one of my side projects I have "redundant" data that I got from the Minnesota DNR, various MN counties, the state legislature, and the federal gov. Even after it had been preprocessed and trimmed down so it only has what I care about it is still around 12GB of vector data which is about 1/3 the original size.
Time to offend someone
Don't worry, the large size won't be an issue. You can put it in a ZIP and then put that into another ZIP and so on.
That's just stupid. They use the same compression algorithm!
Put it in a ZIP in a TAR in a RAR in a 7z in an ACE in a bZip in a CAB in a dmg in a a ARJ, and finally save it as a GIF. You can't use JPEG as it's lossy.
Finally had enough. Come see us over at https://soylentnews.org/
So it generates 1PB of data per second, yet from the article "[T]he data comes from the four machines on the LHC in which the collisions are monitored â" Alice, Atlas, CMS and LHCb â" which send back 320MB, 100MB, 220MB and 500MB"
That's a few orders of magnitude short of 1 Petabyte, folks. Where are these numbers coming from?
Finally had enough. Come see us over at https://soylentnews.org/