800 Megs of Data Per Person Last Year?

umm..repost by Anonymous Coward · 2003-11-02 05:13 · Score: 4, Funny

i bet half this increase is due to the number of slashdot reposts also increasing over the same timespan.

FP by jusdisgi · 2003-11-02 05:13 · Score: 1

...but didn't we see this the day before yesterday?

--
Given a choice between free speech and free beer, most people will take the beer.

Re:FP by GaelenBurns · 2003-11-02 05:20 · Score: 1

This is just the Slashdot editors way of taking our attention off the real news out there - the Diebold memos. They're all part of the same conspiracy to keep us "shuffling slack-jawed in the same direction."

What about redundant information? by gmplague · 2003-11-02 05:14 · Score: 1, Redundant

The article fails to address the issue of redundant information, so this number I'm sure is inflated. It raises an interesting question though, to what extent are we becoming more redundant in our data storage? Once we answer that, we also answer exactly how much new information is being generated per person, per year.

--
__________________________________________
Take comfort in your ignorance.
Grandmaster Plague

Re:What about redundant information? by t0rnt0pieces · 2003-11-02 05:16 · Score: 5, Funny

The article fails to address the issue of redundant information

You mean stuff like this?

--
Karma: Excellent (In Soviet Russia, karma pimps YOU)
Re:What about redundant information? by FiloEleven · 2003-11-02 05:17 · Score: 3, Funny

Funny you should mention redundant information, as THIS IS A DUPE. Woo! 2.8% of the 800 megs per person average comes from Slashdot dupes alone.

--
Your brain is not a computer.
Re:What about redundant information? by Davak · 2003-11-02 05:22 · Score: 4, Interesting

Bravo.

Is a cracked version of some latest software package new data?

Honestly, only about 25k in the *.exe has been changed... but this would count as a doubling of information, hard drive space, whatever.

Likewise, when we chart medical information, we often duplicate the information from note to note to remind ourselves and others about the important aspects of the patient's history. Really...it's just data duplication.

Davak
Re:What about redundant information? by lanswitch · 2003-11-02 05:31 · Score: 3, Interesting

how much new information is being generated per person, per year
When a person rips a cd to mp3, does s/he create new data? And when s/he copies existing mp3's, is that new data? I think it's impossible to define the term "new information", unless you go with the strictest definition: before the information was digitally genarated, it did not exist in any other form. Only then it is new.
Re:What about redundant information? by lanswitch · 2003-11-02 05:52 · Score: 1

Data on a cd is digital. A copy of digital data is not new information, as far as I'm concerned. Maybe I'm missing something?
Re:What about redundant information? by mschaffer · 2003-11-02 05:57 · Score: 1

After all, how many people downloaded the same p0rn files to their computers?
Re:What about redundant information? by chabotc · 2003-11-02 06:11 · Score: 1

Thats a somewhat shortsighted statement. Ofcource information is replicated a lot, but that goes for paper and magnetical storage alike.. Think how often we zerox things, books and papers are printed, etc.

The article mentions "producing information", and not creating unique content for induviduals..

No Shock by GOPWillC · 2003-11-02 05:15 · Score: 1

It's no shock since this message is data that I'm generating, my dentist appointment generates data, my email generates data, etc.

With HD's so cheap.. by flamingantichimp · 2003-11-02 05:16 · Score: 1

With data/harddrives so cheap now a days, most people don't even take notice to what they are filling up. I can only see this number growing since there is very little reason not to have data for people (since the data is so cheap)
but I'm no expert.

Go for 900 by AppHack · 2003-11-02 05:17 · Score: 1

The Slashdot editors are trying to achieve 900 megs of space per user by duplicating the article over and over.

Then they can duplicate the 900 megs article. :)

Re:Go for 900 by akeyes · 2003-11-02 06:12 · Score: 1

Only go for 900? Try something like doubling the current like 1600Mb/person.

Makes you feel... by Chordonblue · 2003-11-02 05:17 · Score: 4, Interesting

...all warm and fuzzy inside doesn't it? All that data about you and yours? Think it'll get any better? Think again.

It amazes me how much people don't think about privacy anymore. How the concept of supermarket sales has given way to 'Bonus Cards' which track what you buy. Few understand how this information can be used to piece together a bigger picture.

Some Wachovia Bank branches are now requiring a FINGERPRINT before you can cash a check. The situation is this: If you are not a customer, you are now required to give them a an electronic finger scan to cash a check made out under Wachovia.

Where does it end? Should I just give them a hair sample now or wait until my implant is required?

--
"...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."

Re:Makes you feel... by Anonymous Coward · 2003-11-02 05:53 · Score: 3, Funny

Dude, lighten up and stop being paranoid.

The NSA doesn't care which Mary Kate & Ashley video is your favorite.
Re:Makes you feel... by Chordonblue · 2003-11-02 05:58 · Score: 3, Insightful

What is it about a 'slippery slope' that you don't understand?

You're a great example of who I mean. No consideration at all...

--
"...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."
Re:Makes you feel... by jeffmeden · 2003-11-02 06:18 · Score: 1

According to the article, the 800mb/person is just a figure of all the data divided by all the people. Its not like every phone call you make is getting recorded in a database somewhere (despite what some may think, that kind of extremeness is beyond banks require a fingerprint? Banks always need to know who you are before they go and hand you a stack of cash. Thats never changed, and i doubt they will be requiring RFID implants, unless incidences of passing bad checks continues to rise, in which case they will react the way anyone would in trying to protect their money. Supermarket grocery cards are a bit appaling in their ability to track information, but is knowing that you bought less butter this month at a certain store and more cheese at that store really going to lead to the end of privacy?

The simple fact is, at present, that its simply not economical to invade everyones privacy, all the time. When that changes, i.e. wireless cameras become so cheap that they show up on every streetsign and lamppost, then we can worry.

Jeff
Re:Makes you feel... by jeffmeden · 2003-11-02 06:20 · Score: 1

(despite what some may think, that kind of extremeness is beyone feasability at the current time)
Re:Makes you feel... by Anonymous Coward · 2003-11-02 06:26 · Score: 1, Insightful

I reckon you don't understand the "fallacy" part.
Re:Makes you feel... by brunes69 · 2003-11-02 06:29 · Score: 1

Where does it end? Should I just give them a hair sample now or wait until my implant is required?

Neither. Under US law the cops don't need a warrant for anything you willfully disregard, and that extends to bodily waste. All they need to do is follow you around for a few days till they see one of yours fall, everyone loses hair constantly 24x7. Either that or they can just go through your trash and find a comb / hairbrush / kleenex, also without a warrant
Seriously, your paranioa is way too little, way too late. It's even against the law in most states now for an adult to walk around without carrying ID. So unless you're a homeless hobo, you're being tracked. Everywhere. .
Re:Makes you feel... by Chordonblue · 2003-11-02 06:36 · Score: 1

I don't feel comfortable giving my fingerprint to just anyone - particularly a company that I have no business relationship with. In other words, I don't trust them. I'm just there to cash a check!

Oh, I'm sure they have standards in place, but there are exceptions - people make mistakes that aren't necessarily criminal but end up looking that way. See 'Brazil' sometime for an extreme example of this sort of thing. You know, it's closer than you think.

--
"...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."
Re:Makes you feel... by espo812 · 2003-11-02 07:12 · Score: 1

Under US law the cops don't need a warrant for anything you willfully disregard, and that extends to bodily waste.
The same applies for everyone in the country, not just cops. Also, this is a practice that I don't have a problem with. If you are discarding something, why should you have any expectation of controlling it?

--

espo
Re:Makes you feel... by gl4ss · 2003-11-02 07:34 · Score: 1

the slippery slope isn't as slippery as you think(it's more like a cliff anyways).

totally tyrannical survey system of every individual doesn't need you to know their groceries, have logs of their habits, all you need is some weasels.

1984 equivalent system doesn't depend on high tech, it depends on enough fucked up people. take a look on what east germany was, high tech surveillance(rfid tags, face recog cameras, whatnot) were not needed, just enough people spying each other. spying on people stepping on their privacy has been possible for ages, what's needed is a functional goverment that doesn't do it and makes sure nobody else misuses their records either(and sees that there are no unneceassary records of private information).

just like the system should(and to some extent, in some countries) works already. the cliff is that the gov(that has the real power ultimately, throug army and police, how it got it's position varies from country to country) decides that it's more important for them to access the information and let others do so as well than it is to have privacy.

--
world was created 5 seconds before this post as it is.
Re:Makes you feel... by brunes69 · 2003-11-02 07:39 · Score: 1

The reason it's a problem is because this, combined with cheap DNA checking nowadays, makes EVERYONE's genetic profile available to someone with a few grand to blow.

No one can avoid shedding DNA, it is everywhere. It shouldn't be covered under the "willfully discarded" umbrella, but since it is, all bets are off.
Re:Makes you feel... by Safety+Cap · 2003-11-02 08:36 · Score: 1

Sez you, but not the fine folks at Menwith Hill, England.
Did you know, for example, that th--
#^$~(NO CARRIER

--
Yeah, right.
Re:Makes you feel... by glitch23 · 2003-11-02 10:47 · Score: 1

Where does it end? Should I just give them a hair sample now or wait until my implant is required?

I just watched the movie Gattaca last night. Don't scare me with the hair sample thing. That picture of reality is just way to close to that movie.

--
this nation, under God, shall have a new birth of freedom. -- Lincoln, Gettysburg Address
Re:Makes you feel... by turgid · 2003-11-02 21:29 · Score: 1

Either that or they can just go through your trash
Where I come from (UK) it's considered theft to remove something from someone's trash without asking consent. (I got a couple of PCs from a skip outside someone's house by knocking on the door and asking politely :-> )
So unless you're a homeless hobo, you're being tracked. Everywhere.
Hmmmm.... Possibly. But why the heck would anyone want to track me all the time? Seriously, I can't think why.

--
Stick Men
Re:Makes you feel... by 16K+Ram+Pack · 2003-11-02 22:04 · Score: 1

now required to give them a an electronic finger
My sentiments exactly!

Excess... by PaintyThePirate · 2003-11-02 05:17 · Score: 1

800 megabytes per person, and yet most people have 40+ gig hard drives... Is there something wrong here?

--
eclecti.cc

Re:Excess... by ceejayoz · 2003-11-02 05:26 · Score: 2, Insightful

Are you an idiot, or just trying to be funny?

Ethopian refugees are counted in the "total number of people in the world", yet they probably don't own a hard drive.
Re:Excess... by Rosco+P.+Coltrane · 2003-11-02 05:29 · Score: 1

Ethopian refugees are counted in the "total number of people in the world", yet they probably don't own a hard drive.

I'm willing to bet the INS produces a lot of data for them (and about them) when they apply for residency under the status of refugee ...

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash
Re:Excess... by PaintyThePirate · 2003-11-02 05:44 · Score: 1

I was joking, but I decided to do a little research about it. There are approximately 500,000,000 PCs out there (not including servers, etc.), and 4,800,000,000,000 megabytes of data was created per person last year. Thats about 10 gigs per computer... which is a lot per year, so I suppose I should shut up now.

--
eclecti.cc
Re:Excess... by PaintyThePirate · 2003-11-02 05:47 · Score: 1

Oops, I meant to write "total", not "per person".

--
eclecti.cc
Re:Excess... by ceejayoz · 2003-11-02 08:51 · Score: 1

I'm willing to bet it's significantly less than 800 megabytes each, unless they're taking full-length video of interviews at immigration.

Only 800Mb per year by jonbryce · 2003-11-02 05:17 · Score: 1

That doesn't sound like a lot. Slightly more than 1 CD, or about 12 hours on the phone every year.

Re:Only 800Mb per year by bersl2 · 2003-11-02 05:26 · Score: 5, Funny

Personally, I find that figure insulting.

I mean, I'm worth at least a gig...

Obvious Joke by Ex+Machina · 2003-11-02 05:17 · Score: 1

And how much of that data was a duplicate?

Re:rip in the matrix? by GOPWillC · 2003-11-02 05:17 · Score: 1

Too much coffee this morning?:P

More and more data by Davak · 2003-11-02 05:18 · Score: 3, Interesting

An audio producer may lay down gigs of tracks for one song. In my research lab we burn a DVD almost full of new data each day. In the hospital we record more and more detailed information into our systems.

Sadly, an assload of this information is useless, useless, useless. I spend more time detailing information in the medical chart than I actually spend with the patient.

In my lab, more data is better... however, when it's just useless information to keep the shark lawyers off my back it's a bad thing.

Davak

Re:More and more data by Richard_at_work · 2003-11-02 05:25 · Score: 3, Funny

How many Libraries of Congress is an "Assload"? Is the data easily retrievable?
Re:More and more data by Davak · 2003-11-02 05:39 · Score: 3, Funny

Definition of assload

Assload is a relative term... like "a lot"

Normally one wouldn't ask "how many Libraries of Congress is a lot?"

/end of my stupid point
Re:More and more data by MythoBeast · 2003-11-02 06:42 · Score: 1

Oddly enough, real life is starting to resemble the Usenet. A few tidbits of useful information floating in a sea of refuse. This is why effective information mining tools are necessary. I quake with fear at the thought of Microsoft taking over Google.

--
Wake up - the future is arriving faster than you think.
Re:More and more data by EvilTwinSkippy · 2003-11-02 06:56 · Score: 1

What, they are buying out Quake too! Egads the next version will never be out at this rate.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming
Re:More and more data by confused+philosopher · 2003-11-02 08:47 · Score: 1

And when you think about how much bit space could be saved by using superior compression methods, and lower quality storage techniques for low priority data, you can't help but cry. Well, I can't anyway, but I guess I'm overly sensitive about data storage.

--
Why slashdot? Why not?
Re:More and more data by Weirsbaski · 2003-11-02 10:38 · Score: 1

How many Libraries of Congress is an "Assload"? Is the data easily retrievable?

Yeah, but you probably won't like the interface...

--

I am not a sig.

This article is so clearly a dupe... by FrankoBoy · 2003-11-02 05:18 · Score: 2, Informative

Damn... Go read the first article from like frou days ago...

--

United States of America, good ol' backers of world peace.

Re:This article is so clearly a dupe... by FrankoBoy · 2003-11-02 05:21 · Score: 1

Bleh, "four" days ago...

--

United States of America, good ol' backers of world peace.

As this is a repeat by Epistax · 2003-11-02 05:18 · Score: 1

How about we talk about something else?

How about them sporting events?

Thats an interesting unit. by -kertrats- · 2003-11-02 05:19 · Score: 1

It would take 500,000 Libraries of Congress to equal five exabytes. Since when does a 'Library of Congress' qualify as a unit of storage? Yes, i realize that they were trying to give a comparison, but it looked very odd to me.

--
The Braying and Neighing of Barnyard Animals Follows.

they've cloned it! by Celt · 2003-11-02 05:20 · Score: 1

News for nerds, news that reapeats....

--
"WebTV: bringing the Internet into the shallow end of the gene pool since 1995" - Martin Bishop

weird by larry+bagina · 2003-11-02 05:20 · Score: 1

I just got a banner ad (on slashdot) for slashdot. "Missed yesterday's news? Read Slashdot!" So, now the fact that you dupe stories and post old shit days, weeks, or months after the fact is a selling point?

Where do I subscribe again?

--
Do you even lift?

These aren't the 'roids you're looking for.

It's really 400. by Eric_Cartman_South_P · 2003-11-02 05:22 · Score: 4, Funny

If everything is posted twice, like on slashdot, and like THIS story, that 800 MB is really 400 MB.

Also, if If everything is posted twice, like on slashdot, and like THIS story, that 800 MB is really 400 MB.

Re:It's really 400. by redhog · 2003-11-02 05:49 · Score: 1

The question, then, is if _your_ comment is also poste twice...

--
--The knowledge that you are an idiot, is what distinguishes you from one.
Re: It's really 400. by debilo · 2003-11-02 06:18 · Score: 1

After reading your post, I find your .sig pretty ironic!
Re:It's really 400. by EvilTwinSkippy · 2003-11-02 07:20 · Score: 1

If you haven't seen it before, it's new to you.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming
Re:It's really 400. by giminy · 2003-11-02 10:39 · Score: 1

Slashdot really needs the "reason" for moderation separate from the "score."

I, for one, would moderate this comment "+1, Redundant".

--
The Right Reverend K. Reid Wightman,

It would be interesting by Pingular · 2003-11-02 05:25 · Score: 1

to find out how many mb of data people write on paper per year. I suppose you'd have to take a sample of about 10000 people, then enter all the things they write in a year into some handwriting recognition program. Of course it'd take less time than normal handwriting recognition because the program would only have to scan for the number of letters, rather than what the letters actually were.

--

When anger rises, think of the consequences.
Confucius (551 BC - 479 BC)

800M of what data? by Rosco+P.+Coltrane · 2003-11-02 05:26 · Score: 1

If it's intelligent reflected work ppl produce, I would think it's less. If it's blogging, grocery lists, Slashdot articles (yes, dupes make the number rise quite a lot probably), then that's probably a lot more than that.

Oh well, besides, I don't really know what that amounts to, the official Internet storage unit being the Library of Congress ...

--
"A door is what a dog is perpetually on the wrong side of" - Ogden Nash

Re:in case of slashdotting..... by Davak · 2003-11-02 05:26 · Score: 1

The parent post is an obvious example of the cause of the information explosion. :)

Who cares though? As data expands, it gets cheaper and cheaper to store it...

Re:rip in the matrix? by lhaeh · 2003-11-02 05:27 · Score: 1

Just woke up, still in bed actually - not all that ergonomic...
25mg of dexedrine is a bit better then coffee :)

Here is the link if someone hasn't posted it already:

http://developers.slashdot.org/article.pl?sid=03 /1 0/29/1355259&mode=thread&tid=137&tid=188&tid=1 98

Get your units straight by FrankoBoy · 2003-11-02 05:27 · Score: 2, Funny

As discussed in this previous thread - following an article ABOUT THE SAME SUBJECT - the new unit is the Great Pyramid.

Someone should program a calculator to convert all these units from one to another. Elephants to Great Pyramids, Great Pyramids to K-Marts, K-Marts to Libraries of Congress... Now that'd be innovation for ya !

--

United States of America, good ol' backers of world peace.

Re:Get your units straight by FrankoBoy · 2003-11-02 05:38 · Score: 1

Nah... Peep this, it's just not working ;)

--

United States of America, good ol' backers of world peace.

Thats it? by Bob+McCown · 2003-11-02 05:29 · Score: 1

800 Megs? I grab that in pr0n every day!

Wow I bet I'm the first to notice! by mercan01 · 2003-11-02 05:32 · Score: 1

Ha, I bet no one else noticed that this article was a dupe! I'll ebt he first to make a witty comment on dupes accounting for half of all the data! ...oh, wait...

Sjure by Cpt.+Fwiffo · 2003-11-02 05:32 · Score: 1

Hmmmm...

800 megs of data... Would that be the server logs of the FTP's which I get my stuff? :)))

Hmmm. Well, that and probably the 20+megs of spam I get to have every month would make it easy to put it up to 800 megs/year...

Koos

My take on that figure by Ignorant+Aardvark · 2003-11-02 05:34 · Score: 1

Doesn't that seem ... I dunno, a low? Granted, I have hundreds of gigabytes of hard drive storage filled, but I didn't create any of it ... the movie studios, TV studios, and game studios did.

However, the stuff I do create are digital pictures. Lots of them. I take everything in 1600x1200 resolution, so each image is about 800KB, and my camera has a 128MB flash card. I fill it up quite often. I'd say I take on average 20 pics a day (which averages out to around 6 GB per year), and that's just in pictures!

What about save game files - do those count? And I also create text files, but those probably don't total over a few megabytes per year.

I don't shoot videos or record songs or anything - but yet I do enough data creation for 10 other people. I shudder to think how much information people who shoot digital videos are creating.

--
Cyde Weys Musings - Scrutinizing the inscrutable

So then... by TerminalInsanity · 2003-11-02 05:34 · Score: 1

i downloaded enough music for 31.43875 people...

Everyone relax by konmaskisin · 2003-11-02 05:34 · Score: 2, Interesting

it's not like we have to read it all. Most of it is as important as receipts for toilet paper (and production, shipping and marketing data for said ass-wipe).

The medium is the message ... and if we look for patterns we'll see the forest and know what are the important bits. Plus we'll have the ability to search for individual trees instantaneously.

world: USING LINUX since 1991!!

sco: SUING LINUX til 2011!!

What? by SirChris · 2003-11-02 05:35 · Score: 1

No way, I get that much per movie.

this is a duplicate article from the other day by RouterSlayer · 2003-11-02 05:36 · Score: 1

come on now, don't you guys check?
This is an obvious duplicate...

must be damn slow cuz there's no SCO news ;)

This would be more interesting by mcc · 2003-11-02 05:36 · Score: 5, Insightful

If in generating the average they could discount the extremes.

Some of us go through a truly silly amount of data. There's a nontrivial number of people reading this discussion who exhaust their dorm's 1 GB bandwidth cap every day.

On the other hand there's somewhere a barefoot palestinian refugee child for whom not so much as a piece of paperwork was generated since he was born.

These two extremes would probably tend to distort things. It would be interesting to find out if the study was based on usage of storage data as it appears and these extremes were included in the study, or if they just (being Americans) couldn't be bothered when compiling their study to talk to geeks and starving african children. If the former, i'd be curious how their results would change if they could somehow just like chop off the ends of the bell curve.

--
Irritable, left-wing and possibly humorous bumper stickers and t-shirts

Re:This would be more interesting by HoldenCaulfield · 2003-11-02 07:05 · Score: 1

Your post is a little confusing. You mention the two extremes and suggest that the amount of data per person is a bell curve. By definition a bell curve is a normal distribution and is symmetric. If the distribution really is a bell curve, and you ignored your outliers, you'd have the same average of 800 megs per person . . .

The question really should be is it a normal distribution - I would guess yes . . .
Re:This would be more interesting by MrIcee · 2003-11-02 07:18 · Score: 1

On the other hand there's somewhere a barefoot palestinian refugee child for whom not so much as a piece of paperwork was generated since he was born.
While I certainly agree with your post (except for the bell curve part) - I should like to point out that I was born a barefooted palestinian refugee - though these days I utilize many gigabits a day. While your correct in the fact that many people around the world do not access technology - nor have electronic (or even paper) records on them (I, for example, have no birth certificate) I suspect that number is shrinking at an alarming rate.
Re:This would be more interesting by mcc · 2003-11-02 07:42 · Score: 1

By definition a bell curve is a normal distribution and is symmetric.

Oops. Me == uninformed :) Sorry.

Please pretend that in the place I said "bell curve", I instead said "distribution curve".

--
Irritable, left-wing and possibly humorous bumper stickers and t-shirts
Re:This would be more interesting by Apple+Acolyte · 2003-11-02 18:42 · Score: 1

If the so-called "Palestinians" weren't permanently invested in the Jew-murdering, destruction of Israel mindset, perhaps they wouldn't have so many problems. And for the record, the Arabs of Israel enjoy a higher standard of living and more rights than their brethren who reside in Arab homelands. And if you think I'm lying, try to figure out why a supposedly repressed, victimized population continues to grow by leaps and bounds. Apparently things aren't as bad as our Salinger devotee would admit.

--
Part of the hardcore faithful who believed in Apple long before it was cool again to do so

Hm, that's funny... by Shoten · 2003-11-02 05:36 · Score: 2, Funny

I thought the pr0n I was in amounted to WAY more than 800 MB last year...

--

For your security, this post has been encrypted with ROT-13, twice.

Re:Hm, that's funny... by No+Such+Agency · 2003-11-02 12:25 · Score: 1

Well, I've been collecting pr0n with you in it for three years, and I only have 2300 MB of it so far. Is there a newsgroup or something I don't know about? "alt.sex.pictures.shoten"?

--
Freedom: "I won't!"

Relevance...??? by cribcage · 2003-11-02 05:42 · Score: 1

US researchers estimate that every year 800MB of information is produced for every person on the planet.

The vast majority of the global population neither owns computers nor uses the internet. (Check statistics.) So what, exactly, is the relevance of this estimate?

--

Please don't read my journal

Powerpoint effect by buus · 2003-11-02 05:42 · Score: 1

You have to wonder how storage business data would take if you could compress powerpoint/word docs into ascii.

This is DUP - better check out this by kamzik · 2003-11-02 05:43 · Score: 1

NTFS write support for Linux!!

The stuff in this post was already discussed in Info Glut - Five Exabytes of Data Created in 2002

Re:rip in the matrix? by Pingular · 2003-11-02 05:44 · Score: 1

Please enlighten me as to what your sig means, I tried decoding it myself but couldn't, and don't want to try all the decryption methods I know :)

--

When anger rises, think of the consequences.
Confucius (551 BC - 479 BC)

On the edge of the Singularity by headkase · 2003-11-02 05:45 · Score: 2, Informative

With information growning exponentialy, one must wonder if we're on the edge of the Singularity as anticipated by Vernor Vinge.

--
Shh.

Paper vs. Trees by BanjoBob · 2003-11-02 05:47 · Score: 1

The amount of information stored in books, journals and other documents has grown 43% in three years.

Too bad the number of trees has NOT increased 43% in the past 3 years. At this rate we'll have a naked planet in no time.

--
Banjo - The more I know about Windoze, the more I love *nix

Re:Paper vs. Trees by danny256 · 2003-11-02 07:17 · Score: 1

That's a stupid thing to say. If the number of trees on the planet was ever reduced significantly the price of wood/paper would go up really high, and so less people would use them. You need to read about the laws of supply and demand.

Judging by these posts... by whiteranger99x · 2003-11-02 05:51 · Score: 1

I'm pretty sure we'll have at least 800MB worth of "dupe article" posts by people who think such posts get funnier every time they do it! :\

--
Join the TWIT army now!

one big flaw in the study by antaeus · 2003-11-02 05:52 · Score: 1

They keep talking about all this information, but data would probably be more accurate. My guess is the actual information created in 2002 amounts to less than 1 kb/person, and that's probably being optimistic amount humanity.
(My contribution to the glut: long-shrift.blogspot.com

redundant ... like dupe articles ? by clarkie.mg · 2003-11-02 05:53 · Score: 2, Funny

This is the article a few days ago ...

--
Men are born ignorant, not stupid; they are made stupid by education. Bertrand Russel

As the Cree say... by dexter+riley · 2003-11-02 05:53 · Score: 1

Only after the last tree has been cut down
Only after the last river has been poisoned
Only after the last fish has been caught
Only then will you find you cannot eat data

My 800M or the other guy's? by valdis · 2003-11-02 05:58 · Score: 1

Do all the virus/worm generated mail that I get counted against my 800M since it's sent to me, or against the poor Microsoft user who didn't patch their machine in time?

And does eliminating spam/virus email make a noticable hit in the numbers, or is it not even counted?

More relevant problem by infolib · 2003-11-02 06:06 · Score: 1

Only after the last oil rig has been sunk
Only after the last supertanker has called port
Only after last gas station has closed
Only then will you find Greenpeace doesn't sell beer at night

--
Any sufficiently advanced libertarian utopia is indistinguishable from government.

Re:More relevant problem by dexter+riley · 2003-11-02 07:09 · Score: 1

You know, I always thought we should have less data and more beer.

Porn stars are the intelectuals of the future? by digitalgimpus · 2003-11-02 06:08 · Score: 1

They do much more than 800MB's.

Perhaps they are the academics of the future? ;-)

I'm gunna marry that genious I saw the other day.

800 Megs of Data Per Person... by neodymium · 2003-11-02 06:13 · Score: 1

800 Megs of Data Per Person Last Year - is this the amount of surveillance data the Department of Homeland Security was generating per person ?

Re:No, even less by Epistax · 2003-11-02 06:15 · Score: 1

No, it's less. No duplicate is really a duplicate.
The first time around, the article gets a bunch of discussion.
The second time around, the article gets the same discussion, same arguments. Then it gets an equal amount of "dupe" talk.
X + 2X = 800. "Content" = 800/3.

(insane referece) That's like 17 volkswagon beetles.

True it's a lot of info to create, but... by bconway · 2003-11-02 06:16 · Score: 1

...how much info is destroyed each year to offset these numbers. I mean shredded files, stuff thrown in trash, bills, deleted data files, discarded/lost storage media, etc... In the end (of each year), I wonder, what is the actual increase in stored information?

--
Interested in open source engine management for your Subaru?

Sounds about right. by bconway · 2003-11-02 06:18 · Score: 1

That's a believable number. Consider the amount of published data on Kazaa, or that 45 minutes of raw DV video is roughly 12.5 GB. Move 100 of your CD's to MP3s and you're consuming/creating roughly 3.5 GB (or more if you're using higher than 128kbps MP3s). And I'm not even commenting on pr0n.

--
Interested in open source engine management for your Subaru?

...and 3/4 of that are raw bitmap images by Spoing · 2003-11-02 06:29 · Score: 1

I helped out a coworker a few weeks ago, by pointing out the benifits of using a compressed loss-less format (PNG) over an uncompressed one (BMP). For one image, the size was reduced down to 5% of the original size -- surprising even me (large areas were one color).

While I used Pngcrush to squeek out the last few %, even the moderate compression offered by Photoshop was enough to make her happy for a week.

--
A firewall can not protect you from yourself. Turn off what you do not need. Do not use the firewall to do your work.

Re:...and 3/4 of that are raw bitmap images by EvilTwinSkippy · 2003-11-02 07:18 · Score: 2, Informative

Actually images with a large field of one color compress quite well. The compression algorythems sees "Oh blue again" and represents all the blue pixels as a single bit. Some just say "it's blue until I tell you otherwise" or "It's blue for the next 65242 pixels."
Images with complex shapes compress terribly. I was out at a botanical garden trying to photograph the ends of tree branches as they fork off into millions of buds. It looks crappy in JPEG form.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming

hmmmmm by AmoebafromSweden · 2003-11-02 06:34 · Score: 2, Funny

Thats a lot of porn...

Drawing the line.... by Chordonblue · 2003-11-02 06:39 · Score: 1

"Under US law the cops don't need a warrant for anything you willfully disregard..."

I agree, too little too late, but I wasn't talking about law enforcement; I was talking about a corporation! Big difference, or at least I think so.

--
"...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."

Re:Drawing the line.... by EvilTwinSkippy · 2003-11-02 06:59 · Score: 1

We are all shareholders in this country, we get to vote in all 614 members of the board, the CEO, and the 9 executors. That's more than we can say for any corporation out there.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming
Re:Drawing the line.... by brunes69 · 2003-11-02 07:41 · Score: 1

I was talking about a corporation! Big difference, or at least I think so.

ROTFL, if you think there's really any kind of a line between corperations and the government nowadays, you are either living in a fantasy land, or have been asleep for the past 50 years.

In a country where laws can be bought and sold, and rich CEOs can commit fellonys and get off scott free, the govenment *IS* a corperation.
Re:Drawing the line.... by Chordonblue · 2003-11-02 08:38 · Score: 1

Heh.. I believe this was my point exactly! Sorry this is just simply accepted behavior now... :P

--
"...Well, there's egg and bacon; egg sausage and bacon; egg and spam; egg bacon and spam; egg bacon sausage and spam..."

Re:Why Journals are popular.... by EvilTwinSkippy · 2003-11-02 06:54 · Score: 1

There are some exhibitionists out there, I'm sure.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming

what is information? by snarkh · 2003-11-02 06:54 · Score: 1

If you store the same image in two different resolutions, does the high-res version contain more information?

Even if you cannot see the difference?

What IS information by EvilTwinSkippy · 2003-11-02 07:14 · Score: 2, Informative

According to Claude Shannon's groundbreaking work on the subject, Information is measure by its surprise quotient. 80 bits that you are expecting to see carry no information at all. 1 bit that is a surprise is a tremendous amount of information.

The game of scrabble is a good illustration. Common letters (a,e,i,o,u,s) have 1 point. Uncommon letters (z,x) are 10 points. All letters have a different point score based on their frequency of use in the english language. (At least for the english version of the game. I know the scores on the letters are different for the German version at least.)

All of the modern compression algorythems work on this principle. They detect the parts of your "signal" that contain the least information, and convert them to a smaller form. Of course you have to know a bit about your signal before you can be good at all at predicting what is common or not.

LZW compression, for instance, is great at compressing text. Images OTOH LZW is not so good at. At least color images (GIF actually uses LZW.)

For full color images we use JPEG, which breaks the image into 8x8 tiles and then compares the tiles to the output of the inverse cosine transform. So instead of storing the actual RGB information it actually stores the coefficients of the transform needed to reconstruct the tile, and the varience of the original from the ideal.

MPEG uses a JPEG-like compression for key frames, and then simply stores what pixes change in between frames. Some implementations also attempt to compensate for motion, which is starting to get beyond what I can explain in the space provided.

Suffice to say information is the level of surprise inside a signal. It doesn't really matter what form of signal it is.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming

Re:What IS information by snarkh · 2003-11-02 10:13 · Score: 1

Using the notim, if you cannot tell the difference between two images than they are the same as far as the information is concerned, which seems reasonable.

Also surprize is a relative notion. Something might be surprising to me but not to you or the other way around.
Re:What IS information by EvilTwinSkippy · 2003-11-09 02:31 · Score: 1

Also surpize is a relative notion. Something might be surprising to me but not to you...
I think I heard a single hand clapping.

--
"Learning is not compulsory... neither is survival."
--Dr.W.Edwards Deming

Seems like significant overcount by Lulu+of+the+Lotus-Ea · 2003-11-02 07:21 · Score: 3, Interesting

The thing about information is that it's not quite so easy to count as the article suggests. If the question is solely one of how many magtapes to buy, sure the exabyte thing is interesting enough. But in a "human" sense, that's not all that interesting.

For example, the article cites 18 exabytes of what is basically analog data--sound and images--over telephone, radio, TV. It claims that 98% of that is in telephone calls, essentially all, in other words.

First thing is that most telephone calls are not recorded. Well, I dunno, maybe Carnivore and Eschelon are even worse than I think. So mostly this is just a question of how much bandwidth AT&T and MCI need to buy; I'm sure they care about that question, but most people have no reason to. Maybe how many tape drives the NSA needs to buy too.

Just how much information *IS* there in a telephone call though. At a certain level, ten million calls about the same snowstorm aren't really that information rich. But I understand that you want to hear YOUR sister complain about shoveling the snow, not somebody else's sister do so. But just at a technological level, how much is there to a phone call?

If I record the call as CD-Audio WAV format it comes to something like 9 MB a minute. But then, if I compress it to MP3, or Ogg Vorbis, or AAC, I'm down to something more like 1 MB a minute. In fact, if I go for a 56k bandwidth, or something along those lines, I can probably get it down to less than half a MB... and that's not really much different from what I could discern originally on my cell-phone on a noisy street, or over my old wiring in my house. So far, we've reduced the "information content" by 20 times by purely technial means. Then again, it's not clear if this is fair... in those cop shows where they reconstruct background noises to filter the gunshot or car crash in the background, they probably want the full original data... but do *I* care about that when I talk to my sister?

Moreover, audio compression is just the start. There's this old thing called TRANSCRIPTION that compresses quite a bit more. A stenographer (or maybe a computer program, at least at the NSA) can type up our conversation perfectly well. How much information is lost by reducing the "data" to:

Lulu's Sister: We got over 10" of snow, and it took me an hour to shovel it.

Even at the highest audio compression I can find, I need tens of kilobytes to encode this remark... as text we're down to a couple tens of BYTES. Maybe I've lost a little of Sis's inflection, but how much INFORMATION was there really, to start with? Some probably, but is it worth a thousand words? Moreover, I expect some lossy compression to reduce that text by at least another half.

Depending on just what you think is information, perhaps 300,000 times compression is possible. That brings exabytes down to gigabytes. Given some automated transcription technology, maybe I can store the whole last year of family chats on my local harddisk!

--
Buy Text Processing in Python

pfft by ShadowRage · 2003-11-02 07:22 · Score: 1

obviously they didnt count the software makers, or the people who encode movies and redistribute them (illegally, but still, it doesnt mean it doesnt happen)
I wonder how much of that report they left out..
I know 800 mb is an average, but it still seems a bit small for what most people do these days.

Well of course books are up by kfg · 2003-11-02 07:38 · Score: 1

Look at how many more of them we need just to tell us how to store things on magnetic and optical media.

KFG

What's the point of measuring "data"? by Kjella · 2003-11-02 07:54 · Score: 1

I mean, my thesis that I and a friend wrote, a full years work (half year x 2) was 2.2 megabyte, including front page, illustrations, graphs and tables. Yet when I digitize a short video clip from my video camera, it's literally gigabytes, at least until I compress it.

Unless you're a hard disk manufacturer, does its size have any relation whatsoever to its value? If the figure is correct, I can store all the data I create, over my entire life (say 100 years to make it simple) on a 80GB hdd.

Which is of course, complete rubbish. I could set up a DV cam (hey, even a webcam would do fine) and record everything that happens here, and I'd create Terrabytes of information per year. Why don't I? Because it'd have no value to me.

I'd rather wager that the reason the information "amount" increases is better tools for collection, not much else. Like getting a 6 Megapixel digital camera over your 2 Megapixel. Maybe that counts as "more" information, but is it really those extra pixels that have value, or the picture itself reminding you of the occasion?

Kjella

--
Live today, because you never know what tomorrow brings

Re:What's the point of measuring "data"? by telstar · 2003-11-02 09:49 · Score: 1
"I mean, my thesis that I and a friend wrote, a full years work (half year x 2) was 2.2 megabyte, including front page, illustrations, graphs and tables."
- Sure hope you checked your thesis for grammar better than your slashdot posts. It should be 'a friend and I wrote.'
Re:What's the point of measuring "data"? by snarkh · 2003-11-02 10:19 · Score: 1

my thesis that I and a friend wrote
Seems perfectly grammatical to me.

File size bloating. by biolabrat · 2003-11-02 08:22 · Score: 1

Could a large part of this "growth" be file size bloating as opposed to more information being stored? An office XP *.doc file is 3 times as large as a *.txt file (In my quick test of a two page document).

It's all because of "banned" material! by ka9dgx · 2003-11-02 09:35 · Score: 1

Ok.. C:\freedom is taking up 131 Megabytes already... and I expect it to grow, what with DeCSS, the Diebold voting stuff, the DEA report about the Israeli spies, etc...

--Mike--

I produced 400 MBs of data on my computer, by use_compress · 2003-11-02 09:40 · Score: 2, Funny

but I pirated 80000 MBs of data.

Re:rip in the matrix? by JM+Apocalypse · 2003-11-02 09:48 · Score: 1

It doesn't seem that anyone is aware of ROT13 "encryption".

LBH NER VA IVBYNGVBA BS GUR QZPN
YOU ARE IN VIOLATION OF THE DMCA

Here is some more stuff to overcome slashdot's "yelling" lameness filter. Sure, it's like I am actually yelling .... some more junk ... and here it goes... so how's the weather today? If you are seriously reading this, please just stop now. I told you to stop ... You are still reading this aren't you .... ok ... I give up!

--

- - - - - - -
Orppf urp mf y.ppcxn. yflcbi otcnnov C am yflcbi yr n.apb Ekrpatv (Dvorak -> Qwerty)

Data, or information? by ca1v1n · 2003-11-02 10:09 · Score: 1

From an information theory perspective, if you make a copy of a perfectly compressed 100 MB file, the amount of information you have created is just a few bytes, the copy command and the paths of the files, and even then there's a lot of redundancy. I have a large portion of my 60 GB hard drive filled with oggs, but you could just as well describe them all by listing all the albums I've ripped. That would just take a couple k, and would also be highly redundant. I've probably created well over 100 GB of data this year, but I wouldn't say that much of it was new information.

--
WARNING: there is a trojan on your

Well, English isn't my first language by Kjella · 2003-11-02 10:33 · Score: 1

...but the thesis was in English. Though it couldn't have been all that bad since we got an A (about 40% get an A, typically). And yes, I proofread that one a bit more, in fact the abstract I think has the lowest words/manhour ratio of anything I've ever written.

Kjella

--
Live today, because you never know what tomorrow brings

no kidding by SethJohnson · 2003-11-02 11:43 · Score: 1

These kneejerk paranoid kooks need to chill out. Because of their distorted & overly-suspicious perspectives, they interpreted that the 800mb was about each person. It's not, as the parent has pointed out. The article is about how much data is being generated about everything under the sun. This includes BLOGS, books, notes, or what have you. It's unique data, not including duplicate copies of mp3s or DVDs, or whatever else.

--
$5 / month hosted VPS on linux = awesome!

Re:Bonus Cards? by symbolic · 2003-11-02 11:51 · Score: 1

Actually not a bad idea. Collecting data may be invaluable to them, but inaccurate data is worthless.

I made more than that when I by cyberworm · 2003-11-02 16:05 · Score: 1

pooped in the toilet. I make more data than that sleeping. call me when it's over half a terrabyte per person...

Re:redundant? by mabinogi · 2003-11-02 19:37 · Score: 1

doesn't look like it...

--
Advanced users are users too!

And its entropy? by DerPflanz · 2003-11-02 23:33 · Score: 1

Generating 800 Megs is easy. And it says nothing. The researchers better count the amount of 'information' or entropy in bits. But I guess that would be too hard, because a lot of data on my harddisk(s) is duplicated on other disks (for example OS's and programs) or has lots of overhead (think of the huge sizes of a Word document with nothing in it, or the recurring use of the brackets in a XML document).

--
-- The Internet is a too slow way of doing things, you'd never do without it.

What about redundant information? by Big+Nothing · 2003-11-03 01:14 · Score: 1

Seems to me there's a lot of redundant information 'round.

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!

What about redundant information? by Big+Nothing · 2003-11-03 01:16 · Score: 1

Seems to me there's a lot of redundant information 'round.

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!

Please mod parent down! by Big+Nothing · 2003-11-03 01:21 · Score: 1

-1 Redundant.

--
SIG: TAKE OFF EVERY 'CAPTAIN'!!

So, do the bank tellers look at you funny... by Giggle+Stick · 2003-11-03 04:29 · Score: 1

... when you go to cash a check with your tin-foil beanie on?

I'm being facetious, of course. Or is it fascist? I do worry about it a little, what with the Patriot act, etc.

I wonder, have you tried putting some Elmers glue or something on your finger before letting it scan your fingerprint? I bet this could be a fun thing to mess with. I'd be willing to try a gelatin fingerprint transplant like we heard of for foiling finger scanners before. I'll give someone my fingerprint to cash a check at Wachovia. Would this be considered some type of fraud?

data != information by SoulSkorpion · 2003-11-03 15:24 · Score: 1

I love how commentaries on the volume of data generated\transferred on the 'net inevitably provide a handy reference to how many Libraries of Congress it would fill, or how many tonnes of paper it would take to print. Has anyone else noticed that libraries aren't full of pointless posts like this, journals nobody reads, and well, any of these? It's not information, damnit. Archaelogists 2000 years from now are not going to dig up some kid's Angelfire site looking for documentation of our race. (Heh, at least I hope not).

Value of information by mark303 · 2003-11-03 22:10 · Score: 1

Hey Guys does anyone know a resource where u can see the estimated size of information of the internet? By the way Berkley research is quiet interresting, but wouldn't it be more interesting how the value of this information might be/is messured. Easy for everyone to create Information, but what about good information eh?

Slashdot Mirror

800 Megs of Data Per Person Last Year?

133 of 177 comments (clear)