300 Years to Index the World's Information

Longer than expected by powerpuffgirls · 2005-10-09 09:28 · Score: 5, Funny

I always thought 42 years ought to be enough.

Re:Longer than expected by Celsius+233 · 2005-10-09 10:20 · Score: 1

No, I'd say it'd be more like... seven... and a half... million... years.

--
Denham's Dentrifice, Denham's Dentrifice, Denham's Dandy Dental Dentrifice, Denham's Dentrifice Dentrifice Dentrifice.
Re:Longer than expected by Anonymous Coward · 2005-10-09 10:50 · Score: 3, Funny

64 years should be enough for anyone.
Re:Longer than expected by the+real+darkskye · 2005-10-09 10:56 · Score: 1

No, I'd say it'd be more like... seven... and a half

What? Not until next week?

--
Music is everybody's possession.
It's only publishers who think that people own it.
Fuck Beta
~John Lenno
Re:Longer than expected by daveed · 2005-10-09 14:55 · Score: 1

Even evolution doesn't take this long!
Re:Longer than expected by Almost-Retired · 2005-10-09 15:04 · Score: 2, Interesting

42 years, from Douglas Adams HHGTTG? Yes, I expect it will be enough since storage and computer power growth will foreshorten his estimated 300 years. But one possible constraint might exist, that of finding the energy to power all that, and to cool it. But who knows what we'll be using to add 2&2 15 years from now, I don't & won't because I'll probably be returning to dust by then, although some of the whatif press sure seems positive.

On a side note, since they are restricted to doing verbatum the works that are out of copyright, how about we start lobbying our reps to pass a law that says if the material is rights protected by some encodeing where the DMCA prohibits the defeat, and there is not an AUTOMATIC expiration of the restrictions based on the time when the material would pass into public domain, then such material, since it can never pass into the public domain without violating the DMCA, is to have no copyright protections under the copyright laws whatsoever. After all, if it cannot pass into the public domain without breaking the DMCA restrictions, it will never pass into the public domain.

Such material should be granted a copyright ONLY if it can legally pass into the public domain at the end of the copyright period. Put the RIAA and MPAA on notice that they can have their cake ONLY if they don't eat it. One or the other but not both.

FWIW, I do not consider the maintainance of a securely vaulted, unprotected copy of the work to be a valid defense unless this copy is transfered absolutely verbatum, to whatever lossless media is the currently used favorite about every 5 years so that it would become available and usable on the equipment of the time when the copyright does run out, along with suitable high penalities for not meeting their obligations under the copyright statute.

Make it a part of this proposed copyright addendum that the continuence of the copyright is contingent on the court, at someones request, requireing they trot out the equipment in common use at the time, and perform or otherwise show the court that they have an unrestricted copy instantly available in case its copyright should end that day. If they cannot do this, then the DMCA is null and void for that work and the copyright is terminated instantly.

And, the copyright holder going bankrupt immediately causes the material to become public domain since there will then be no one to assure the copyright statute is observed and obeyed. Bankruptcy is too often used as a means to transfer such "property" in such a manner as to cause the ownership trail to be so obfuscated that there is no one in authority to see to it the copyright statute obligations vis-a-vis the transfer to public domain will ever be done. Remove that glaring loophole and quite a few bankruptcies will be stopped.

What say the rest of the /.ers here? Can we do it? Write your reps, on paper, expressing your views on the subject & lets see what happens...

--
Cheers, Gene
"There are four boxes to be used in defense of liberty:
soap, ballot, jury, and ammo. Please use in that order."
-Ed Howdershelt (Author)
99.35% setiathome rank, not too shabby for a WV hillbilly
Re:Longer than expected by Anonymous Coward · 2005-10-09 21:01 · Score: 0
"since storage and computer power growth will foreshorten his estimated 300 years"
- If Google actually did a ‘back of the envelope’ calculation, they would have taken this into account
- computer power growth will also increase the amount of ‘information’ to index. this may (more than?) offset the growth of Google's capacity to index information

The major question is by the-amazing-blob · 2005-10-09 09:29 · Score: 1, Interesting

Does that estimate include how much pocket lint I have?

Seriously, though, why would anyone want to index all the info in the world? That's kinda weird, in my opinion.

Re:The major question is by Anonymous Coward · 2005-10-09 09:34 · Score: 2, Insightful

I agree. At Google's scale and beyond, the concept of 'information' is such a wooly one.

How the hell did they come to that figure of 300 years?
Re:The major question is by michaeltoe · 2005-10-09 09:58 · Score: 2, Interesting

Because once it's all there, you don't have to look for it anymore.
Re:The major question is by jupiter909 · 2005-10-09 11:13 · Score: 3, Informative

They take the rate of current indexing of data, then take the rate at which data is being added to the pile by looking at current trends and possible future trends of people hooking up to the net and adding to that pile, then take the rate at which their systems advance to do the indexing of that pile. They then pass those variables through a custom magic google app and wait a bit and then, tada, the answer 300 is spat out.

You need remember that they could be way off, if some major breakthrough in storage technolgy happens tomorrow all those figure would need be recalculated. At best it is a very very rough idea of how long it is going to take them to catch up to the worlds information and keep it in a current index.
Re:The major question is by NickFitz · 2005-10-09 11:43 · Score: 2, Funny

Seriously, though, why would anyone want to index all the info in the world? That's kinda weird, in my opinion.

New here?

--
Using HTML in email is like putting sound effects on your phone calls. Just say <strong>no</strong>.
Re:The major question is by Kevin+Mitnick · 2005-10-09 12:37 · Score: 3, Funny

They're almost there
Re:The major question is by Anonymous Coward · 2005-10-09 22:50 · Score: 0

But you forgot the key variable in all this - the quantity of information in the world.

That's incalcuable, surely.
Re:The major question is by CreatureComfort · 2005-10-10 02:44 · Score: 1

But remember, that magical google app is still in Beta.

--
"Unheard of means only it's undreamed of yet,
Impossible means not yet done." ~~ Julia Ecklar

10,000,000 years by masterzora · 2005-10-09 09:30 · Score: 1, Funny

Would this include indexing the Ultimate Question? Because last time I checked, that would take 10,000,000 years in and of itself.

--
Remember, open source is free as in speech, not free as in bear.

Re:10,000,000 years by DyslexicLegume · 2005-10-09 09:34 · Score: 1

Not to mention an invasion of privacy.

I'm thinking some people wouldn't want their penis lengths stored in some great vault of information.
Re:10,000,000 years by masterzora · 2005-10-09 09:37 · Score: 1

I think you hit the wrong reply button...

--
Remember, open source is free as in speech, not free as in bear.
Re:10,000,000 years by Anonymous Coward · 2005-10-09 10:13 · Score: 1, Funny

I'm thinking some people wouldn't want their penis lengths stored in some great vault of information.

Particularly the women, but I could be wrong.
Re:10,000,000 years by Hogwash+McFly · 2005-10-09 10:25 · Score: 2, Insightful

What question is that? What happens inside a woman's head?

--
Mother, do you think they'll like this sig?
Re:10,000,000 years by masterzora · 2005-10-09 10:32 · Score: 1

Okay, I don't really care about being modded up or anything (wouldn't even get me Karma if I wanted it...), but how is this redundant? It's the first post I see that mentions the 10,000,000 years...

--
Remember, open source is free as in speech, not free as in bear.
Re:10,000,000 years by Anonymous Coward · 2005-10-09 11:07 · Score: 0

If we knew the Question, we wouldn't have to calculate it.
Re:10,000,000 years by eonlabs · 2005-10-09 11:25 · Score: 3, Funny

Hey, check it out! I'm in the lead!!!! ^_^

--
I wouldn't consider the mad hatter mad. Just reality impaired. He sure can make a mean cup of tea.
Re:10,000,000 years by Billly+Gates · 2005-10-09 11:25 · Score: 1

If you don't know, then they are not going to tell you.

--
http://saveie6.com/
Re:10,000,000 years by bhiestand · 2005-10-10 04:00 · Score: 1

I'm thinking some people wouldn't want their penis lengths stored in some great vault of information.

Hey, check it out! I'm in the lead!!!! ^_^

No, no, no, you've got it all backwards. The larger the number the better.

--
SWM seeks new sig for a brief fling

FUD by CDPatten · 2005-10-09 09:31 · Score: 0, Troll

they just say stuff to stay on the front page of the news. If MS did this kind of thing you anti-ms people would be screaming bloddy murder.

Re:FUD by CDPatten · 2005-10-09 09:48 · Score: 0, Offtopic

once again, anything that isn't I hate ms gets a troll modding.

I don't know... by N1ghtFalcon · 2005-10-09 09:31 · Score: 0, Redundant

We did a math exercise and the answer was 300

Really? I thought the answer was 42?

Re:I don't know... by Anonymous Coward · 2005-10-09 09:37 · Score: 0

Naw, 42 is just the number of people who will make that comment here.

New hardware needed by nizo · 2005-10-09 09:33 · Score: 4, Funny

The hardest part will be developing the hardware that is able to recursively index the Google data itself an infinite number of times.

--
I Am My Own Worst Enemy

Re:New hardware needed by kihjin · 2005-10-09 09:59 · Score: 1

6.8GHz should be enough.

Even so, no advancement in hardware will allow for infinite recursion in a finite amount of time.

--
This slashdot-related signature is a stub. You can help kihjin by expanding it.
Re:New hardware needed by Anonymous Coward · 2005-10-09 10:37 · Score: 1, Funny

Wow. You should read this.
Re:New hardware needed by spuzzzzzzz · 2005-10-09 11:11 · Score: 5, Funny

It's OK, they use linux. It does infinite loops in 5 seconds.

--

Don't you hate meta-sigs?

What About... by Adrilla · 2005-10-09 09:33 · Score: 4, Insightful

Did they take into account the information that is being created as they are indexing? Do they plan on live indexing everything that's being made. Information doesn't stop getting created just because they've stored everything that's already been done.

--

"Plans are for fools! Oglethorpe, the plutonian (Aqua Teen Hunger Force)

Re:What About... by antdude · 2005-10-09 09:39 · Score: 2, Insightful

And information that are deleting.

--
Ant(Dude) @ Quality Foraged Links (AQFL.net) & The Ant Farm (antfarm.ma.cx / antfarm.home.dhs.org).
Re:What About... by htrp · 2005-10-09 09:41 · Score: 5, Interesting

I would assume that it would be to index the collective sum of information, even as it is growing. It's probably a lot quicker to index something than it is to generate it. With probable future advances in computing power and the development of new algorithms, it should be entirely possible that the speed of indexing (which already probably surpasses the speed of information production) would catch up to all the data that still hasn't been indexed.

Think of it in terms of taking a ratio comparison of two infinite series.
Re:What About... by Anonymous Coward · 2005-10-09 12:04 · Score: 0

In 300 years information will index itself. Heck, in 5 years we won't need a central google to do this stuff, indexes will be decentralised and public.
I think 300 years is on the mark. None of us will be around to gloat if they got it wrong.
Re:What About... by barum87 · 2005-10-09 12:05 · Score: 2, Insightful

Most of the information we are creating now is already electronic; therefore it's a lot easier and less time consuming.
Re:What About... by Max+Nugget · 2005-10-09 12:19 · Score: 3, Interesting

Did they take into account the information that is being created as they are indexing? Do they plan on live indexing everything that's being made. Information doesn't stop getting created just because they've stored everything that's already been done.

Funny you mention that. In some versions of Superman, Brainiac, a living computer whose mission is to gather all information about every planet in the universe, entered into the world of villainry because he logically reasoned that the only way he could ever "complete" his mission would be to gather information about each planet and then destroy the planet, since allowing the planet to continue existing would result in a never-ending cycle of new information that would need to be recorded, making it impossible to ever reach a "done" state. Not surprisingly, then, Brainiac's goal is ultimately to destroy the entire universe. :)
Re:What About... by danhirsch · 2005-10-09 13:56 · Score: 1

They could set up two systems...one to index the existing..one to scour and index whats being created....periodically merge...then when they catch up..set both to index the new stuff.
Re:What About... by Anonymous Coward · 2005-10-10 03:24 · Score: 0

You never saw that episode of Futurama, did you? After they gather all the information, they'll destroy the universe, thus ensuring no new information will ever exist!
Re:What About... by Sajma · 2005-10-17 14:12 · Score: 1

The index is out of date; Google's cache still says:
"And information that they are deleting."

Of course.. by Anonymous Coward · 2005-10-09 09:34 · Score: 0

Then you have to index the index, which may very well cause the whole universe to break down.

But... by Bobzibub · 2005-10-09 09:34 · Score: 1

..only 150 years to do it next year.

Re:But... by adam.conf · 2005-10-09 09:56 · Score: 1

On a similar note, however, just as man's processing power increases in accordance with Moore's Law, his appetite and his ability to procure new knowledge will similarilly increase. Further, as some knowledge will always be so "cutting-edge" that it is un-indexable, and some knowledge is inherently unknowable (read Heisenberg Uncertainty Principle), it would appear to be alogical to even carryout such a mathematical excercise.
Re:But... by BewireNomali · 2005-10-09 12:58 · Score: 2, Insightful

i read the article, and this is what I got from it. i could be wrong.

-5 million TB of data.
-170 TB have already been indexed.
-it would take 300 years to index that data and make it searchable.

I don't think it's an exercise to index all knowledge. As you point out, that would be alogical. I think it's more of an understanding of what it would take to effectively and completely serve the world's information needs given current indexing capabilities.

I guess establishing a benchmark currently, both of how efficiently they index information, as well as a general number for the amount of data is out there, they can gauge how efficient they get relative to the rate at which the amount of potentially indexable data increases.

--
un burrito me trampeó.

300 years? by RonnyJ · 2005-10-09 09:35 · Score: 5, Funny

300 years? I'd have thought their other plan would have been a lot quicker.

Re:300 years? by SCVirus · 2005-10-09 10:05 · Score: 0

That plans still in effect... this was just a hypothetical plan B.
Re:300 years? by raoul666 · 2005-10-09 13:51 · Score: 1

Dude...it's both. Watch the futurama with the giant brains. The second one. It all becomes horribly clear. Google saw the episode, and is now emulating the giant brains. Now if you'll excuse me, there's some nasty in the pasty I must attend to.

--
When cryptography is outlawed, bayl bhgynjf jvyy unir cevinpl

I... can't... stop! by Anonymous Coward · 2005-10-09 09:35 · Score: 0

But the mere fact that you're indexing information and the way you're indexing it, that's information. So when does one stop?

I'm indexing data.
I'm indexing how I'm indexing data;
I'm indexinf how I'm indexing how I'm...

I'd like my house indexed by obli · 2005-10-09 09:36 · Score: 5, Funny

How long until Google decides that your house is information? Just imagine an army of small robot spiders invading your home every night, registering the position, name and contents of every single object you own, making it searchable from house.google.com. Unless you nail a robots.txt to your front door, that is...

Re:I'd like my house indexed by Anonymous Coward · 2005-10-09 09:43 · Score: 0

Well, at least then I could find my socks...
Re:I'd like my house indexed by jacksonj04 · 2005-10-09 10:30 · Score: 5, Funny

locate:keys | pocket
locate:phone | pocket
locate:underwear -girlfriend | rm

--
How many people can read hex if only you and dead people can read hex?
Re:I'd like my house indexed by the_hellspawn · 2005-10-09 11:04 · Score: 1

Don't forget the floor plans in the sitemap.xml

--
"The laws of science be a harsh mistress." --Bender
Re:I'd like my house indexed by WilliamSChips · 2005-10-09 11:28 · Score: 3, Funny

locate: girlfriend: no such thing for a slashdotter

--
Please, for the good of Humanity, vote Obama.
Re:I'd like my house indexed by Anonymous Coward · 2005-10-09 17:35 · Score: 0

Did you mean: first-person pornography
Re:I'd like my house indexed by johndierks · 2005-10-09 17:46 · Score: 1

It's one thing to claim slashdotters have girlfriends... The law of averages would stipulate that this is somewhat plusable. Claiming that a slashdotter has a girlfriend and is getting ass on the side... never.
Re:I'd like my house indexed by waterlogged · 2005-10-10 03:07 · Score: 1

Locate:Girlfriend -panties|rm

Access denied. Blocked by root (AKA Wife)

print " sorry, I didn't mean it... it was a joke. Honey, put the frying pan down!!!"

Core.dmp

--
I couldn't fail to disagree with you any less.
Re:I'd like my house indexed by Farmer+Tim · 2005-10-10 04:28 · Score: 1

Surely that should be:

locate girlfriend: fille not found

--
Blank until /. makes another boneheaded UI decision.
Re:I'd like my house indexed by software_trainer · 2005-10-10 09:06 · Score: 1

Doesn't the PATRIOT act allow them to do that now? Except they don't have to tell you that you've been indexed.

--
User Training for Busy Programmers

Carnac the Magnificent by Anonymous Coward · 2005-10-09 09:37 · Score: 1, Funny

*holds envelope to his head*

"42"

*opens envelope and reads*

"What is the number of duplicate posts that will say 'I thought it was 42'."

Re:Carnac the Magnificent by b00tleg · 2005-10-09 22:54 · Score: 0

Fine, 300 is the answer... But whats the question!? haha Oh yeah, how long to index... nm.

A never ending spiral by Anonymous Coward · 2005-10-09 09:38 · Score: 0

Surely if they index everything they will have to index their own indexing, then index that indexing ad infinitum, starting a destructive information vortex wiping out the entirety of civilisation?

i hereby propose by circletimessquare · 2005-10-09 09:38 · Score: 1, Interesting

i hereby propose a new measurement for time: the google year

as we can measure disk space in libraries of congress,

and measure distance in light years...

a google year will be thus: the 300 year span it will take google to morph from geek-friendly search engine to big brother overlord who knows more about you than you do yourself

for example: it has been (2005-1492)/300 = 1.71 google years since columbus first set foot in america

and it will be 1 - (2005-1998)/300 = 0.9767 google years until your great-great-great-great-great-grandchilren will be nothing more than information slaves to the great google dominion

--
intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it

Re:i hereby propose by b100dian · 2005-10-09 10:33 · Score: 5, Funny

...Google indexed it all in 6 days, and took a rest in the 7th...

--
gtkaml.org
Re:i hereby propose by AsmCoder8088 · 2005-10-09 10:55 · Score: 1

We should also add support for chain conversion. -> Unit of measure in land survey equal to 66 feet; 80 chains equal 1 mile

300 years from now? by Anonycat · 2005-10-09 09:39 · Score: 0

I had it pegged at the year 2330. Not too far off.

Re:300 years from now? by Anonymous Coward · 2005-10-09 10:01 · Score: 0

Thank you for that useless bit of information.
Re:300 years from now? by BlueCodeWarrior · 2005-10-09 12:24 · Score: 1

Yes, thank you. Now it can be indexed.

Oooo.... No it's the giant brains all over again by technoextreme · 2005-10-09 09:41 · Score: 1

Let's just hope google doesn't destroy the universe when it's done collecting all of it's information like the giant brains.

--
Ooo man the floppy drive is broken. No wait. The computer is just upside down.

Everybody! by Slashdiddly · 2005-10-09 09:42 · Score: 5, Funny

Please stop creating new information and let Google catch up! You can resume later.

Re:Everybody! by ZakuSage · 2005-10-09 13:26 · Score: 1

On that note, Google has announced that once it has indexed all the world's information, it will destroy the world to prevent any new information from arising to compramize it's massive index.

Yeah right.. by Klowner · 2005-10-09 09:42 · Score: 5, Funny

It's going to take them a hell of a lot longer than that, considering my car keys are always moving.

When I read the summary by colonslashslash · 2005-10-09 09:42 · Score: 5, Funny

I immediately thought of the Futurama episode - The Why of Fry - where the giant brains build the brainsphere and assimilate all the knowledge in existance, before attempting to destroy the entire universe so no new information can be added.

Googlesphere anyone?

--
She's built like a steak house, but she handles like a bistro....

Re:When I read the summary by Anonymous Coward · 2005-10-09 10:51 · Score: 0

Beavers mate for life
Quality carpet warehouses
There are no results about 11 > 4. Not at least one that makes any sense. Maybe in 300 years google will have an entry for that important piece of information.
Or maybe we have to think what the question is... perhaps some planet-sized computer could help us...
Re:When I read the summary by tdemark · 2005-10-09 13:14 · Score: 1

All I know is that Scooty Puff Junior sucks.

- Tony

On a related note... by RyanFenton · 2005-10-09 09:42 · Score: 4, Interesting

I wonder how many man-years it would take to listen to all the music and video that could be indexed. Be interesting at least to find out what the order of magnitute would be - millions, or perhaps billions or trillions of man-years of unique recorded audio and video? It would have to be a game of gross estimation - but it would at least put into perspective how much material is out there, even if most of it is boring "security" footage, compared to the scope of our lives.

It'd be interesting, if, perhaps in a couple generations, we could have a cheap media volume that contained "recorded media, prehistory - to - 2050ad"... if the media that exists today even survives a couple generations, and copyrights aren't extended indefinetly. The idea of an indexing system that can even put all that information into a meaningful context would be fascinating to consider though, if it could be possible.

Ryan Fenton

Re:On a related note... by lukej · 2005-10-09 13:45 · Score: 1

Sounds alot like a Fermi Problem
Re:On a related note... by kbahey · 2005-10-09 14:06 · Score: 1

a cheap media volume that contained "recorded media, prehistory - to - 2050ad

This is indeed possible. If I think back 20 years, and how much information was digitized vs. today, it is just plain staggering.

Think about Britannica, Project Gutenberg, CCEL, Perseus, Alwarraq, and many thousands of other sites that offer reference works in digital form. Think about the commerical CD-ROMs that have that.
All that in just 20 years.
So, 100 more years, we can have all the heiroglyphs off of Egyptian temples, all the Sumerian tables, all the manuscripts in hundreds of languages in libraries around the world.
Yes, and all those silly blogs too ...

--
2bits.com, Inc: Drupal, WordPress, and LAMP performance tuning.
Re:On a related note... by Mad_Rain · 2005-10-10 02:23 · Score: 1

I wonder how many man-years it would take to listen to all the music and video that could be indexed.

Probably a lot longer than you think.

"Goddammit, I can't watch or listen to any more of this crap! I'm going out for a cup of coffee and a smoke."

--
"What do you think?" "I think 'What, do you think?!'"

sarcastic by Anonymous Coward · 2005-10-09 09:44 · Score: 0

The comment was obviously made tongue-in-cheek. You readers need to start reading things critically and asking if it even makes sense.

Competition? by psst · 2005-10-09 09:44 · Score: 4, Interesting

From the article:

Of the approximately 5 million terabytes of information out in the world, only about 170 terabytes have been indexed, he said earlier during his speech.

Storing 5 million terabytes has got to cost a lot of resources. It would be very inefficent if every competing search engine stored that much data. Makes me wonder if it would make more sense to nationalize Google's index and share it amongst competitors (just like it makes more sense for goverments to build airports and share them amongst airlines rather than every airline building its own airports).

Re:Competition? by drrngrvy · 2005-10-09 09:50 · Score: 1

What, governments having the monopoly on information? Sounds like a great idea; if people are already getting jittery about a single company defining what's 'important' and what's not, then things can only get better by having Dick Cheney having the keys to the Library of the United States (LOTUS). ;)
Re:Competition? by Shihar · 2005-10-09 10:29 · Score: 4, Insightful

Nationalize Google? Are you joking me or just insane? You want to take one of the most innovative and successful companies that the US has right now a nationalize it!?

I have a better idea, how about you just send out a government hit squad to kill to put a bullet between the eyes of single entrepreneur in the US. It will accomplish the same sort of freeze in the growth of innovative small businesses but look far less insane.
Re:Competition? by Hoch · 2005-10-09 10:33 · Score: 1

Believe me, the benefits of competition almost always outweigh the benefits of monopoly. Second, why should the government be allowed to steal google from its owners? Don't try to pretend that that would be imminent domain. When people try to destroy capitalism, it makes me sick, what else works?

--
2*31*37*263
Re:Competition? by claes · 2005-10-09 10:54 · Score: 1

Indexing all the world's information would be kind of stealing too.
Re:Competition? by ajdecon · 2005-10-09 11:30 · Score: 1

Makes me wonder if it would make more sense to nationalize Google's index and share it amongst competitors (just like it makes more sense for goverments to build airports and share them amongst airlines rather than every airline building its own airports).

Or, better yet, Google or another indexer could index all the information itself, thus driving competitors out of business. The difference between your cases is that while governments build airports to share, Google has already built the index which would be "nationalized"--therefore taking away almost all the company's benefit for having done so. Good for all their competitors (free information!), but incredibly bad for the company that did the work.

--
"Science is a way of trying not to fool yourself." -Richard Feynman
Re:Competition? by rm999 · 2005-10-09 11:34 · Score: 1

you forget that storage capacity increases exponentially with time. In 30 years we will have 15,000-20,000 terabyte drives (assuming that capacity doubles every two years and current drives are 500 gb). At that point we could easily hold 5 million tb of information.
Re:Competition? by Halfbaked+Plan · 2005-10-09 12:00 · Score: 4, Interesting

Oh, come on. You're talking about a company that is mostly an advertising enterprise now. Who is Google hiring? Admen and their ilk. It's sometimes depressing how enamored the 'community' had become in a company whose main purpose is leveraging eyeballs to look at their ads.

(how DARE I say anything bad about Google. Mod this down IMMEDIATELY.)

--
resigned
Re:Competition? by Anonymous Coward · 2005-10-09 13:03 · Score: 0

Second, why should the government be allowed to steal google from its owners?

So they could give search contracts to Halliburton?
Re:Competition? by Tony+Hoyle · 2005-10-09 14:30 · Score: 1

However at that point there will be 5 billion tb of information to index.

Google will never finish their task, because the amount of information in the world (or at least raw data.. not all of it is really information) is increasing much faster than it's possible to index it.
Re:Competition? by rm999 · 2005-10-09 14:53 · Score: 1

I agree that they will never be able to complete this task, because it is ridiculously impossible. how are they going to get their hands on an ancient chinese text whose only copy is in some guys attic in bejing?

But assuming the problem is indexing 5 Mtb of information that is already available on the internet (songs, movies, whatever - just not things that haven't been digitized because this introduces the need to have humans working) it will be possible in the next 50 years. Both computation power and storage are exponential whereas the amount of new information is constant (or at best log) which would make the total amount of information the integral of linear or log, which is exponential. Therefore, at some point (probably less than 30-40 years) google would be able to do have an index of 5 million tb
Re:Competition? by argStyopa · 2005-10-09 15:07 · Score: 1

I know, then lets internationalize the internet by handing over control to the UN.

I'm *positive* these moves would be at least equally beneficial to society as a whole. They should be done together!

(w00t for Stalinist centralization and the power of the state! Go MAO!)

--
-Styopa
Re:Competition? by HAMgeek · 2005-10-10 02:16 · Score: 1

That might be a good idea IF the government had built their index. It makes sense with airports because, gee. The gov't built the airports. Doing such a thing with google would be tantamount to theft. But then again, when was the last time something was nationalized and made more efficient? National gov't is only useful for national defense, interstate road construction, and similar large public works beyond the ability of local gov'ts. Even then, it's rarely if ever efficient. Look at the dept of homeland security. Evil bastids flew airplanes into buildings and what was the reaction of gov't? Wrap yet another layer of beurocracy around the problem. It reminds me of how oysters wrap layer after layer around an irritant. However somehow we've not yet been able to achieve the same results.

--
"Just because you do not take an interest in politics doesn't mean politics won't take an interest in you." --Pericles
Re:Competition? by danila · 2005-10-10 03:13 · Score: 1

Private enterprise is not better than state-funded organisations. The inefficiency of the government is a libertarian myth. The very Internet that Google is based on was created by the government.

P.S. Mod me down, but public ownership of the means of production always was better, is better and will always be better. Workers of the world, unite!

--
Future Wiki -- If you don't think about the future, you cannot have one.
Re:Competition? by ahodgson · 2005-10-10 10:11 · Score: 1

I'd mod you funny, but I'm almost convinced you're serious.

Any country that believes that crap usually ends up with their "workers" standing in line trying to buy the only potato in the city, and never actually doing a whole lot of work.
Re:Competition? by Shihar · 2005-10-10 19:20 · Score: 1

You miss my point. My point is not if Google is good or bad. My point is that nationalizing successful companies at their peak while they are doing a perfectly good job at what they do is like shooting the market in the head and throwing the body into a pit of spikes. Arbitrarily nationalization any business that is doing good kills all entrepreneurial drive. You might as well tell all the entrepreneurs and people willing to take a risk to invest in such start ups to go fuck themselves. No sane entrepreneur or venture capitalist would spend a second in a market where the government punishes anyone who does well by seizing their business. No sane person would start a business in the US. Hell, they would move to Europe to start businesses.

You know your economic policy is broken when people flee to Europe to avoid nationalization.

That said, all of this is moot. The president who would authorized the nationalization of Google or any successful startup without it being a matter of life and death would be executed by the market, then executed by the public for fucking up the market so badly.
Re:Competition? by Shihar · 2005-10-10 19:27 · Score: 1

Shit, someone should have told the USSR and communist China that public ownership produces an economy of pure pwn4g3. It seems that the USSR got confused and collapsed despite the dandy state of its public economy in the early 90's. China on the other hand apparently forgot to tell its people that they are in fact not starving to death in the 60's and 70's because of their marvelous collective command economy. For some reason they got it in their head that millions were dying as their economy fail to provide the trivial basics, like food.

Command economies are a quick and easy way to mass starvation and crippling economic devastation.
Re:Competition? by danila · 2005-10-10 20:48 · Score: 1

Wrong. You see, I actually lived in the Soviet Union and live in Russia now. And it doesn't take a Ph.D. in economics (which incidentally I do have) to note that economic situation today (and for the past 15 years) is drastically worse than it was before 1990. As a matter of fact, you can just listen to the President to hear him admit just that.

About one-third of Russian people are starving today (i.e. they are not getting enough nutrition according to WHO health norms). The average caloric intake has been decreasing since 1990. Almost every economic indicator you can think of is worse today than it was in 1990 (with the exceptions of number of cars, mobile phones and Internet use). The freedom is only nominal, people travel abroad as much as they did in the Soviet Union (not more), the circulation of newspapers is much less than it was, the self-censorship is prevalent and so on and so forth.

Soviet Union economy was pure pwn4g3. It started that way in 1930s (and it was admitted by Financial Times, Nation and Western media in general), it continued that way through the 1960s and 1970s and 1980s. If you consider the fact that the climate in most of the Soviet Union is much worse than in Western Europe and the US, when you consider that Soviet Union never exploited third world countries chanelling wealth from them, then you must realise that the planned economy was extremely efficient. Also, if you consider real-world quantitative indicators, such as fuel used per 1 ton*km of cargo transported, number of loading/unloading operations per 1000 km of transit, energy used per unit of output, you will realise that the planned economy was in fact more efficient than the capitalist market economy of the United States.

Such comparisions, however, are totally Unamerican and are doubleplusungood to make in the US. So you must immediately forget that I said it and never question the authority.

Please continue believing in myths about the Soviet Union, such as the myth that millions of people were dying because of lack of food. You must believe that it weren't isolated cases of famine caused by the Civil War devasation and exceptionally bad harvests in 1920s and 1930s. No, you must believe that in 1970s and 1980s people were starving. Who cares that this is total bullshit? Who cares that people are starving today in Russia, with free market and capitalism? Nobody cares about that.

--
Future Wiki -- If you don't think about the future, you cannot have one.
Re:Competition? by Shihar · 2005-10-11 03:22 · Score: 1

Pointing out that a vaguely totalitarian government that switched from being a communist state to an oligarchy is failing is not exactly a damning indictment of capitalism. The transition away from a planned economy was done in a terribly manner that let a few basically loot whatever wealth Russia had left over from its collapse. Russia's "managed democracy" as Putin likes to call it ensures the continuation of an oligarchy and cripples the democratic mechanisms of the nation that are supposed to act as a check against such blatant abuse. You don't need to argue with me that the current oligarchy held intact by Russia's "managed Democracy" has utterly failed the Russian people.

The Soviet Union's economy was never good. It was always terribly inefficient. Was the West scared shitless of the Soviet economy? Hell yes. Why? The West was convinced that the Soviet economy was kicking ass for the same reason why the Soviet's thought it was kicking ass right up to the point where it collapsed. The West was using spies to read the same economic reports that the Soviet leaders were reading. The problem of course was that economic reports were pure works of fiction. Soviet factories massively over reported their productivity to appease politicians.

As to starvation of the Soviet Union, it is true that the Soviet Union had all but eliminated starvation by the 80's. Of course, this had more to do with the fact that it was because they had all but abandoned collective farming and received large amounts of food aid from the West then the efficiency of planned economies.

Whatever the case, comparing the Soviet Union to the Russia of today and using that as proof that democracy and capitalism are failed systems is at best dishonest. Russia is not a democracy and it isn't a free market. Russia is closer to the Chinese system then it is to the American or European system.
Re:Competition? by danila · 2005-10-11 04:23 · Score: 1

As usual in a discussion, you have a choice - either to repeat baseless accusations that have no ground in reality, but which make you feel very confident and smart, or to actually try to understand the facts available, the reasoning of the opponent and reconsider your position. So far you seem to be choosing the first option... :(

The Soviet Union's economy was never good. It was always terribly inefficient.
I have gave you several real economic indicators (that can be looked up with some effort). You chose to dumbly repeat the "it was inefficient" lie. Do you have any data at all? What exactly economic efficiency indicator was much lower in the USSR than in the US? I dare you to find such indicators.

As to starvation of the Soviet Union, it is true that the Soviet Union had all but eliminated starvation by the 80's. Of course, this had more to do with the fact that it was because they had all but abandoned collective farming and received large amounts of food aid from the West then the efficiency of planned economies.
The paragraph quoted above consists of a bunch of lies.
1) The Soviet Union has eliminated starvation by the 1950s. It last happened naturally in 1930s and in 1940s the Nazi Germany burned and pillaged half of the country. But after the 1950s there wasn't and couldn't have been starvation. Again, if you stop repeating empty false claims and instead look at the figures, you would realise that Soviet Union was in world's top ten in food consumption (for most food groups, with the exception of milk and meat it actually beat the US at the time).
2) There is absolutely no evidence that Soviet Union "abandoned collective farming". This is patent nonsense! Of course, it haven't done that, collective farms were more strong than ever, the production was growing each year, significant investments were made and the Soviet agriculture was actually in a great shape, especially considering the climate, lack of government subsidies (a la Western Europe) and relatively low mechanisation.
3) Soviet Union haven't been and couldn't be receiving food aid in the 1980s. Are you smoking crack? Soviet Union did import small amount of food, but the amounts were insignificant compared to total production and the idea that it was receiving aid is ridiculous. Of course, you won't be able to provide any references, so we must assume that you just invented this lie.

Whatever the case, comparing the Soviet Union to the Russia of today and using that as proof that democracy and capitalism are failed systems is at best dishonest. Russia is not a democracy and it isn't a free market. Russia is closer to the Chinese system then it is to the American or European system.
And you are an expert on Russia? Or an economist? Right? Or not?

If you don't like the comparision, what do you suggest instead? What should we compare the Soviet Union with? The Czarist Russia? Soviet Union wins? The modern Russia? Soviet Union wins again. The third world countries? Another win for the Soviet Union. The United States, an imperialist country exploiting half of the world and consuming a disproportionate amount of world resources, thanks to its global military domination? Well, if you look deep into this last comparision, you would actually see that Soviet Union may still win. Yes, the per capita consumption is lower, but mostly because there were no millionaires in Soviet Union. However, there was almost no poverty either.

But you don't care about facts and real figures, so I won't tell you how many people had access to free education and free health care in Soviet Union. I won't tell you the amounts of money spent on free public housing, I won't tell you per capita nutrition indicators, I won't cite growth in life expectancy, I won't talk about sport and scientific achievements, no...

Let's just close our eyes and repeat the thinktank-invented government-sanctioned media-spread capitalist mantra: "The Soviet Union's economy was never good. It was always terribly inefficient." And I hope you don't forget that "The United States was always at war with Iraq". Don't miss your Two Minutes Hate, your Prosium injection and don't forget to attend the daily brainwashing session.

--
Future Wiki -- If you don't think about the future, you cannot have one.
Re:Competition? by Shihar · 2005-10-11 18:19 · Score: 1

Okay, I surrender. You win. The Soviet economy did not collapse. The Soviet Union was a happy fairytale land where all was right and magical. ...but still I am curious as to what exactly is your explanation is as to why the government was toppled and all the satellite states that the Soviet Union had gobbled up ran like hell to get as far away from Russia as they could? What is it that provoked the happy and joy filled little empire to crumble practically over night?
Re:Competition? by danila · 2005-10-11 22:59 · Score: 1

First, it didn't happen overnight. The seeds of destruction were planted as early as in 1960s. The destruction of the country was started by Gorbachev in 1985 and a small minority of people saw where it's leading the Soviet Union.

The main reason for the collapse is the wealth- and power-hungry elite. The leaders in Soviet republics wanted power for themselves, so they played the nationalism card. Dissidents in the center wanted to use every opportunity to weaken the Soviet Union, so they supported the nationalists (this lead to bloody wars in Armenia, Moldova, Chechnya, etc.).

The elite in Moscow wanted to have money, to live like Western capitalists do. Just look at the Gorbachev and his wife. They embraced the lifestyle of American presidents, changing suits and dresses every day, which was never accepted before. All Soviet leaders before Gorbachev led a relatively modest life. This was, of course, not the direct cause for the collapse, but an indicator. Parts of the Soviet nomenclature wanted to control the wealth, so they allied with the organised crime and started to channel the national wealth into their own pockets.

Economic growth in Soviet Union continued until the very 1990. Soon after that the planned economy was destroyed and a huge race to steal the most assets started. It started with rare-earth metals, tractors, trucks, wood, fertilizers, etc. In 1993-1995 it continued with wholesale theft of huge Soviet enterprises during the privatisation.

So in a sense the government toppled itself because a number of factors combined to make it possible, primarily the election of Gorbachev the Secretary General. It would have never happened if, for example, Andropov lived long enough.

And there is no joy today in the former republics. For example, in Georgia (the proud independent marionette of the US) more than 80% of people would like to leave their country. The poverty in the former republics is simply abhorrent. Yes, they believed the lies of their nationalist leaders, just like all Soviet people beleived the lies of their Soviet leaders - "let us pass a few laws, destroy the horrible Soviet system and you will all live as well as Swedes or Americans".

Obviously, this is a simplistic account. It doesn't say anything about the role of the United States that led a 40-year long Cold War against the Soviet Union, it doesn't say about many factors. But the simple truth is that Soviet Union didn't collapse because it was too bad. It was destroyed because it was too successful.

--
Future Wiki -- If you don't think about the future, you cannot have one.

I'm curious... by DeepBlueDay · 2005-10-09 09:46 · Score: 2, Interesting

How is 'information' defined in this context? Is a thirteen-year-old girl's blog considered information?

Re:I'm curious... by Anonymous Coward · 2005-10-09 09:52 · Score: 2, Funny

The blogs of thirteen-year-old girls are examples of the recently discovered negative information. If more young girls can be encouraged to write this will actually reduce Google's workload.
Re:I'm curious... by obli · 2005-10-09 09:57 · Score: 1

If there's someone looking for a blog of a thirteen-year-old girl, Google is probably determined to help them find one.

I think Google's problem will be determining which information is actually worth indexing, I for one wouldn't give a damn if they opened a search index for lorem ipsum text.
Re:I'm curious... by Hogwash+McFly · 2005-10-09 10:05 · Score: 2, Insightful

Only if it includes her home address.

--
Mother, do you think they'll like this sig?
Re:I'm curious... by vidarh · 2005-10-09 10:37 · Score: 5, Insightful

I take it from that comment that you don't see much value in a thirteen year old girl's blog? What about a thirteen year old girls diary?
Like Anne Frank's?
Fact is, it's incredibly hard to determine today what will have value tomorrow. Most of those thirteen year old girls (or 20-something geek guys) blogs will have no historical value. But some of those people will grow up to have a profound impact on the world (or they may not grow up, but still have a profound impact, as was the case with Anne Frank). It may be ten years from now. Or 50.
Who knows what the writing they do now might tell us about what brought them wherever they end up? When people write diaries on paper chances are reasonable they'll survive and show up in an attic somewhere. But as more and more content get online, we also risk facing the loss of entire generations worth of many types of information to bit rot and simple lack of foresight.
Re:I'm curious... by Hurricane78 · 2005-10-09 10:53 · Score: 0, Offtopic

If that girl is your future president, then... ;)

--
Any sufficiently advanced intelligence is indistinguishable from stupidity.
Re:I'm curious... by DeepBlueDay · 2005-10-09 11:02 · Score: 1

The question was not loaded. I was merely asking. Good point, BTW.
Re:I'm curious... by Anonymous Coward · 2005-10-09 12:25 · Score: 0

Yeah. Look what happened to Bill and Ted.
Re:I'm curious... by Vellmont · 2005-10-09 12:50 · Score: 4, Interesting

I think the parents question is perfectly valid. What is considered "information"? I'd consider a blog information, but is a painting some random artist creates included in this list of "information"? Is my laundry list information? How about my individual handwriting in my laundy list?

The question of is something valuable isn't exactly an either-or proposition, but a matter of assigning a probability that a certain piece of information is valuable. Couldn't we agree that say the presidents day to day activities are more likely to be important in 100 years than say a single 13 year olds blog? Does that mean that 13 year olds blogs are worthless? Well no, but they aren't the thing I'd first choose to preserve.

The question I have is, is the greater difficulty in control over online information balanced by the greater ease of keeping it around? Google doesn't delete messages from email for this very reason. We tend to throw stuff away because it takes up too much space, or because it just becomes clutter. But with increased storage space every year and better ability to keep track of it (and seperate it from things we consider important), why ever throw away information?

Online information portability is obviously a problem. How do you move someones blog somewhere else, and have it mean anything in say 50 years? I think these problems will be solved as people expect information to be more portable and standardized. The solutions I think will come from the short term portability and needs rather than a few people wanting to preserve something for the next 100 years though. Many people make the assumption that standards are short lived things that are here today, gone tommorow. I'd have to disagree on a historical basis. How old are reel to reel tapes, and you can still find a player at say a thrift store. CD-audio has been around for 25 years and is still the default medium for music today. Ascii has developed I don't know how long ago and yet still is quite popular and if you have a computer that can't read it, you've got a fairly useless computer. Standards have a way of sno-balling and gathering momentum to live on a long time.

--
AccountKiller
Re:I'm curious... by Anonymous Coward · 2005-10-09 13:56 · Score: 0

And the right pics, if you know what I mean.
Re:I'm curious... by Lucractius · 2005-10-09 14:09 · Score: 1

with the capricious nature of art. id say that random painting has a likely chance of one day being worth a bucket load of money.

So wouldnt it be nice to have its entire history on file!

--
XML - A clever joke would be here if /. didn't mangle tag brackets.
Re:I'm curious... by Hurricane78 · 2005-10-09 21:51 · Score: 0

Hey, this is not offtopic. If you're unable to understand it, then STFU!

--
Any sufficiently advanced intelligence is indistinguishable from stupidity.

Assumptions made by Slashdiddly · 2005-10-09 09:46 · Score: 1

Near the bottom of the article they list a few assumptions made in the math exercise the Google conducted. Among other things they assumed that computer power will half every 18 months and that computers 30 years from now will be so large that they will fill an entire room.

copyright? by bcrowell · 2005-10-09 09:47 · Score: 1

Hmm...so he's assuming the U.S. government will let 1923 copyrights expire some time before 2305 AD? Or has Google, after emasculating Google Print in response to lawsuits, suddenly found a new legal strategy?

--
Find free books.

300 Years? Feed Those Pigeons! by Comatose51 · 2005-10-09 09:47 · Score: 4, Funny

Obviously they're not feeding those pigeons enough. Time to buy some quality feeds Google. Maybe even slip in some uppers every now and then. If all else fails, maybe it's time to consider the parrot upgrade. They're a lot more expensive but their index/poop ratio is much better.

--
EvilCON - Made Famous by /.

what is considered information? by bwy · 2005-10-09 09:48 · Score: 3, Insightful

I'd like to see their definition of information. Certainly, a lot of things that are already of common interest are on the net. Occasionally, I find things that aren't available online but the greatest majority of the time google is able to find what I want.

To further the example: at work we have several filing cabinets that haven't been opened in years. There are lots of papers and stuff in there, I can vouch for that. Some might consider it "information." But in reality all that stuff could be burned and I doubt it would make the slightest difference in the way the future rolls out. None of it is stuff that would ever be needed by an IRS audit or anything like that either. Does google consider this kind of stuff as part of their efforts? Because I think they can safely ignore it.

I Call Bullshit by Anonymous Coward · 2005-10-09 09:49 · Score: 3, Funny

It's going to take 300 years to index the grammer and spelling mistakes on Slashdot alone.

Re:I Call Bullshit by Hogwash+McFly · 2005-10-09 10:09 · Score: 1

Quite.

--
Mother, do you think they'll like this sig?
Re:I Call Bullshit by TRS80NT · 2005-10-09 10:35 · Score: 3, Informative

It's grammar.

--
Lorem ipsum dolor sit amet.
Re:I Call Bullshit by Anonymous Coward · 2005-10-09 13:24 · Score: 0

The 'whoosh' sound you just heard was the joke flying over your head.
Re:I Call Bullshit by Anonymous Coward · 2005-10-09 17:48 · Score: 0

I would hope he would have misspelled more than one word if he was attempting that joke.

Google really has no idea what it is talking about by Argonne · 2005-10-09 09:49 · Score: 0

Google really has no idea what it is talking about. There are too many variables to take into account, and they have no idea what the future will bring. Going back 30 years to 1974, look at the predictions made then. How many came true? Did any come true at all? Specific to the Google situation, it is easy to think of a few things that could throw the 30 year prediction into the wind: proliferation (increase) of privacy/DMCA/related acts, removal of such restrictions, computer speed improvements.

Won't happen... by mbone · 2005-10-09 09:51 · Score: 1

I don't think they'll ever get there. I would argue that the exponential increase of information and Google's indexing are both paced by Moore's law. As Google's indexing ability increases, the amount of information created will also increase, at least as fast, and probably faster. Unless you believe (and, who knows, the Googlites might, and that may be where the 300 years comes from) that Google will expand to fill the human universe, then Google will be able to keep up, never really get ahead.

Not the Moore model but the Bono model by tepples · 2005-10-09 09:52 · Score: 2, Interesting

No, the proper model is not Moore's law but Bono's law. If it takes 300 years now, then it'll take 320 years in 20 years, and most of the time will be spent waiting for exclusive rights to expire (if they ever do). For instance, indexing a literary work that's out of print and not widely available at libraries requires getting a new copy, and those aren't available until the copyright runs out.

Makes no sense by bobintetley · 2005-10-09 09:52 · Score: 4, Insightful

We did a math exercise? What exercise?

To estimate the time involved, you surely need to know the size of the information involved (don't quote me that bunkum about 170 terabytes in TFA - yes I did read it), and to know the size you need to know what all the information is, which you can't (and surely new information is created all the time?).

This translates as "I pulled my finger out my ass, waved it in the air and came up with 300 years."

Re:Makes no sense by LnxAddct · 2005-10-09 11:19 · Score: 1

Given a 10x10x10 box you can calculate how long it will take you to "scan" the entire box for any contents within. You can calculate it without knowing anything other then the size of the box. Now we already know to within extremely small margins of error how much recorded data there is, in a worst case you can simply extrapolate the average amount of information generated per generation going back to the dawn of man writing and drawing. There is a pretty good curve that is followed. In newer generations you also have to account for non-written data, i.e. sounds and videos. This curve (unfortunately I can't find the resource right now) has been strictly followed and is growing exponentially, obviously. Estimate how much data there is now, figure out its growth rate, you know the rate you can index it and assume you'll only get faster indexing capabilities as time goes on, and then you can get a pretty good estimate as to how long it would take. If you can't index it faster than it is generated then you'll never completely index everything, at least they've shown that they believe it can be indexed fast enough. It would by no means be within a year of accuracy, but maybe within a few decades. You have to start somewhere so don't blast Google for trying, at least they are thinking ahead 300 years from now rather then just their next quarterly earnings report.
Regards,
Steve

Zeno's paradoxes, googlified. by waltznumber3 · 2005-10-09 09:52 · Score: 0

In all technicallity there's always new information, so it would take infinately long to index an infinite amount of information asuming that it takes a finite amount of time to index a unit of information. Although, I supose if they wait until it takes an infinitly small amount of time to index a unit of information then it would be possible in a finite amount of time. I never understood why Achilles didn't just shoot an arrow at the damn turtle anyway. Oh wait...

--
If you just took anything I said seriously, read it again.

Quickly! Index faster... by David+Horn · 2005-10-09 09:54 · Score: 0, Redundant

Can someone call Google and ask them to hurry up indexing the world's information, specifically, my bedroom. I need my socks...

--
PocketGamer.org - For the gamer on the go!

filtering by Snorklefish · 2005-10-09 09:54 · Score: 1

Of the approximately 5 million terabytes of information out in the world... the vast majority of it can probably be labeled worthless. Focusing on some of the world's great libraries should accomplish the task of disseminating 95 percent of what people want to know and dispense with more than a few terabytes of bunkem.

how can google claim they index the web? by Anonymous Coward · 2005-10-09 10:03 · Score: 0

Kind of proves that the google index is not complete, doesnt it? Yet they claim they have the entire web indexed when you search?

And the Winner is... by bitspotter · 2005-10-09 10:05 · Score: 1

300 years to index all the info extant *today*?

So how much more information will exist by then? Is it growing faster than Google can index it? Think about how fast that much information came into being in the first place.

What a silly question.

Re:And the Winner is... by Fermatprime · 2005-10-09 10:12 · Score: 2, Funny

Well, it's just Zeno's paradox. Let's say it takes them 300 years to index all today's information, then another 150 to index all the new infomation, then another 75... By 2605, all information to that point will have been indexed by Google. Then they can start indexing the FUTURE.

--
I hate the one hundred and twenty character limit for signatures with an all-enveloping, all-destroying, incredible pass
Re:And the Winner is... by Sepodati · 2005-10-09 11:41 · Score: 1

Couldn't you make the assumption that after a certain point, everything that's then created would be already indexed? Like how more and more is automatically created for an online world and is automatically indexed (by something). So you'd really only have to index the past up until that point...

(ignoring that this is probably an off-the-cuff joke, anyhow).

---John Holmes...
Re:And the Winner is... by Prof.Phreak · 2005-10-09 14:48 · Score: 1

I think that's the point behind 300 years. It will take them this long to 1) index everything that's there, and be able to keep up with newly created information.

(ie: in one of their papers they said something along the lines that the rate at which humans can produce information isn't growing as fast as their ability to index it, and eventually they'll be able to keep up with the rate of all information everyone on the planet produces. I guess that's where that 300 years comes in. I'd imagine it would take them -way- less time to index everything that's there right now.)

In any case, 300 years is just a big guesswork number.

--
"If anything can go wrong, it will." - Murphy

Now the real question is.... by SensitiveMale · 2005-10-09 10:09 · Score: 1

As we go into the future will the time to index

1) Take longer because more info is created faster then the ability to index it?
2) Take less time because processors, storage, and databases get faster?
3) Take the same amount of time because data and and the ability to index it grow at the same rate?

Re:Now the real question is.... by Tony+Hoyle · 2005-10-09 14:46 · Score: 1

I suspect 1.

Every time google indexes a piece of information, you have a new piece of information (the fact that google has indexed it), so the amount of information to index has not decreased, it has remained the same & merely changed form.

There is also other information (eg. the amount of information that google has indexed, the amount of free space they have left) that is constantly changing and requires reindexing. Unfortunately storing that information causes it to change... quantum information anyone??

You also have to appreciate how *fast* information is generated in the real world - the human brain is an efficient information processor, and that still throws away 90% of everything it encounters. Imagine the amount of information generated in the entire world! Even in a small thing... The fact that I'm typing this post now, the time that I'm doing it, the tune I'm listening to whilst doing it, my typing speed, the room temperature, my exact location, my pulse rate...

There may be different opinions ... by Anonymous Coward · 2005-10-09 10:11 · Score: 0

on how long it will take. For some reason I am reminded of the SCO case. One of the longstanding arguments is about how long it will take to examine all the code that IBM has produced in discovery. The estimates range from extremely long to not very long at all.

I suspect that it very much depends on how you do it. In the SCO/IBM case, some experts have done convincing analysis that didn't take very long at all. I suspect that what is lacking here is a very clever algorithm (and, no, I'm not going to suggest one).

Keyword is "them" by gmuslera · 2005-10-09 10:12 · Score: 1

What about splitting the task and combining the resulting information between competing companies? Different search engines have a lot in common (in indexed pages) but also a lot that is convered by one but not by the others.

Of course, not all info is in the web, nor all info in the web is accessable by search engines, and even not all info accesable by search engines can be searched (think in graphics with text, i.e. just scanned books, or flash presentations with the actual content), but still that number looks like a bit too much for me.

Coming Soon: by VeganBob · 2005-10-09 10:13 · Score: 0

Google Millenium.

--
Being funny is my sig nature.

How much of that would be - by Elad+Alon · 2005-10-09 10:14 · Score: 1

How much of that would be Slashdot's news about Google's every move? I guess around 10 years.

Slashdot Breaking News!!! Google CEO buys new underpants! Stay tuned for their size and colour.

--
News for merdes. Shit that matters.
Ask me about my sig.

Re:How much of that would be - by fbjon · 2005-10-09 12:51 · Score: 1

Their location can be found on local.google.com. Colors and other specifics on various spots can be found on maps.google.com. Buy the classic Google Brand Underpants on Google Wear at wear.google.com. See it in 3D in Google Panties! Buy pro version for the women's version add-on! Use Google Sets to get the whole wardrobe!!
M'kay, time to go to sleep..

--
True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.

Future Knowledge? by arpk4n3 · 2005-10-09 10:16 · Score: 1

One minor detail, though. If it will take 300 years to index knowledge currently present, then what of knowledge to be produced in the next 300 years? I mean, not even everything from the year 2004 has been indexed yet. Knowledge is infinite, therefore cannot be completely indexed--it would be best for us to interpret the "300 years" quote as a sort of geeky joke.

Of course it is! by yoink23 · 2005-10-09 10:19 · Score: 1

All it was was a cheap joke made off the cuff...Mr. Schmidt is laughing at all of you right now. This is NOT news.

--
This too shall pass.

webcams and other continuous data collectors by G4from128k · 2005-10-09 10:21 · Score: 3, Interesting

This analysis must exclude entire categories of continuous data collection devices such as webcams, data loggers, OS log files, sensing equipment etc. All jokes aside about porn on webcam's, I can imagine that future historian would love such a rich data source on how people lived their lives, what they have in their surroundings, etc.

The point is that many current systems spew a huge volume of low value (but nonzero value) data (multiple MB or GB/day/device). The lack of storage means most of this is not captured and is thus never indexed.

Even massive companies can't keep all their data. Wal-Mart stores on the order of 460 TB in their data warehouse, but only has room for the last 13 months of data or so. At 138 million customers per week, they only have room for a paltry 59kB per customer per week.

--
Two wrongs don't make a right, but three lefts do.

Re:webcams and other continuous data collectors by hopethisnickisnottak · 2005-10-09 18:19 · Score: 1

I can imagine that future historian would love such a rich data source on how people lived their lives, what they have in their surroundings, etc.

I can imagine the poor fucker wailing about having to wade through all the useless bits of information to get the snippets of useful information he wants. Indexing is far from perfect.

--
-Shaunak
Re:webcams and other continuous data collectors by Eivind · 2005-10-09 18:28 · Score: 1

They could trivially store more if they wanted to, it must just be that there's no ROI on storing online data older than a year or so.
You can buy a terabyte of harddisk-capacity for like $500 these days. So 500TB, while sounding impressive is actually disks for $250.000. Storage requires more than just the raw disks. A rule of thumb that often works out ok is to add an order of magnitude to convert "raw disc" into "enterprise storage", that still mean a cost in the ballpark of 2-3 million.
Assuming your numbers are accurate, to finance this storage in a week, they'd need to take maybe 2 cents from every customer that week.
Furthermore, 59KB to store info from *one* customer for *one* week should be plenty. a 64-bit integer to index into their table of items is plenty. (a 32-bit would do, they don't have, nor will they ever have 4*10**9 products, but you design these things with space to grow) A typical shopping would then need to store only maybe 50 of these integers each with qantity, along with time-and-date and which store and so on. Should be trivial in 1KB. That leaves atleast 1, possibly 2 orders of magnitude for various indexes and other data.
I agree that noone can store "all" their data. (depending on your definitions of "data" offcourse), but storing a complete record of *all* transactions that has ever taken place in your company is trivial, even for WalMart.

a small margin of error by CupBeEmpty · 2005-10-09 10:22 · Score: 3, Informative

I think it is important to remember that this was a math exercise not a serious study with predictive power. I remember several years ago people thought the human genome project was insane. They thought it would take hundreds of years to catalog our entire genome and cost some ludicrous numbers of trillions of dollars.

Then:

In 1999, the goal of producing a "working draft" seemed very far away, with less than 15 percent of the genome sequenced. If the accelerated goals had not already generated a sense of urgency in the consortium, a decision by the sequencing center leaders at a February meeting in Houston would. At the meeting, the leaders accepted Dr. Collins' challenge to ramp up their efforts to produce a "working draft" by spring of 2000. By January 2000, the centers were collectively producing 1,000 base pairs a second, 24 hours a day, seven days a week, and 2 billion of the human genome's 3 billion base pairs were sequenced by March. At a White House ceremony hosted by President Bill Clinton in June 2000, Dr. Collins and J. Craig Venter of Celera Genomics, which had carried out its own sequencing strategy, announced that the majority of the human genome had been sequenced. [from here

I tried to find the graph of speed over time because I have seen itseveral times. It shows the exponential increase in the speed of the project. Apparently there are many scientists that believe with techniques as they are now we could repeat the project in 2 years if we started over. The indexing of information could have a very similar timeline. Very slowly at first and then as technology and specific methodology develop off you go. So the truth is... this is a guess. I wouldn't put too much faith in it.

Re:a small margin of error by eonlabs · 2005-10-09 11:15 · Score: 1

OK, so lets take a look at a 'Math Exercise' and see how far off it is from their estimate.

Google started in September 1998: http://www.google.com/support/bin/answer.py?answer =4866&topic=367&hl=en

They had a Terabyte of storage at the time, which I'll assume was maxed out with information as a starting point (it obviously couldn't have been, but for the sake of an argument): http://www.google.com/corporate/history.html

Now, 6 years later, they have 170 terabytes of data indexed. With processing power doubling every 2 years, cost of storage dropping at about the same rate, as well as assuming their ability to target customers with ads improving linearly, we can probably assume their growth will be exponential. (This is where I make my big guess)

Lets say that they've grown following the equation C*(e**(t/k)). C is at time 0, which is 1TB. We find k by saying 1TB*(e**(6yrs/k))=170TB.

6 years / (ln(170TB/1TB)) = k
k = 1.168 years ish.

so following the same growth rate, when does 1TB(e**(t yrs/1.168yrs))=5,000,000TB

ln(5,000,000) * 1.168 years = 18 years.

Please check my math and post if you disagree or have a different estimate.

I'm also not accounting for the fact that we are going to be producing information at an increasing rate as well, which could mean they will never catch up.

so it all depends on how you pick the equations.

They're probably taking a much more conservative look at potential growth.

--
I wouldn't consider the mad hatter mad. Just reality impaired. He sure can make a mean cup of tea.
Re:a small margin of error by AxelBoldt · 2005-10-09 12:22 · Score: 1

Rather than spidering websites, which is what they have done so far, they will have to scan real world books. So data about past growth cannot be used to estimate future growth; it's not even clear that scanning technology will grow exponentially.
Re:a small margin of error by Rick+Genter · 2005-10-09 14:07 · Score: 1

You're assuming that the amount of indexable information stays constant over time. It does not. As others have argued, the index itself is indexable.

Assume your equation is roughly correct however. We should be able to compute how much information would need to be indexed for it to take 300 years. 1TB * (e ** (300 yrs / 1.168 yrs)) = 3.533 * 10 ** 111 TB.

I gotta go buy stock in DVD-R companies ;-).

--
Don't underestimate the power of The Source
Re:a small margin of error by Prof.Phreak · 2005-10-09 14:55 · Score: 1

Funny thing about the human genome project is that they'll never be done. I'd imagine that every few years they'll have announcements saying "we're completed the hgp, etc.", but in truth, they may only be approaching 99.9% done.

It's really one of those things where 95% of the work is relatively easy (well, `several year's worth of work'), and the other 5% are what takes the rest of the century.

(there's also the question of -understanding- it as opposed to just writing it out to a file; and I'm sure that will take a -while-).

--
"If anything can go wrong, it will." - Murphy
Re:a small margin of error by eonlabs · 2005-10-09 18:04 · Score: 1

fair enough. So it depends on what we can consider data, and what the development of additional technologies beyond web dev alone are capable of. The question is how much of this can be run simultaneously. Parallel processing is an interesting aspect of google that allows it to speed up operations that would normally take significantly longer. Of course, indexing *current* human knowledge by hand could take millennia...

--
I wouldn't consider the mad hatter mad. Just reality impaired. He sure can make a mean cup of tea.
Re:a small margin of error by eonlabs · 2005-10-09 18:10 · Score: 1

You're forgetting that you can reference the index itself, since the index is itself the largest possible subset of the index. So you wouldn't need the data redundancy.

Re: dvd companies...I concur.

--
I wouldn't consider the mad hatter mad. Just reality impaired. He sure can make a mean cup of tea.

Google Purge by kflash15 · 2005-10-09 10:25 · Score: 1

Does that 300 years timeframe include the destrucion phase outlined in this article...

Or is this actually a "Duke Nukem Forever" type of timeframe???

Better Hurry it Up by tavilach · 2005-10-09 10:25 · Score: 1

I estimate the destruction of the earth to be within 300 years, so Google better hurry it up! From there, someone's gotta cram all the information on a flash drive and get the hell off this planet! Mars?

--

"Give me a lever long enough and a fulcrum on which to place it, and I shall move the world." -Archimedes

But how long is that... by NMZNMZNMZ · 2005-10-09 10:26 · Score: 1

in libraries of congress per terasecond?

Foundation series anyone... by Anonymous Coward · 2005-10-09 10:27 · Score: 0

I expect old Harry Seldon to appear before they get it all indexed.

Looking at their calculations... by xquark · 2005-10-09 10:27 · Score: 1

I think they forgot to carry the one, that makes the real time required to be a mere 3000 years... :)

Arash

--
Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.

was he joking ? by flynt · 2005-10-09 10:27 · Score: 5, Insightful

"We did a math exercise and the answer was 300 years," Schmidt said in response to an audience question asking for a projection of how long the company's mission will take. "The answer is it's going to be a very long time."

Since this was in response to an audience member's question, does anyone else think he was joking? Because it is such an outlandish question from an information theory and modeling point of view, perhaps he was mocking it? "Ah yes, we just came up with an equation and it should take 294.59 years." I think this also makes sense in light of his next comment, which was made on a more serious note. I interpret it, "We really didn't use an equation, it will obviously take a long time though." This is how I understod his comments, and I may be wrong, but it wouldn't surprise me if some reporter picked up on this "joke" and put it up as "news".

Re:was he joking ? by The-Pheon · 2005-10-09 10:57 · Score: 1

flynt you are my hero!
----------
Dynamic DNS from ThatIP
Re:was he joking ? by The-Pheon · 2005-10-09 11:05 · Score: 1

And Flynt, learning how to brush your teeth with a manual brush will be indispensible after the revolution.
------------
Dynamic DNS from ThatIP
Re:was he joking ? by Anonymous Coward · 2005-10-09 11:41 · Score: 0

my sonicare actually broke about 6 months ago, so i have been back to the manual!
Re:was he joking ? by wakdjunkaga · 2005-10-09 16:50 · Score: 1

I'd like to see the equations and assumptions used. It must have been figured for massive increases in capability as time passed, because (given that the starting figures are correct, and that in itself appears dubious) my math works out quite differently.
The article claims 170 TB has already been indexed. Lets say this was since Google turned on the lights - late 1997, so call it 1998, or about 7 years. 170 TB/7 years is roughly 25 TB/year. 5,000,000 TB (remaining to be indexed)/25 TB per Yr is 200,000 years give or take.
By this you'd need to index 17000TB per year to cover 5 million terabytes in 300 years.
Re:was he joking ? by khallow · 2005-10-09 21:02 · Score: 1

Don't forget an exponentially increasing ability to process data. Though we're also ignoring that the data in question is probably growing at a similar rate. So somewhere between 300 years and never is probably the correct answer. :-)

It will take... by nicobn · 2005-10-09 10:32 · Score: 1

The spins of all the electrons in the universe to store the spins of all the electrons in the universe...

Re:It will take... by eonlabs · 2005-10-09 11:32 · Score: 1

Half are +1/2 Half are -1/2 All are distributed with uniform randomness across the full of the universe, clustering in areas with positive charge (for normal electrons, negative charge for positrons, which are also considered a type of beta particle and must be included in the count of electrons to make the first statements accurate)

--
I wouldn't consider the mad hatter mad. Just reality impaired. He sure can make a mean cup of tea.
Re:It will take... by fbjon · 2005-10-09 13:10 · Score: 1

I conclude that the average information content of the entire universe is roughly zero, with a more or less uniform distribution. I have already indexed all of this on a blank piece of paper in my pocket. Do we really need more precision than that?

--
True confidence comes not from realising you are as good as your peers, but that your peers are as bad as you are.

How about pr0n? by mynickwastaken · 2005-10-09 10:35 · Score: 0

Only 300 years?!
Obviously, he doesn't know khow much pr0n is out there.

300 years? No way! by Evil-Dragon · 2005-10-09 10:36 · Score: 1

I hardly believe it'll take 300 years to index all the world's information, take the distributed computing project i'm with at the moment. They're hoping to use distributed computing to index every webpage, which when you look at when United Devices started their cancer research project they managed to do millions of hours worth of work without it taking millions of years ;) Take a look at http://www.majestic12.co.uk/ and see what I mean. They haven't got it indexing yet, but they're getting there!

Re:300 years? No way! by Tony+Hoyle · 2005-10-09 14:39 · Score: 1

It's easy to do millions of hours work without it taking millions of years.

1 million hours work done sequentially only takes 114 years. Divide that by only 1000 processors (and there are commercial systems with more than that) and you're down to months.

Boy that totally fucking makes sense by Anonymous Coward · 2005-10-09 10:40 · Score: 0

That totally fucking makes sense because everyone knows that the amount of knowledge we have STAYS THE SAME.

Who comes up with this shit science?

Hmmm by vandalman · 2005-10-09 10:40 · Score: 1

I guess this Internet thing is getting big

--
Devise, Repair, Solve, Build

299 years, 364 days by kentrel · 2005-10-09 10:41 · Score: 1

Just for all the blogs

So what by Anonymous Coward · 2005-10-09 10:42 · Score: 0

Really, is indexing the way to go. Basically indexing the world's information is basically applying government (i.e. classify and index) to the universe, which IMO is a waste of time. Life isn't just a index--there's got to be a better way.

Two things to consider by Slant675 · 2005-10-09 10:42 · Score: 1

I'm certain that 100 years ago mapping the stars was an insane concept. Today, we're able to do it to a more greater extent than we were even able to conceive 100 years ago.

Admittedly the number of stars have not changed, but the distance we are able to see has. This means that there are a lot more stars to map now than there was 100 years ago...

Where am I going with this? The stars to astronomers, as information is to Google, have become increasingly complex over time but so has our ability to sort them out.

I can only assume that over the next 50 or even 100 years we will have developed new methods of sorting data. Once upon a time, the Dewey decimal system was "da bomb." Now, a SQL database would be able to dig through that information in about two seconds instead of a minute or two (although it may be much less in some cases).

So it may be a heck of a lot less than 300 years as time goes by. I just hope we DON'T stand still technologically. :/

--
Slant
Between the Spaces

Why it will take 300 years by commodoresloat · 2005-10-09 10:43 · Score: 0, Troll

The software they've designed to do the indexing will only run on Longhorn.

Re:Why it will take 300 years by bumptehjambox · 2005-10-09 13:38 · Score: 1

So Longhorn is expected in 2035?
None of us are going to be alive for this, so what does it matter?
We should all do our part to make it even harder for them to accomplish this goal, and pass it on to our children.

I peer into my crystal ball by RoboPimp_3000 · 2005-10-09 10:48 · Score: 1

Also in 300 years:

Robot Hookers (prostitution no longer illegal! But it will hurt a lot more)
All females will be genetically engineered to look like Jessica Alba.
Slashdot will still be passing off Google non-news as important information

How to backup email by igny · 2005-10-09 10:53 · Score: 1

I just gzip my Inbox and send it to myself every month.

--
In theory there is no difference between theory and practice. In practice there is. - Yogi Berra

Shampoo advertising by Ch*mp · 2005-10-09 10:54 · Score: 1

How can Google stay in the news?
How can we persuade people that Google will still be around with work to do in 3-5 years time?
How can we persuade people that Google stocks are good for the forseable future?

Pseudoscience, 'complex' statistical analysis, oh - and one stooge in the audience...PR's finest toolset.

What the question really was by hackwrench · 2005-10-09 10:57 · Score: 1

I keep having people asking me questions about computers that indicate they don't understand the reality involved. For example, they say something like, "Do you put your own programs in there?", with no idea about writing programs, starting with a program somebody else wrote and modifying it, or simply downloading a program.

google in 300 years by click2005 · 2005-10-09 11:02 · Score: 1

What about how much space it will take up? By that time, data storage and compression will have advanced to the point where the entire google index fits on a single USB 4.0 nano-flash device.

Also, by then google will have so many indexes they'll need an index of google indexes. You'll go to googledex to find out which google to search on.

--
I am a free slashdotter. I will not be modded, blogged, DRM'd, patented, podcasted or RFID'd. My life is my own.

Re:google in 300 years by telstar · 2005-10-09 12:30 · Score: 1

In 2305? USB 4.0? More like US-what?

There are a lot of areas where essentially this... by hackwrench · 2005-10-09 11:06 · Score: 2, Interesting

question is asked, and they seem to miss that the answer is that it is it's own index.

Yeah Right. by Adam+Avangelist · 2005-10-09 11:06 · Score: 1

Random Conversation

Q: What is your favorite color, you said it was orange?

A: I changed my mind, now its clear.

from a logical point of view by ronsta · 2005-10-09 11:10 · Score: 2, Funny

Please correct me if I am wrong, but wouldn't solving a problem like this create more information than you previously had? And wouldn't you have to index that information and so on and so forth?

Also, can someone explain to me how you even approach something like this from a mathematical model point of view? How did the 170 terrabyte number even come up? Aren't there different definitions for what constitutes 'information?' Also, who the hell spent their 20% on this problem when there was integral code for vital programs to write, such as Google Suggest and Google Suggest in Japanese?

PLEASE, SOMEONE EXPLAIN BEFORE I GO OFF INTO MORE OF A FLAMEBAIT RANT!

--

Martini Glasses

Re:from a logical point of view by Halfbaked+Plan · 2005-10-09 12:07 · Score: 1

It's all hype. Google is a hustle operation now. They provide a 'free' search service, still, but the ad revenue is where it's at. So Schmidt has to come up with PHB bullet points on a regular basis.

And the fawning masses lap it up.

--
resigned

Precision! by Palal · 2005-10-09 11:11 · Score: 1

Isn't everything information? Every single bit (i.e. piece) of every single thing contains information (just like the Matrix). So what precision is this 300-year estimate set at? If we have infinitely-small precision, we have infinite amount of information, which is simply infeasible to catalogue.

--
-Palal

Naturally That Isn't True by Master+of+Transhuman · 2005-10-09 11:11 · Score: 1

With the development of nanotech-based computers, that time will be cut to probably less than fifty.

The real problem of course is getting access to the stuff that isn't digital already. Still, nanotech will probably enable more effective scanners in the next fifty years as well. Rather than using once or twice removed stuff like lasers and IR beams, mechanical scanners will actually crawl the object to be scanned - meaning that anything will be scannable, including three-dimensional objects eventually.

Waste of time to make predictions about the future when you don't know what technology is feasible and what isn't.

Of course, CEOs - even those like this guy - haven't a clue about areas outside their specific field.

Making statements like his is just stupid.

--
Richard Steven Hack - This sig is TOO GODDAMN SHORT TO DO ANYTHING USEFUL WITH! MORONS!

Re:Naturally That Isn't True by Anonymous Coward · 2005-10-09 13:52 · Score: 0

What bullshit. Everyone knows that in twenty years time, little green men will fly out of your ass and catalog the worlds information using magical bees crafted from recycled millionare heiresses. And once they've done that, they'll sack all the ignorant CEOs in the world and replace them with Slashdot posters like yourself who are clearly more qualified.

Besides... by Belial6 · 2005-10-09 11:21 · Score: 1

With the current state of "IP" law, it will be illegal to index any new information anyways. So, once the corporation that owns it decides to delete it, it won't count.

My guess: by imsabbel · 2005-10-09 11:23 · Score: 3, Informative

Stuff like this (or years ago for LHC) is most likely following approach:

They astimated an amount of information that is "all information", like 480 000 Exabyte or so.
Then they look at their current capactity (storage and database cpupower) and just interpolate moore's law into the future and look when the demand will be met.

Of course, for stuff like the LHC that only interpolates 10-20 years into the future such a thing is possible, but 300 years? He should read up about the singularity...

--
HI O WISE PRINCE. WHT TOOK U SO DAM LONG?

Re:My guess: by TeknoHog · 2005-10-09 13:40 · Score: 1

This is extrapolation, not interpolation.
Interpolation means looking between your data points, and extrapolation means looking outside the data set. Interpolation is generally much more reliable and trivial than extrapolation. In particular, when you're dealing with a time series of data, it's easy to spot a trend in past events (e.g. Moore's "law"), but harder to predict whether that trend continues.

--
Escher was the first MC and Giger invented the HR department.
Re:My guess: by Punboy · 2005-10-09 13:55 · Score: 1

Actually, I would assume they aren't using Moore's Law since it only applies to transistor technology. They probably take the current rate of expansion (represented as a mathematical equation) and project into the future.

--
If you like what I've said here, and want to read more, go to http://www.krillrblog.com

Define "all" ... by jabberwock · 2005-10-09 11:25 · Score: 2, Insightful

Noted that:

"Of the approximately 5 million terabytes of information out in the world, only about 170 terabytes have been indexed, he (Schmidt) said earlier during his speech."

So ... how many terrabytes of info will be produced in the next 300 years, and does anyone really think that Google (and anyone, or everyone) could keep up?

Especially, once all 20 billion people who live in the Solar System are video-documenting every moment of their existence ...

OK, so I project and exaggerate ...

Despite Slashdot panning it... by hackwrench · 2005-10-09 11:34 · Score: 1

They do have patents:
http://atomchip.com/_wsn/page2.html
Throw the numbers there into:
http://patft.uspto.gov/netahtml/srchnum.htm
The EU site is below:
http://www.european-patent-office.org/
Which lead me to here:
http://www.espacenet.com/getstarted/index.en.htm
Which is where I stop before submitting this message to slashdot.

Scientific and Mathematical bunk by markdj · 2005-10-09 11:42 · Score: 2

The article gives us no facts that we can use to verify the claim. Without a definition of information and a definition of indexing one cannot take this for accurate. There are many definitions of information and except that used in "Information Theory", which is a message received and decoded to its original form, I don't know of any definition that has sientific or mathematical rigour. In fact, in my opinion, Information Theory is a misnomer and is more properly called Communication Theory since it is about getting a message properly communicated, NOT about whether its contents are useful. Additionally, information as understood by most comes in many forms and types and each may require different ways of indexing. Finally, aren't the indexes information that needs to be indexed? How do you keep from recursing?

nonsense by stiefvater · 2005-10-09 11:49 · Score: 1

he's making the classic mistake of assuming progress is linear. we've seen time and time and time and time and time again that progress is exponential (or better.)

human genome project?

Moore's law?

K.

300 years later... by LinuxRulz · 2005-10-09 12:05 · Score: 1

In 300 years:
google: the answer is 42
people: the answer to what?
google: hmmm... sorry, can't remember!

Uh oh by harlows_monkeys · 2005-10-09 12:18 · Score: 2, Funny

Uh oh...someone needs to visit Applied Cryogenics and knock 700 years off Fry's timer then.

Google has indexed / the web is only 170 TB? by Captain+Perspicuous · 2005-10-09 12:31 · Score: 1

170 TB is nothing! Is it really not more? 170 TB can be saved by you locally if you really want. Well, let me explain:

For 100$, you can get 250 DVD-Rs if you look hard. This is about 1 Terabyte. 170 TB = $17'000. The web can be yours for a tiny weeny little amount of money if you think about it. Isn't that incredible that the storage industry keeps making those medias cheaper and cheaper and us humans can't create new media fast enough? Wow, I'm baffled.

A fatuous answer to an imprecise question by Jeremy+Singer · 2005-10-09 12:41 · Score: 1

What resolution of data will you index? Suppose that satelite imagery improves, and we can resolve to the nearest centimeter. Does it make sense to index the positions of each of the dandelions on my lawn?

Maybe you're thinking about this? by Schwarzchild · 2005-10-09 12:48 · Score: 1

Robot Exclusion Protocol courtesy of ftrain.

--

"sweet dreams are made of this..."

What if "$#^$^" can do it in 3 years ? by managedcode · 2005-10-09 12:55 · Score: 1

Eric is talking taking into consideration GOOGs technology. Tomorrow scientists from Berkeley might announce that they have come up with a new technology and can index the world's info in 3 years. GOOG will be history.
Cheers!

Accelerating returns? by ellcry · 2005-10-09 12:57 · Score: 1

Lets not forget the estimated century geneticist predicted it would take to decipher the human genome. It took less than a decade... If anything should be obvious at this point, making predictions based on current trends in technology is futile at best. I think I'll mail google a copy of 'The Singularity is Near' to celebrate their 9th birthday.

It would still be incomplete by melvo · 2005-10-09 13:05 · Score: 1

As proved by Goedel. QED.

MOD PARENT DOWN -- BASHING GOOGLE!!! by Anonymous Coward · 2005-10-09 13:06 · Score: 1, Funny

This is your psudo-mod-bitchslap-sarcastic reply right here.

--Anonymize me up Scotty!--

Futurama..... by artoffacts · 2005-10-09 13:23 · Score: 1

Beavers mate for life
11>4
For quality carpets, visit Kaplan's carpet warehouse

Now the Infosphere will open it's protective crust so as to scan itself, completing it's 300 year task.

I thought the answer was 42. by wcrowe · 2005-10-09 13:49 · Score: 3, Funny

Ask a stupid question...

--
Proverbs 21:19

I doubt it by MichaeLuke · 2005-10-09 13:59 · Score: 1

300 years, huh? That reminds me of a story I heard about how Michigan's trees all got chopped down. It seeems people did a study and determined that it would take about three hundred years to log all of Michigan's forests. Since the oldest trees in our state are only about three hundred years old, by the time they got done clearing the upper peninsula, the trees in the south would be as large as they were when logging began. So away they went.
Then came along an early technological innovation: the two-man saw. Trees could be cut down much faster than with axes. They then proceeded to chop everything down in a rather shorter timespan, and only about eighty or so acres of three hundred year old tree are left.
Isn't Google full of super-geniuses or something? Who do all sorts of clever things with computers? Something tells me somebody there already has this figured out.

Wouldn't it be easier by Allnighterking · 2005-10-09 14:01 · Score: 1

just to digitize the card catalog at the library of congress?

--

I'm sorry, I'm to tired to be witty at the moment so this message will have to do.

What If by Yehooti · 2005-10-09 14:17 · Score: 1

As in the ancient libraries as that of Alexander, or if Google becomes our Oracle of Delphi, all of our world's accumulated knowledge is stored in modern media and some calamity prevents our updating or refreshing of that huge data bank for a several generations or so?

It would be nice to think that after going to the great trouble of accumulating such a vast store of knowledge, it would be backed up in ways not so vulnerable to degradation with time. Is there a medium that can contain such a vast amount of data but is not badly degraded after a century or two?

If Google accomplishes this task, I hope they do the next step, and archive it in some manner where it can be totally readable to those a hundred years in the future with a recovering technology.

They're brute-forcing an exponential problem by Anonymous Coward · 2005-10-09 14:21 · Score: 0

The amount of information to index is proportional to the number of humans alive. Since humans reproduce exponentially, the problem becomes exponential.

Yahoo should take the smart way and use von Neumann machines to complete the job.

Come to think of it, they're already employing several hundred von Neumann machines right now. They just need to uhh... "put them to work", if you will? The only caveat is that the resulting von Neumann machines can't be allowed to create any information to index, otherwise the whole advantage will be lost.

The up-side to this solution is that once the world's information is finished being indexed, there will be a generous surplus of donor organs and blood......and brrraaainns.

Human Genome by Hao+Wu · 2005-10-09 14:24 · Score: 1

It was supposed to take a similar order of time to sequence the human genome. To the joy of scientists (and the dismay of life-long government grant winners), Celera did it in just a few years.

--
I suggest you read Slashdot

Distributed Indexing by DocSavage64109 · 2005-10-09 14:41 · Score: 1

Why not just have every computer index itself. You could then have everything on the internet indexed in days.

300 Years Makes Sense for 5 Exabytes and growing.. by Praxiteles · 2005-10-09 15:13 · Score: 1

Print, film, magnetic and optical storage media produced 5 exabytes of new information in 2002 according to these Berkeley researchers. They have been estimating the amount of information created each year and how much it is growing.

Google probably used their figures. Factor in the speed of Google's spiders and the current rate of their growth - and I am sure 300 years is very close to the answer they found with their "math exercise."

False alternative by ChrisMaple · 2005-10-09 15:19 · Score: 2, Insightful

Private ownership of an airport does not mean that it would be owned by an airline. Even if an airport were owned by an airline, that does not mean it would serve only that airline. (It would not be in its best interest to do so.)

Practice has shown that government ownership and operation of airports is inferior to private ownership.

--
Contribute to civilization: ari.aynrand.org/donate

Indexing the Porn by ozTravman · 2005-10-09 15:32 · Score: 2, Funny

He didn't clarify that 299 years of that was indexing all the Internet porn sites.

Re:Indexing the Porn by hutkey · 2005-10-09 23:17 · Score: 0

no i think, those many years will be required to index only all the 'information' we can get from a single porn film :-)

Re:Oooo.... No it's the giant brains all over agai by jrockway · 2005-10-09 15:59 · Score: 2, Funny

Don't worry, I just ordered a pizza for I. C. Wiener.

--
My other car is first.

Indexing Information No Longer Relevant by zod2008 · 2005-10-09 16:14 · Score: 0, Troll

After 2008, Google-As-God will no longer worry your minds. Under my power, they will be but tiny figments of your imagination. Googlers will submit to me and kneel. As will you. So vote for me: http://zod2008.com/

Re:300 Years? Feed Those Pigeons! by ggvaidya · 2005-10-09 16:14 · Score: 1

Well, pigeons is an improvement over snakes. On the other hand, as a camel devotee, I think your parrot idea has much to commend it ...

Human Genome? by NetNinja · 2005-10-09 16:34 · Score: 1

What was the original estimation on the Human genome project?
10 years?

Blah! Don't believe the hype.

When Google is done. . . by liquidMONKEY · 2005-10-09 17:17 · Score: 0

They'll have some music to accompany them.

it's the definition of "index" that's a problem by Quadraginta · 2005-10-09 18:11 · Score: 2, Insightful

Look, the problem is not how much data there is in the world, the problem is finding a general automatable algorithm for organizing it in such a way that J. Random User can rapidly find what he's looking for.

Stroll on down to the nearest university library. It's got a lot less information in it that Google is considering, and aboutt a hundred thousand man-years over a few centuries have gone into finding clever ways to organize it all: card catalogs, shelving systems (e.g. Dewey and his decimals), nowadays searchable electronic catalogs, reference books, specialized indices for law and science and medicine, citation indices, reviews, reviews of reviews...and so on and so forth forever. And yet, it can still be immensely difficult to track down a particular piece of information you want. Even if it can be done, often it takes a fair amount of expertise in a field just to know where to look. Where do you find public information on patents for desalination processes? How do you find out if anyone has synthesized a polymer resin that melts between 130 C and 150 C and is resistant to acid, with a tensile strength about X? What was the common law meaning of "ownership in fee simple" in 1680s England? Even to start looking for the answers, you often need great experience in the relevant field, so you know where to start looking -- the "search terms" we might say.

Google may be feeling its oats because they can now very rapidly provide the most obvious things people want -- directions to San Diego from Ukiah, the times and places Serenity is playing on Sunday, the lead story of the New York Times "Style" section last Sunday, or the names and addresses of the six pizzerias closest to me still open at 11:25 PM.

But this is utterly small potatoes compared to the problem of organizing information generally, so that it is useful to professionals during the weekday as well as for amusement on the weekend. It is first, generally speaking, an unsolved problem -- no library or information index I've ever used fails to have at least one frustrating "feature" that leaves me scratching my head, wondering what the heck the designers were thinking. Secondly, I very much doubt Google has the depth of professional expertise in-house to even begin to figure out how to organize all the giant repositories of information in law, science, engineering, literature et cetera in such a way that professionals can use them, let alone amateurs.

And finally, they don't have the money to do it, and it will be very hard for them to raise it. Indices have suffered from this problem for a long time: any given user will only pay a very small price per search, but it costs a huge amount to make the index. Heretofore, makers of indices and dictionaries and references have relied on selling them at very high prices to libraries, which in turn raise the money in small bits from their patrons, or taxes. But Google would cut out the library middleman -- you search directly. So how are they going to cover their costs? They've no easy way to charge you $0.005 every time you do a Google search, for example.

In short, this sounds like the 21st century equivalent of that 1950s nuclear energy braggadocio, "energy too cheap to meter." Call it "information too cheap to meter." Color me skeptical.

Re:300 Years? Feed Those Pigeons! by Zardus · 2005-10-09 18:58 · Score: 1

But without the poop, where will they get the materials for their page background???

--
You can mod your friends, you can mod your nose, but you can't mod your friend's nose.

Many problems with this comments by LS · 2005-10-09 20:18 · Score: 1

As an earlier poster said, this might be a joke, and probably is considering who's mouth it came out of. There are serious issues with this. First, what is considered information? You could consider the approximate location of every molecule in the universe a piece of information. Perhaps just text could be considered - that would narrow down the field of information into the finite. But Google does mapping, so they already have broken out of just text. Then you have the fact that the amount of information (whatever that means) supposedly grows at a geometric or exponential rate (I don't remember which), and I'm sure a google spider won't be there waiting for every new bit of information to be generated. Giving an answer to how long it will take to index all information is almost like saying "How much does purple weigh?"

LS

--
There is a fine line between being a cultivated citizen and being someone else's crop. - A. J. Patrick Liszkie

I for one by Rungchen · 2005-10-09 21:07 · Score: 1

Welcome our new(?) indexing overlords..

--
You can get it fast, you can get it good, You can get it cheap. Pick two!

Nazi Germany by paragonc · 2005-10-09 21:24 · Score: 1

thus begins the thousand year reign of the third reich !!!!! be scurrd!!!!

Fortunately... by nathanh · 2005-10-09 21:31 · Score: 1

... once Google has completed its 300 year mission, it will reduce the Empire's fall to barbarism from 30,000 years to a mere 1,000 years.

Ok indexing is one thing by TarrySingh · 2005-10-09 22:14 · Score: 1

but will they own it too? AS I'm sure if Erik is planning World Domination, he'd be long ded by then!

--
Scott McNealy to Michael: "Suck my Sun!" Michael Dell to Scott : "Lick my Dell!"

search by nut · 2005-10-09 22:45 · Score: 1

What I want to know is... How long will it take to search the whole world's information?

--
Never trust a man in a blue trench coat, Never drive a car when you're dead

It all depends on what you call information by ACORN_USER · 2005-10-09 23:29 · Score: 1

There is a lot of none-sense out there and duplicate information which would be redundant to consider more than once - or at all. What I foresee is a trip further down the road - well we are talking about 300 year time scales - where information is represented more by some kind of semantic network - more of an auto-classifying semantic web - than the current textual level representation. Interpreting meaning - or meaning type - will eventually be key to mining out what is really worth indexing.

Google are harvesters of everything from geographical to academic info - what I'm suggesting is that the *.google idea be broken down into the */*/*/.google. Hell, you might even be able to subscribe to data which is published as pointless/statements/indexing/future/google and one day get back this page! That said, if I were a clever indexing bot of the future, I would advise myself not to index this particular post.

Perhaps Google and its cache are really the start of Megadodo ( google -> first-person -> megoogle -><dyslexia> -> megooddle -> <illiteracy> -> megadodo )Publications, one of the great publishing houses of Ursa Minor Google-Beta and its Hitch Hikers Guide to the Galaxy. All wrapped in pretty e-ink digital paper. Has there ever been a better example of pruning redundent data in the whole history of the world than:

"Did you mean Ear wigs?

Search Results:

1 - Earth - Mostly Harmless"

23:59 on Dec. 31 2099 - GoogleBot is SelfAware by ACORN_USER · 2005-10-10 00:12 · Score: 1

Don't be so sure. Let's see. Google teams up with Nasa. Google involved in searching human-genome. I'm sure that the Google Research Centre also has a spin on Mind Transferance. 300 Years? That's nothing in the lifetime of a sentient google-bot.

How many companies last 300 years? by gnugrep · 2005-10-10 00:53 · Score: 1

Of all the companies that were around in 1705, how many are still in business today? Does anyone know? Off the top of my head I can't think of any. Think of how much the world has changed since 1705, will google even be relevant 300 years from now?

300 light year by mercedo · 2005-10-10 02:08 · Score: 1

300 years later We will have information of 300 years ago

--
Ancient Greek Philosophers -18c Enlightenment Thinkers -Slashdotters

Maybe a faster way by mattr · 2005-10-10 16:25 · Score: 1

1. Recognize they will NEVER index even all significant information in the world today, because a) info growing too fast and b) we are not yet at the point that everyone is using networked data for everything.

2. Data can only be indexed as it is produced, otherwise you are fighting a losing battle with diminishing returns. It looks like 300 but it is really asymptotic approaching "when the universe cools down and we live in a big black hole".

3. Google spends money to increase recognition among public and government of the need to archive for posterity and scientific/cultural advancement. This includes legal basis for the indiscriminate archiving for society's greater good and a waveform collapse of all copyright owners' Terms of Usage and various nasty legal mechanisms mostly created by the U.S. (but did you know copyright is a criminal violation in Japan?). The money is needed to lobby other governments too, so that they increase networking and slow down copyright legislation (otherwise the U.S. government and its corporate backers will tend to continue to push foreign data laws down the throats of relatively undeveloped countries). Actually 1 billion dollars is only the beginning of what it will cost but Google is going to have to act to ensure its future.

4. Develop a tree of archival nodes (indexing all information streaming through them) from devices to communities to states. The forerunner of this will be simpler, smaller, cheaper google servers placed in music, radio, tv, print and other publishers, subsidized mainly by google, for the triple purpose of providing searchability to the companies, securing a future resource for Google to search and distribute when the copyright runs out, and creating a secure data store linked to a payment mechanism to realize on-demand sales for those companies with little sweat. Using Google will ensure low middleman charges since Google needs this stuff. Publishers will jump to offer Google rights 50 years later plus a small margin on online sales until then, in return for storing, indexing and selling now.

5. With the exception of data prior to this point, the index is done when networked data in our lives hits 100%.

6. We can approach 100% a bit faster by reducing duplication, but we will end up with massively higher storage requirements as we find new definitions of important information, such as securely storing genetic data assays every year per individual for health purposes, or storage of information on preferences concerning the just in time construction of micro- or nanoscale robots.

So how long does this take? Google has started selling servers already. They will do a heroic job until we hit the network singularity (everything we do is all together on the one net, indexed as it is produced by intelligent agents). Seems likely the network singularity and a minimal form of artificial intelligence will coincide. This says the task will be done in maybe 50 years (well between 30 and 100), and not 300.

The rest of the 300 years can be spent cross-correlating all of mankind's knowledge; actually the beginning of this is starting now but when say in the next 10-30 years actually make headway in the semantic web so that we get a yahoo-like index of all scientific and business knowledge, we will stop duplicating information, and education and business will get more efficient. It will accelerate as we learn more about ourselves and advances in computing will just barely keep up with indexing and crosscorrelating it all as we start eating up CPU cycles for things other than video games and imaginary friends. My guess is 100 years. So Google has no business talking about 300 years concerning anything except maybe plans for space travel. Their online world is going to change way too much by then! Figure that (if it is economically important) 80% of the people on the planet will have the same capabilities as Google does now in 20 years. They will only have to worry about 300 years if they keep evolving their goals higher and higher.

World's Information by manojar · 2005-10-10 22:21 · Score: 1

Is it like the world series?

--
Manojar - pronounced like Manager

Slashdot Mirror

300 Years to Index the World's Information

248 comments