Slashdot Mirror


300 Years to Index the World's Information

Kasracer writes "At the Association of National Advertisers annual conference, Google's CEO, Eric Schmidt suggested that it would take 300 years for them to index all of the world's information. From the article: 'We did a math exercise and the answer was 300 years,' Schmidt said in response to an audience question asking for a projection of how long the company's mission will take. 'The answer is it's going to be a very long time.'"

21 of 248 comments (clear)

  1. Longer than expected by powerpuffgirls · · Score: 5, Funny

    I always thought 42 years ought to be enough.

  2. New hardware needed by nizo · · Score: 4, Funny

    The hardest part will be developing the hardware that is able to recursively index the Google data itself an infinite number of times.

    1. Re:New hardware needed by spuzzzzzzz · · Score: 5, Funny

      It's OK, they use linux. It does infinite loops in 5 seconds.

      --

      Don't you hate meta-sigs?
  3. What About... by Adrilla · · Score: 4, Insightful

    Did they take into account the information that is being created as they are indexing? Do they plan on live indexing everything that's being made. Information doesn't stop getting created just because they've stored everything that's already been done.

    --

    "Plans are for fools! Oglethorpe, the plutonian (Aqua Teen Hunger Force)
    1. Re:What About... by htrp · · Score: 5, Interesting

      I would assume that it would be to index the collective sum of information, even as it is growing. It's probably a lot quicker to index something than it is to generate it. With probable future advances in computing power and the development of new algorithms, it should be entirely possible that the speed of indexing (which already probably surpasses the speed of information production) would catch up to all the data that still hasn't been indexed.

      Think of it in terms of taking a ratio comparison of two infinite series.

  4. 300 years? by RonnyJ · · Score: 5, Funny

    300 years? I'd have thought their other plan would have been a lot quicker.

  5. I'd like my house indexed by obli · · Score: 5, Funny

    How long until Google decides that your house is information? Just imagine an army of small robot spiders invading your home every night, registering the position, name and contents of every single object you own, making it searchable from house.google.com. Unless you nail a robots.txt to your front door, that is...

    1. Re:I'd like my house indexed by jacksonj04 · · Score: 5, Funny

      locate:keys | pocket
      locate:phone | pocket
      locate:underwear -girlfriend | rm

      --
      How many people can read hex if only you and dead people can read hex?
  6. Everybody! by Slashdiddly · · Score: 5, Funny

    Please stop creating new information and let Google catch up! You can resume later.

  7. Yeah right.. by Klowner · · Score: 5, Funny

    It's going to take them a hell of a lot longer than that, considering my car keys are always moving.

  8. When I read the summary by colonslashslash · · Score: 5, Funny
    I immediately thought of the Futurama episode - The Why of Fry - where the giant brains build the brainsphere and assimilate all the knowledge in existance, before attempting to destroy the entire universe so no new information can be added.

    Googlesphere anyone?

    --
    She's built like a steak house, but she handles like a bistro....
  9. On a related note... by RyanFenton · · Score: 4, Interesting

    I wonder how many man-years it would take to listen to all the music and video that could be indexed. Be interesting at least to find out what the order of magnitute would be - millions, or perhaps billions or trillions of man-years of unique recorded audio and video? It would have to be a game of gross estimation - but it would at least put into perspective how much material is out there, even if most of it is boring "security" footage, compared to the scope of our lives.

    It'd be interesting, if, perhaps in a couple generations, we could have a cheap media volume that contained "recorded media, prehistory - to - 2050ad"... if the media that exists today even survives a couple generations, and copyrights aren't extended indefinetly. The idea of an indexing system that can even put all that information into a meaningful context would be fascinating to consider though, if it could be possible.

    Ryan Fenton

  10. Competition? by psst · · Score: 4, Interesting
    From the article:
    Of the approximately 5 million terabytes of information out in the world, only about 170 terabytes have been indexed, he said earlier during his speech.
    Storing 5 million terabytes has got to cost a lot of resources. It would be very inefficent if every competing search engine stored that much data. Makes me wonder if it would make more sense to nationalize Google's index and share it amongst competitors (just like it makes more sense for goverments to build airports and share them amongst airlines rather than every airline building its own airports).
    1. Re:Competition? by Shihar · · Score: 4, Insightful

      Nationalize Google? Are you joking me or just insane? You want to take one of the most innovative and successful companies that the US has right now a nationalize it!?

      I have a better idea, how about you just send out a government hit squad to kill to put a bullet between the eyes of single entrepreneur in the US. It will accomplish the same sort of freeze in the growth of innovative small businesses but look far less insane.

    2. Re:Competition? by Halfbaked+Plan · · Score: 4, Interesting

      Oh, come on. You're talking about a company that is mostly an advertising enterprise now. Who is Google hiring? Admen and their ilk. It's sometimes depressing how enamored the 'community' had become in a company whose main purpose is leveraging eyeballs to look at their ads.

      (how DARE I say anything bad about Google. Mod this down IMMEDIATELY.)

      --
      resigned
  11. 300 Years? Feed Those Pigeons! by Comatose51 · · Score: 4, Funny

    Obviously they're not feeding those pigeons enough. Time to buy some quality feeds Google. Maybe even slip in some uppers every now and then. If all else fails, maybe it's time to consider the parrot upgrade. They're a lot more expensive but their index/poop ratio is much better.

    --
    EvilCON - Made Famous by /.
  12. Makes no sense by bobintetley · · Score: 4, Insightful

    We did a math exercise? What exercise?

    To estimate the time involved, you surely need to know the size of the information involved (don't quote me that bunkum about 170 terabytes in TFA - yes I did read it), and to know the size you need to know what all the information is, which you can't (and surely new information is created all the time?).

    This translates as "I pulled my finger out my ass, waved it in the air and came up with 300 years."

  13. was he joking ? by flynt · · Score: 5, Insightful

    "We did a math exercise and the answer was 300 years," Schmidt said in response to an audience question asking for a projection of how long the company's mission will take. "The answer is it's going to be a very long time."

    Since this was in response to an audience member's question, does anyone else think he was joking? Because it is such an outlandish question from an information theory and modeling point of view, perhaps he was mocking it? "Ah yes, we just came up with an equation and it should take 294.59 years." I think this also makes sense in light of his next comment, which was made on a more serious note. I interpret it, "We really didn't use an equation, it will obviously take a long time though." This is how I understod his comments, and I may be wrong, but it wouldn't surprise me if some reporter picked up on this "joke" and put it up as "news".

  14. Re:i hereby propose by b100dian · · Score: 5, Funny

    ...Google indexed it all in 6 days, and took a rest in the 7th...

    --
    gtkaml.org
  15. Re:I'm curious... by vidarh · · Score: 5, Insightful
    I take it from that comment that you don't see much value in a thirteen year old girl's blog? What about a thirteen year old girls diary?

    Like Anne Frank's?

    Fact is, it's incredibly hard to determine today what will have value tomorrow. Most of those thirteen year old girls (or 20-something geek guys) blogs will have no historical value. But some of those people will grow up to have a profound impact on the world (or they may not grow up, but still have a profound impact, as was the case with Anne Frank). It may be ten years from now. Or 50.

    Who knows what the writing they do now might tell us about what brought them wherever they end up? When people write diaries on paper chances are reasonable they'll survive and show up in an attic somewhere. But as more and more content get online, we also risk facing the loss of entire generations worth of many types of information to bit rot and simple lack of foresight.

  16. Re:I'm curious... by Vellmont · · Score: 4, Interesting

    I think the parents question is perfectly valid. What is considered "information"? I'd consider a blog information, but is a painting some random artist creates included in this list of "information"? Is my laundry list information? How about my individual handwriting in my laundy list?

    The question of is something valuable isn't exactly an either-or proposition, but a matter of assigning a probability that a certain piece of information is valuable. Couldn't we agree that say the presidents day to day activities are more likely to be important in 100 years than say a single 13 year olds blog? Does that mean that 13 year olds blogs are worthless? Well no, but they aren't the thing I'd first choose to preserve.

    The question I have is, is the greater difficulty in control over online information balanced by the greater ease of keeping it around? Google doesn't delete messages from email for this very reason. We tend to throw stuff away because it takes up too much space, or because it just becomes clutter. But with increased storage space every year and better ability to keep track of it (and seperate it from things we consider important), why ever throw away information?

    Online information portability is obviously a problem. How do you move someones blog somewhere else, and have it mean anything in say 50 years? I think these problems will be solved as people expect information to be more portable and standardized. The solutions I think will come from the short term portability and needs rather than a few people wanting to preserve something for the next 100 years though. Many people make the assumption that standards are short lived things that are here today, gone tommorow. I'd have to disagree on a historical basis. How old are reel to reel tapes, and you can still find a player at say a thrift store. CD-audio has been around for 25 years and is still the default medium for music today. Ascii has developed I don't know how long ago and yet still is quite popular and if you have a computer that can't read it, you've got a fairly useless computer. Standards have a way of sno-balling and gathering momentum to live on a long time.

    --
    AccountKiller