Google URL Index Hits 1 Trillion
mytrip points out news that Google's index of unique URLs has reached a milestone: one trillion. Google's blog provides some more information, noting,
"The first Google index in 1998 already had 26 million pages, and by 2000 the Google index reached the one billion mark. Over the last eight years, we've seen a lot of big numbers about how much content is really out there. To keep up with this volume of information, our systems have come a long way since the first set of web data Google processed to answer queries. Back then, we did everything in batches: one workstation could compute the PageRank graph on 26 million pages in a couple of hours, and that set of pages would be used as Google's index for a fixed period of time. Today, Google downloads the web continuously, collecting updated page information and re-processing the entire web-link graph several times per day."
Or it didn't happen.
Once the index reaches a google (or rather a googol), the universe explodes.
[alk]
Seriously, since the web is something like 42% porn. (Yes, that is the ultimate answer.) So that's on average, 60-70 pages of each person in the world naked.
How many of those are automatically generated rank-spoofers, 80%?
My favorite spoof pages were the ones that randomly substituted search terms into porno stories.
"Yes!" she screamed as he thrust his SAMSUNG CD PLAYER deep into her. "I want you balls-deep in my CHEAP HARD DRIVES!" The smell of DISCOUNT SOFTWARE filled the room.
Kwisatz Haderach
Sell the spice to CHOAM
This Mahdi took Shaddam's Throne
..knowing that the vast amounts of porn just keep getting vaster. And more searchable. Amen. *sheds a tear or two*
[Slashdot Comments We Liked]
So unless there is a screenshot showing the 1,000,000,000,000 site count, Google's index didn't reach that milestone? Even if it now shows 1,000,000,000,001?
The 1,000,000,000,000th page had only one word on it:
"woosh"
My hobby:
Getting the fewest possible google results above 0 with a quoted string.
"interspecies gangbang": 6
"hot topic meets disney world": 2
"died in a blogging accident": 15,300
"can boys make babies": 4
"why does it hurt when I read": 1
Google is headquartered in Mountain View, CA -- I know, 'cause I googled it. Now, California is rather inclined to think of itself as it own country (some would say, universe), but it is indeed part of the United States of America (again, I checked with Google). And in the US, "trillion" == 1E12 (again, Google).
Generally, bash is superior to python in those environments where python is not installed.
I imagine that certain sites, such as sites the size of Slashdot (in terms of dynamically generated pages), make a difference. After all, the index talks in pages, not domains. I bet there's also a lot of junk and redundancy in there, but still, it's quite an achievement to be able to deal with that much data.
Surely you're not saying that Slashdot's full of junk and redundancy and redundancy?
Cogito, ergo sig.
Considering your comment is #24345983, I'd say about 24.3 million comments. Also, I believe there's about 1.5 million different users.
Turns out Live.com's market share for today has tripled due to Slashdot users clicking on the above links...
Also, I believe there's about 1.5 million different users.
yeah but if you take out Twitter and all his sock-puppets you'll just be left with 500K unique users...
"woosh" Funny? It went straight over my head.