Google Index Doubles

← Back to Stories (view on slashdot.org)

Posted by samzenpus on Wednesday November 10, 2004 @11:00PM from the even-more dept.

geekfiend writes "Today Google updated their website to indicate over eight billion pages crawled, cached and indexed. They've also added an entry to their blog explaining that they still have tons of work to do."

11 of 324 comments (clear)

Min score:

Reason:

Sort:

no update on the images by bvdbos · 2004-11-10 23:07 · Score: 3, Informative

Unfortunately they didn't update the image-search yet.
Re:Google thieves my bandwidth by Anonymous Coward · 2004-11-10 23:35 · Score: 5, Informative

Google respects the robots.txt file. Use it.
Re:I'm all alone by tadmas · 2004-11-10 23:43 · Score: 3, Informative

8 billion pages and not a single link to my blog.

Perhaps you should just tell them where it is.
Mine is bigger than yours!!! by ayjay29 · 2004-11-10 23:46 · Score: 4, Informative

From BBC News here.

In a statement Microsoft said its search engine returned results from five billion web pages - more than any other search engine.

But this quickly won a response from Google which announced that its index has now grown to more than 8 billion pages.

Prior to the Microsoft announcement, Google was only indexing 4,285,199,774 web pages.

Steve Ballmer is soon to announce that his daddy is one hundrad years old, and kan kick your daddy's ass...

--
Offtopic, Inflammatory, Inappropriate, Illegal, or Offensive comments might be moderated up.
Re:Quality - not quantity by dabadab · 2004-11-10 23:49 · Score: 3, Informative

"[i]Since pagerank was switched off[/i]"

Since when is Pagerank switched off?

--
Real life is overrated.
Searching LiveJournal.com by hackrobat · 2004-11-10 23:49 · Score: 4, Informative

Looks like they've added a gazillion LiveJournal pages to their index. I used to have a Google search box on my LJ that didn't throw up relevant results until last week or so. Now it works perfectly, just like builtin search (like what you see in MT and WordPress).
Competing with Microsoft's 5bn? by Richard+W.M.+Jones · 2004-11-10 23:51 · Score: 4, Informative

On the same day that this story hits the BBC. In that story Microsoft claim that they have 5 billion pages indexed, more than the 4.2 billion pages indexed (at that point) by Google. The BBC have just updated the story with the 8bn figure.
I smell competition!
Rich.

--
libguestfs - tools for accessing and modifying virtual machine disk images
robots.txt by ReKleSS · 2004-11-10 23:51 · Score: 3, Informative

Yes, this is probably a troll, but anyway... I take it you've never heard of the robots.txt file? You sound like you might want to read up on it. It's designed to help control the spidering of your pages for whatever reason, particularly cases like yours or situations where a spider would get confused and end up doing something stupid (recursive stuff, etc).
-ReK

--
md5sum -c reality.md5 reality: FAILED md5sum: WARNING: 1 of 1 computed checksum did NOT match
Re:Google needs your cookie badly by Anonymous Coward · 2004-11-10 23:52 · Score: 3, Informative

You can still save those settings but google refuses to use them when you block their cookie. In my case I get 10 search results although I like to receive 100.
Create a keyword bookmark with the URL
http://www.google.com/search?q=%s&num=100

Give it the keyword 100, then type 100 search_term in the address bar to use it.
Re:Google thieves my bandwidth by jvj24601 · 2004-11-11 00:09 · Score: 5, Informative

Well, if you know that Google is indexing your site and "stealing" your bandwidth, then you must have looked at the server logs, right? You'd see the name of the search bot is googlebot. Search for it, and you'll find that the first relevant link explains how to prevent googlebot from accessing your site.

The logs would probably also show failed attempts to find the file /robots.txt. Similar info is gained from searching on that term as well.
Re:What? by jez9999 · 2004-11-11 00:25 · Score: 4, Informative

Erm, that's only because of the bizarre plus signs the grandparent poster put in - try this. Note to grandparent: Just about any modern search engine assumes words not prefixed by anything are to be included in the Boolean search query. No need for +.

--
== Jez ==
Do you miss Firefox? Try Pale Moon.