Slashdot Mirror


Why Do Google Hit Numbers Vary?

Supa-Fly writes "I have a question about some conflicting results with the search engine google. I did a search for "pictures of mountains" and got exactly 1 million results. My friend did the same search (from the same office) and got 1,010,000 results. A second friend did the same search as the last 2 and got 1,020,000. These have not changed and every person gets the same results each time. My question is what is up with the discrepancies on google's search results?" Since this question is hard to answer from the outside, Craig Silverstein of Google kindly supplies his best answer to this question, below.

Craig writes: "Thanks for the great question. We get this from time to time and hopefully I can clear up some of the confusion. The estimated number of pages listed to the top right of a Google search results page is, indeed, an estimate. It's a good estimate, but still an estimate.

There are many reasons why one might see a difference in the estimated number of pages returned for the same query. It's most likely the queries made by your co-workers were sent to different Google datacenters in what appears to have been a round-robin fashion. The index at any given Google datacenter can change slightly over the course of a day (each index is refreshed completely every three to four weeks). Depending on which datacenter finishes a query, the estimated number of results may vary.

Without having direct access to your environment it is hard for me to tell for sure; however, I believe this is the case."
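
A minimal sketch of the behaviour Craig describes, assuming each client is assigned to a datacenter in round-robin order on first contact and then stays there (the stickiness is an assumption made here to match "every person gets the same results each time"). The datacenter names are made up; the three estimates are the ones from the question.

```python
from itertools import cycle

# Hypothetical datacenter names; the estimates are the figures reported above.
ESTIMATES = {
    "dc-a": 1_000_000,
    "dc-b": 1_010_000,
    "dc-c": 1_020_000,
}


class Balancer:
    """Assigns each new client to the next datacenter in rotation."""

    def __init__(self, datacenters):
        self._rotation = cycle(datacenters)
        self._assigned = {}

    def route(self, client):
        # Assumption: a client keeps the datacenter it was first given.
        if client not in self._assigned:
            self._assigned[client] = next(self._rotation)
        return self._assigned[client]


def search(balancer, client):
    """Return the result estimate seen by this client."""
    return ESTIMATES[balancer.route(client)]


balancer = Balancer(ESTIMATES)
for person in ("Supa-Fly", "friend 1", "friend 2", "Supa-Fly"):
    print(person, search(balancer, person))
```

Each person sees a different estimate, and repeating the search gives that person the same number again, which is consistent with what was reported from the office.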

40 of 362 comments

  1. Interesting Google phenomena by ergo98 · · Score: 5, Interesting

    Several weeks back I happened to mention a very nice new restaurant in Toronto on one of my pages, and within days it shot to the #2 position on Google for searches on several variants of the restaurant's name. I knew this because I was suddenly seeing close to a hundred hits per day from people looking for this restaurant. Note that the restaurant has such a unique name that there are only around 5 pages of links in all anyway. Then the hits suddenly stopped entirely, and a search on Google showed my page had been purged from the database: despite the name being unique with few hits, my page no longer even registered. A week later it was suddenly back in the #2 spot again.

    No idea why this happened, but it is entertaining to see it vary.

    1. Re:Interesting Google phenomena by dknj · · Score: 4, Informative
    2. Re:Interesting Google phenomena by lemox · · Score: 5, Interesting

      It's called the "Google Dance", which is mentioned in an earlier comment.

      --

      "We obviously need a new moderation category: (-1, Woo-fucking-hoo)" --Mr. AC

    3. Re:Interesting Google phenomena by AntiNorm · · Score: 4, Funny

      It's called the "Google Dance"

      Wow, I must really be tired...the first thing I thought of when I read this was a hampsterdance.com-esque site complete with dancing "Google"s and background music.

      --

      I pledge allegiance to the flag...
      of the Corporate States of America...
  2. Amazing! by PeterClark · · Score: 5, Insightful

    An "Ask Slashdot" that actually went to the source for the answer first, without the usual bad/wrong/pointless pontificating that normally goes along with it. How long can such a good thing last, I wonder.
    :Peter

    1. Re:Amazing! by unitron · · Score: 4, Funny
      "How long can such a good thing last, I wonder."

      The way Slashdot editors keep reposting stories? Indefinitely!

      --

      I see even classic Slashdot is now pretty much unusable on dial up anymore.

    2. Re:Amazing! by Sebby · · Score: 4, Funny

      And with all the dupes, we might start seeing more of these too!

      --

      AC comments get piped to /dev/null
    3. Re:Amazing! by Drakonian · · Score: 5, Funny
      How long can such a good thing last, I wonder.

      Maybe you should Ask Slashdot?

      --
      Random is the New Order.
    4. Re:Amazing! by SpaceLifeForm · · Score: 5, Funny
      Well, according to google,
      A search for 'ask slashdot correct answer'
      returns 10,400 hits.

      A search for 'ask slashdot'
      returns 104,000 hits.

      Therefore, I would conclude that Slashdot going directly to the source immediately is very rare.

      Obviously YMMV.

      --
      You are being MICROattacked, from various angles, in a SOFT manner.
  3. number oddities by millette · · Score: 4, Interesting

    What's really odd is searching for a few words with OR, and noticing that adding words actually lowers the number of results obtained.

    1. Re:number oddities by pete_p · · Score: 4, Informative

      That's because Google doesn't do boolean searches. It ignores the "or" (too common a word) and ends up treating it like an AND search.

      --
      Insert wit here.
    2. Re:number oddities by sparkz · · Score: 4, Informative

      Wrong. OR is a boolean operator to Google. Check the "Advanced Search" link.

      --
      Author, Shell Scripting : Expert Re
    3. Re:number oddities by millette · · Score: 5, Informative

      Actually, if you use an uppercase OR, it will perform a boolean search. Otherwise, the search defaults to an AND, unless of course you're using doublequotes "like this" to search for a phrase.
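
A toy sketch of the rules described in this thread: uppercase OR joins alternatives, everything else is ANDed, and a double-quoted string is matched as a phrase. The tiny corpus and all function names are made up for illustration, and this is not how Google actually evaluates queries. Lowercase "or" is simply treated as one more required word here, which is one way adding words can shrink the result set.

```python
import shlex

# Made-up documents, already lowercased.
DOCS = {
    1: "pictures of mountains in the alps",
    2: "mountain pictures and maps",
    3: "pictures of lakes near mountains",
}


def matches_term(term, text):
    """A multi-word term is a phrase; a single word must appear as a word."""
    words = text.split()
    parts = term.lower().split()
    n = len(parts)
    return any(words[i:i + n] == parts for i in range(len(words) - n + 1))


def parse(query):
    """Return a list of OR-groups; every group must match (implicit AND)."""
    groups = []
    join_next = False
    for tok in shlex.split(query):  # shlex keeps "quoted phrases" together
        if tok == "OR":             # only uppercase OR is an operator
            join_next = True
            continue
        if join_next and groups:
            groups[-1].append(tok)
        else:
            groups.append([tok])
        join_next = False
    return groups


def search(query):
    """Return the sorted ids of documents matching the query."""
    groups = parse(query)
    return sorted(
        doc_id
        for doc_id, text in DOCS.items()
        if all(any(matches_term(t, text) for t in group) for group in groups)
    )


print(search("pictures mountains"))
print(search('"pictures of mountains"'))
print(search('"pictures of mountains" OR "mountain pictures"'))
```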

    4. Re:number oddities by Forgotten · · Score: 5, Insightful

      I nearly always use double quotes to search for phrases. It works extremely well with google. You can also combine multiple phrases, and unquoted terms as well.

      In fact, I'm surprised no one else mentioned that searching for "pictures of mountains" (quotes included) yields 1320 hits, which are likely to be much more useful than the other 998,690 or so. Though in this case I really would have searched for "pictures of mountains" OR "mountain pictures" (or done two searches).

      If you're not going to use the quotes, there's precious little point including the word "of" in the query.

      There are other useful tricks for the google search field listed on the help page, but double quotes is by far the most useful overall.

      (another handy trick if you're using Mac IE is to hack the app's resource fork so the '?' address bar shortcut goes to google instead of MSN - a trick expanded on in iCab's built in URL expansion)

  4. googledance by wfmcwalter · · Score: 5, Interesting
    There are a number of websites (dare I say "fansites") devoted to the study of google result variance - the so-called googledance.

    this and this

    --
    ## W.Finlay McWalter ## http://www.mcwalter.org ##
  5. Its too bad.. by FunkSoulBrother · · Score: 5, Funny

    It's too bad Google doesn't have one of those things where you can watch everyone's search scrolling down the screen live. I bet there would be a lot of "pictures of mountains" searches right about now.

    I think some engine had that (metacrawler?) back in the day. It was fun to watch, and I believe they didn't censor it.

    1. Re:Its too bad.. by danimal · · Score: 4, Informative

      Ah, but Google does have one....well, available at the Googleplex.

  6. wow, could we all have missed this? by Tiber · · Score: 5, Informative

    About a month ago, someone posted this story over on K5 regarding the google dance. Good to see it's run by a marketing site, I couldn't think of anyone who might have more of an interest in rankings than those bastards. :P

  7. Eureka! by creative_name · · Score: 5, Funny

    No wonder I couldn't find the website I was looking for! It was in those missing 10,000 websites. If I had only gotten those and checked through them as thoroughly as I checked the other 1,010,000 then I would have certainly found it.

    Humor aside, this is pretty interesting. A lot like when you vote in a poll, go back to the main /. page and the poll from last week appears. You'd think the Uber Midgets and Stealth Ninjas could get it right ;-)

    --
    Posting as directed.
  8. First Google Haiku Post by Ayanami+Rei · · Score: 5, Funny

    like snowflakes falling
    google queries melt upon
    different servers

    like the wild flowers
    each view of the database
    unique, yet alike

    and...
    its that time of month
    google dances, results wiggle
    w00t first haiku post

    --
    THIS THING CAN TURN ON A DIME, MACROSSZERO STYLE ALSO FUCK BETA, ~NYORON
  9. Pictures of Mountains? No wonder by sssmashy · · Score: 4, Funny

    It's simple, really... mountains are the new thing in pornography. People are snapping and posting so many pictures of naughty, erotically shaped rock formations that the number of mountain pics available worldwide on the net is rising by about 10,000 every 10 minutes.

    Soon, the number of phallic granite pics worldwide will even exceed the number of Jenna Jameson facials. Quite the phenomenon, really.

    1. Re:Pictures of Mountains? No wonder by breon.halling · · Score: 5, Funny

      This is for those of you who think he's kidding... ;)

      --
      "Yeah, well, Dracula called and he's coming over tonight for you and I said okay."
  10. *grin* by Eric+Seppanen · · Score: 5, Funny

    Finally, proof that all Ask Slashdot questions could be more quickly answered by simply checking with Google :)

    --
    314-15-9265
  11. This is a coverup by elhondo · · Score: 5, Funny

    Results have been inconsistent ever since they let those damn pigeons unionize. He's obviously covering for the union.

  12. Some different results by jsprat · · Score: 5, Interesting
    Here's what I get:

    "pictures of mountains" 986,000
    "pictures of of mountains" 1,010,000
    "pictures of of of mountains" 1,020,000

    Two of these pages had a different top-ranked link.
    Funny thing, all three times Google told me "of is a very common word and was not included in my search", but it made a difference!

    Regardless of these results, Google is the best search engine. Period.

    1. Re:Some different results by Wild+Wizard · · Score: 4, Interesting

      Has no one mentioned the advanced settings you can use that change what sites you get in a search?

      w/english only
      1,010,000

      w/all languages
      1,040,000

      w/strict filter and all languages
      903,000

      w/strict filter and english only
      881,000

  13. Re:uh... by swordboy · · Score: 4, Funny

    I have a question about some conflicting results with the search engine google. I did a search for "pictures of mountains" and got exactly 1 million results.

    Steven? Is that you? Dude - you're smoking too much pot!

    --

    Life is the leading cause of death in America.
  14. Google Images filters by Antity · · Score: 4, Interesting

    Google is still beating my photo album...I searched for pictures of mountains, and only found 3. And two of those are debatable.

    Ever tried to turn off Google Images' "You-really-don't-want-to-see-this" filter?

    I mean.. You were searching for "pictures" of "mountains"... Big breasts, that is? ;-) Nah.

    It's "&safe=off", and people outside the US might want to change the language to English before trying to use it (hint).

    Funny thing here in Germany is: The filter is ALWAYS ON, and in the German preferences, there's no option to turn it off. After you change your language to English (URL), though, there suddenly appears an option for disabling the filter... Try talking about censorship (there are not even clear rules about what exactly they are filtering, and there's no explanation why you can't turn it off over here; even worse: They don't even tell you that there IS a filter and that it's always active).

    I asked Google about this, but never got a response.

    --
    42. Easy. What is 32 + 8 + 2?
  15. Re:uh... by Anonymous Coward · · Score: 4, Funny
    • who cares....as long as it works
    You, sir, will never be a geek.
  16. I have no idea... by El+Camino+SS · · Score: 4, Funny


    Perhaps we should Ask Jeeves.

    Hmmmmmm?

  17. Google does reverse-routing by SHEENmaster · · Score: 4, Informative

    so you may connect to any one of several servers. The servers each have different databases to pool results from and different caches to display.

    A google search for my site returns our old site that has had dead dns records for nearly a month above my new site. Sometimes my new site pulls into the lead, sometimes it isn't there, and at least one cache has the announcement that the old name was lost and a domain was purchased.

    --
    You can't judge a book by the way it wears its hair.
  18. Re:I have to wonder... by RedWizzard · · Score: 4, Insightful
    I get a list of 7 pages, and then after getting to page 5, there are only 6 pages.
    I believe that what's happening there is that as you move through the pages of results Google realises that some of the later results are similar to some of the earlier results and omits them. You can get them back by clicking on the link at the end of the last page.
  19. Google Dance by kiwirob · · Score: 5, Informative

    Results can also vary due to the Google Dance.

    Google has 7 data centers, each with a copy of its index, and these are "usually" mapped to www.google.com. But Google also has versions located at www2.google.com and www3.google.com.

    During the monthly update there can be a different version of the index on each of the 3 hosts. The website www.google-dance-tool.1hut.com provides results for a search done on all 3 of Google's indexes.

    To check whether the google dance is happening, the most common technique is to check the "back links" for major sites like Yahoo by typing "link:www.yahoo.com" into the search box. This will list all the sites with links to "www.yahoo.com".

    The Google Dance Tool site mentioned checks google every 5 minutes to see if the dance is on. Once it is started it sends out an automated email to subscribers (like me) so I can visit the site and see what the search positions for the next month on google will be using their google dance tool search.
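
The backlink check described above can be sketched roughly like this. It is only an illustration: the fetching is left as a callable you would have to supply, the stub and its counts are made up, and the assumption that "the dance is on" simply means the three hosts disagree is mine, not something the Google Dance Tool documents.

```python
# The three hostnames mentioned in the comment above.
HOSTS = ("www.google.com", "www2.google.com", "www3.google.com")


def dance_in_progress(counts):
    """True when the hosts disagree on the backlink count."""
    return len(set(counts.values())) > 1


def check_dance(fetch_count, query="link:www.yahoo.com"):
    """Ask every host for its count via the supplied fetch_count(host, query)."""
    counts = {host: fetch_count(host, query) for host in HOSTS}
    return dance_in_progress(counts), counts


def fake_fetch(host, query):
    """Stand-in for a real lookup; these numbers are invented."""
    made_up = {
        "www.google.com": 710_000,
        "www2.google.com": 742_000,
        "www3.google.com": 742_000,
    }
    return made_up[host]


dancing, counts = check_dance(fake_fetch)
print("dance in progress:", dancing)
```

Passing the fetcher in as a parameter keeps the comparison logic testable without hitting any real server.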

  20. Why do they vary? by Regul8or · · Score: 4, Funny

    Everyone knows it's because of the pigeons. Every time you have an analog element, such as a pigeon, in the equation your end result will vary.

  21. heres why it jumps by deft · · Score: 4, Interesting

    google employs a search spider called the freshbot. the freshbot spiders constantly, looking for new content, and periodically injects those results into the search engine listings at the data center. these results drop and return sometimes.

    chances are your site was freshbotted, dropped, and re-cataloged. it could also be a result of the 'dance' at the end of each month when google is updating its search results. rankings fluctuate a lot at that time.

    --

    There's nothing Intelligent about Intelligent Design.
  22. Re:Amazing! - Ask GoogleFight by poopie · · Score: 5, Funny
    How self-referential! Referring to http://googlefight.com in an article about Google on slashdot, replying to a post that is using google to determine the accuracy of 'Ask Slashdot'... and providing links that rate two google searches about slashdot against each other.

    Slashdot is right vs. Slashdot is wrong:

    slashdot sucks vs. slashdot rules:

    slashdot correct vs. slashdot incorrect:

    Cmdrtaco vs. cowboyneal

    News for nerds vs. Stuff that Matters

  23. dupes via datacenter? by nlinecomputers · · Score: 4, Funny

    So Google has more than one datacenter to cache Slashdot and Slashdot often caches itself when it dupes stories. So the cached story on Google is a dupe of a previous story but Google's second datacenter purges the story because it is a dupe and then it is reposted again on Slashdot when it is purged by the first datacenter and duped again and purged by the second datacenter and cached on the third and duped by Slashdot....*bang*

    See judge he needed killing. He was stuck. It was a mercy killing...

    --
    Slashdot, home of supporters of free software, free music, and free speech.Except for Moderators that disagree with you.
  24. google (mis)uses by lucasw · · Score: 4, Funny

    Spell Check:
    Type in candidate spellings of a word, and assume the spelling with the most search results is the right one:
    'amatuer' -> 3.9e6 hits, 'amateur' -> 35e6 hits. Amateur it is.
    'modelling' -> 2.6e6 hits, 'modeling' -> 5.7e6 hits. Close call, perhaps both are acceptable?
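
The hit-count spell check above boils down to "take the spelling with the most hits, and shrug if the runner-up is close". A small sketch, using the hit counts quoted in this comment; the function name and the 3x "close call" threshold are arbitrary choices made here, not anything Google provides.

```python
def pick_spelling(hits, close_ratio=3.0):
    """Return (best_spelling, is_close_call) from a {spelling: hit_count} dict.

    close_ratio is an arbitrary threshold: if the winner has fewer than
    close_ratio times the hits of the runner-up, call it a close call.
    """
    ranked = sorted(hits.items(), key=lambda item: item[1], reverse=True)
    best, best_hits = ranked[0]
    close = False
    if len(ranked) > 1:
        runner_up_hits = ranked[1][1]
        close = runner_up_hits > 0 and best_hits / runner_up_hits < close_ratio
    return best, close


print(pick_spelling({"amatuer": 3.9e6, "amateur": 35e6}))
print(pick_spelling({"modelling": 2.6e6, "modeling": 5.7e6}))
```

Since the counts are only estimates that vary by datacenter, a close call like the second one probably shouldn't be trusted either way.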

    Ego Boost:
    Everyone knows about this one: see what comes up under your own name (put it in quotes if necessary). Hopefully, if you run a small website or comment with your real name frequently in a google searchable place, that'll come up first. But you'll have to work hard to beat out all those genealogy sites that just list thousands of names, graveyard roll-calls and whatnot. Oh, and there's some court case from five years ago where your name is featured prominently. My namesake is shared with one of the first shaken babies to die and become a major local (wherever it happened) news story - not much of a boost after all.

    Stalking:
    I'd imagine this is pretty similar to the previous, but with names of other people you know or used to know: your old college sweetheart died in 1892! Wait...

    Trademark pre-research:
    You need a product name - something fresh and original, and easily googleable? Start with a few ideas, and use a thesaurus (and don't forget cool foreign language words/roots) to refine the name until google hits are down to zero. Run words together; otherwise potential customers will end up at sites that just randomly use those words at different points of the text - assume the customer is too dumb or lazy to use quotes.
    'NodeZero' is my new badass something-or-other - wait, there's 1K hits, how about 'NodeNull'? Only 8 now, that's good, but better yet try 'NodeNothing' - zero results.
    After the google test see if the .com, .org, or .net site with the same name resolves, just in case.

    I'm sure there's many more...

  25. Re:uh... (sex) by Corbin+Dallas · · Score: 5, Funny
    I'm lucky if I get hit on for sex once a month. How do you guys get so many offers?


    Oh, page hits?! Err, nevermind. :-)

    --
    Democracy is two wolves and a lamb voting on what to have for lunch. Liberty is a well-armed lamb contesting the vote.
  26. Try reading: The Anatomy of a Large-Scale Hypertex by Prof.Phreak · · Score: 4, Interesting
    try reading: The Anatomy of a Large-Scale Hypertextual Web Search Engine

    http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm

    --

    "If anything can go wrong, it will." - Murphy