Slashdot Mirror


Yahoo Passes Google in Total Items Searched

tonyquan writes "Yahoo announced today that its search engine passed Google's for overall capacity, with 20 billion documents and images indexed versus 11.3 billion for Google. Observers had previously pegged Yahoo's index at just 8 billion items. The growth is due to a recent expansion effort. More info can be found on the Yahoo! Search blog and at CNet."

74 of 434 comments (clear)

  1. fantastic by abrotman · · Score: 3, Interesting

    My google-fu isn't bad, but I sometimes have trouble finding relevant results. I figure adding 9 billion more possible results should complicate things quite nicely.

    1. Re:fantastic by ciroknight · · Score: 4, Insightful

      I always wonder about that. How many of those billions of additions to the engine pages that retroactively generate pages according to what is searched for?

      I *hate* those pages the most, as they usually have every word in mankind listed in six or more languages, and just so happen to grab the one you're looking for just to suck you in to their million popups.

      I guess quality verses quantity will be an afterthought; we're about to see quite the cache expansion if my gut feeling is right.

      --
      "Victory means exit strategy, and it's important for the President to explain to us what the exit strategy is." G.W.Bush
    2. Re:fantastic by fembots · · Score: 4, Informative

      While 9 billion additional pages are pretty useless to an individual, it can however mean each topic will have an additional 30 pages, or a search on Ferrari images gives another 25 pictures.

    3. Re:fantastic by b0r1s · · Score: 4, Informative

      Google's index should be growing faster in the coming months. With more and more webmasters implementing Google's sitemap helpers, a lot of unlinked/dynamic pages should start showing up very, very soon.

      --
      Mooniacs for iOS and Android
    4. Re:fantastic by timeOday · · Score: 3, Insightful
      I figure adding 9 billion more possible results should complicate things quite nicely.
      How do you figure? Do you find it harder to find restaurants in large cities?
    5. Re:fantastic by HD+Webdev · · Score: 4, Funny

      How do you figure? Do you find it harder to find restaurants in large cities?

      Only if most of those restaurants in large cities give you a menu that only lists Viagra as something you can order.

      --
      This is not a dream, not a dream...we are transmitting from the year 1-9-9-9.
    6. Re:fantastic by timeOday · · Score: 2, Insightful

      How do you know the 9BN pages google's not indexing are not worth indexing? How would google know? And if they did know those pages were no good, how would indexing them pose a risk of obscuring the better pages?

    7. Re:fantastic by xs650 · · Score: 5, Insightful

      If over 1/2 the restaurants in big cities were fake restaurants built to look like the restaurant you were looking for, yes it would be.

    8. Re:fantastic by natrius · · Score: 4, Insightful

      You, sir, win the award for worst analogy ever. Restaurants only stay in business if enough people patronize them to make the restaurant worth running. Web pages, on the other hand, are almost, if not totally free to toss up. Some things are crap, some things are gold, but I think the crap to gold ratio goes way up as the number of pages increases. The crap that goes up on the internet stays up, the crappy restaurants don't. Google's PageRank is supposed to filter out things that no one else thinks is worthy of linking to, which can eliminate much of the problems caused by a high crap to gold ratio, but the gradparent's statement that adding many more web pages may harm results is a perfectly plausible assertion.

    9. Re:fantastic by Finuvir · · Score: 2, Insightful

      Dude! What the fuck is a popup?

      --
      Why is anything anything?
    10. Re:fantastic by baadger · · Score: 2, Interesting

      Results 1 - 20 of about 824,000 for ferrari. (0.08 seconds)

      It varies. When I view it through Coral cache I get 461,000

    11. Re:fantastic by Steve+Franklin · · Score: 2, Funny

      Whoosh yourself. I got it. You didn't get that I was extending your joke to include a jab at the guys who create the popups. You are not as smart as you think you are, Sonny.

      --
      Hic iacet Arthurus, rex quondam rexque futurus.
    12. Re:fantastic by jtjin · · Score: 2, Funny

      It's Google updating their public database today from their secret cache of pages that nobody knows about except for me. THERE'S A SECOND INTERNE****NO CARRIER----

      --
      No rest for the livid.
  2. Interesting by Rupy · · Score: 2, Interesting

    Provided it is correct... I don't suppose there is any third party organisation that was allowed limited access to the data to confirm it?

    1. Re:Interesting by Rosco+P.+Coltrane · · Score: 2, Insightful

      Just go there and see for yourself: a search on the word "a" (letter "a", whatever) yield 11.5bn results. If you admit there may be twice as many pages without "a" in it (say, all non-latin webpages, files, jpgs and such), that's pretty close to their 20bn entries.

      Of course, now if you still doubt, you're welcome to count all 11.5bn results and make sure none of them are dupes :-)

      --
      "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    2. Re:Interesting by matt21811 · · Score: 2, Funny

      "Of course, now if you still doubt, you're welcome to count all 11.5bn results and make sure none of them are dupes :-)"

      Thats easy to do, just submit all 11.5bn pages as stories on slashdot and the dups will automatically appear on the front page!

  3. Yahoo! playing Tortoise to Google's Hare by Ohmster · · Score: 4, Insightful

    It's interesting to see that Yahoo! may have surpassed Google on this metric. Over the past decade, Yahoo! has beaten other "hares" to date, including AOL and Microsoft's MSN. They're doing some innovative stuff, but also have some areas to catch up on. More here: http://mp.blogs.com/mp/2005/08/on_the_merits_o.htm l

    1. Re:Yahoo! playing Tortoise to Google's Hare by cybersaga · · Score: 4, Funny

      but also have some areas to catch up on

      Like how to park?

  4. Great by Rosco+P.+Coltrane · · Score: 5, Insightful

    Now all Yahoo has to do is create a real search engine that can actually spew out relevant results amongst those 20 billion entries...

    --
    "A door is what a dog is perpetually on the wrong side of" - Ogden Nash
    1. Re:Great by jalefkowit · · Score: 3, Interesting

      Have you actually tried Yahoo lately?

      I've been finding that Yahoo's engine is as good at returning relevant results as Google, at least for my searches. In fact, in some cases it is even better at putting the most relevant hit in the first position than Google is.

      Of course, YMMV. But if you're still going off impressions of Yahoo Search you formed back in 1997, you might want to give their new-and-improved engine a spin sometime...

  5. Great... by Lewisham · · Score: 5, Insightful

    ...now it'll be even harder to find anything on Yahoo! Google keeps and holds its users because searches *work*. When I search for something, Google has a very high chance of giving me what I want in 4 pages or so. Yahoo! isn't as good at getting me the information I want. The problem might even be made *worse* with all these pages. Yahoo! has never said, AFAIK, how it ranks pages, but Google does it better. With this wealth of data, the ranking system is going to be under much more scrutiny at picking the right pages.

    1. Re:Great... by donutello · · Score: 3, Informative

      Google keeps and holds its users because searches *work*.

      You must not have used Google recently. It's been about 2 years since Google stopped returning useful results. Now, most of the results are crap. Unfortunately, there isn't a better search engine out there.

      --
      Mmmm.. Donuts
    2. Re:Great... by MacJedi · · Score: 3, Interesting
      It's been about 2 years since Google stopped returning useful results. Now, most of the results are crap.
      This has not, in general, been my experience. Can you give an example where a reasonable search yields results that are mostly "crap?"
      --
      2^5
    3. Re:Great... by bedroll · · Score: 5, Funny
    4. Re:Great... by Lewisham · · Score: 2, Insightful

      I don't have any proof, it's all subjective. YMMV. Adding more potentional hits, however, is not guaranteed to make things better. It might makes things worse.

      Massive simplification: If you have a dataset that's twice as large, you're going to get twice as many pages that might be right. The searching mechanism will be under more difficulty trying to determine which is the best one to return and in what order it should be. This is going to really work the ranking algorithm, if its not up to snuff, its going to return you pages in the wrong order of helpfulness, and the page that might have been exactly what you wanted and returned on the first page under the smaller dataset might end up bumped onto page 4.

      Google, everyone knows, uses a PageRank algorithm, so other pages do the picking for it. As the dataset gets bigger, results could actually improve, as more sites add to the reputation of others. As I said, I don't know if Yahoo! has ever published how it picks pages. The larger set could make things worse, it could makes things better. But because I have been finding Google better at ranking results than Yahoo! so far, I'm erring on the side of caution.

    5. Re:Great... by AEton · · Score: 3, Interesting

      Searching for information on digital cameras, especially less popular ones, often yields dozens of pages of sites that want to sell one and no descriptions of user experiences or reviews. (Even the manufacturers' own product description is usually hard to find.)

      This problem is not specific to digital cameras - it's endemic to any piece of hardware that the majority of the Internet is interested in selling rather than discussing. It's great that your Yahoo! Storefront and its twenty clones want to give me a great deal on the Flibbet Jibbet Cog, but I'd really like to know what people who use it think about it.

      (Occasionally some results with "flibbetyjibbit linux compatibility" will work - but never general product information!)

      --
      We recently had heard in the office over one of the Yellow Machine that's made by Anthology Solutions.
    6. Re:Great... by skiflyer · · Score: 2, Interesting

      I wouldn't call it useless... but certainlly not as useful as it used to be for me... this weekend I needed to learn how to countersink a screw.

      So I searched "how to countersink a screw"... and first a handful of links selling bits used in countersinking, then a page or two of links for how to projects which required the countersinking of a screw... then a few links about what a countersunk screw is... then I said screw it, this doesn't look so hard and I just did it... never found the link, but I'm sure it's there somewhere after page 3 where I gave up.

    7. Re:Great... by mph · · Score: 4, Informative

      Adding "review" usually results in storefronts that say "Be the first to review this product!".

    8. Re:Great... by rm999 · · Score: 2, Interesting

      Many products that are electronical or computer related return terrible results. I either get a bunch of technical pages or commercial pages. For example, the motherboard on a computer I was fixing was malfunctioning. I was looking for a manual. The manufacturer's site was not on the first site. Instead, I found several pages where I could buy the product.

      I think an option in google that excludes sites with large data tables (technical sites) and prices (froogle sites) would be great. It would get me to pure information a lot quicker.

    9. Re:Great... by poofyhairguy82 · · Score: 3, Insightful
      Searching for information on digital cameras, especially less popular ones, often yields dozens of pages of sites that want to sell one and no descriptions of user experiences or reviews.

      Simple answer, put this in Google (or whatever):

      cameramodel review -checkout -buy -shipping

      Then get what you want. The engines can't read your mind. They don't know if you want to buy one, see a review for one, get a hack for one, sell one, etc.

  6. Googlebot is not very aggressive on internal links by Anonymous Coward · · Score: 5, Informative
    We recently launched a mobile search engine. The domain was registered, pages created, etc, so I'm observing it go from zero page rank, to having a page rank and getting crawled. Yahoo's bot definitely crawls more frequently, and Googlebot doesn't seem to crawl any links unless they are linked to from external pages. I assume that as the pagerank increases, Googlebot will get more aggressive, but from what I can see in the logs it's clear that Googlebot takes a "wait and see" approach to crawling.

    That's not a bad thing. There are a lot of useless pages out there, and having twice as many pages in the index certainly does not mean twice as many useful pages.

    I am glad to see the search engine wars are on and competitive.

  7. More important by Chairboy · · Score: 5, Insightful
    A newsflash that's more important to me is how, years ago, Google passed Yahoo's abillity to display relevant results.

    Why isn't programmer efficiency measured in KLOCs? Because quality is more important than quantity when used as the only metric.

  8. Quantity versus quality by IamGarageGuy+2 · · Score: 5, Insightful

    I don't believe that volume of pages is really a relevant metric to be used in the case of search results. With an infinite number of pages the real metric comes down to relevance.

    --
    Stay tuned for new sig...
  9. I've noticed that recently by Eightyford · · Score: 2, Interesting

    I've found that yahoo! slurp is almost always my most frequent visitor to my websites.

  10. Hey Yahoo by Spackler · · Score: 2, Informative

    It's not the size of the boat...
    it's the motion of the ocean.


  11. 20 billion documents, I wonder... by baylanger · · Score: 5, Funny

    Are those 20 billion documents, the actual SPAMs I received at my yahoo mail account since 1994?

  12. 95% of which is crap by darkCanuck · · Score: 5, Interesting
    • useless blogs and geocities "websites": 12 billion
    • clipart, midi and hideous backgrounds for above websites: 6 billion
    • links to outdated or expired user sessions: 1 billion
    • real content: 1 billion, if lucky
    The only thing I ever use Yahoo for is if and when my internet connection seems slow or dead I ping yahoo.com. It's just been a habit since the 90's.
    1. Re:95% of which is crap by danila · · Score: 2, Informative

      This blog post (and especially the comments) discusses pinging yahoo.com, the switch to pinging google and what else do other people ping.

      Incidentally, this is the 2nd result when searching for "ping yahoo" on Yahoo! and only the 9th result when searching on Google (the first 8 are much less relevant).

      This is typical example of real-life "ping yahoo.com to check if you're online" suggestion.

      P.S. And personally I do ping yahoo.com. The are the Internet and compared to them Google is insignificant. :)

      --
      Future Wiki -- If you don't think about the future, you cannot have one.
  13. Re:Why Google ain't all that by HeroreV · · Score: 3, Informative

    I agree about Froogle. Usually over 90% of all items can't be ordered by price even though the engine was clearly able to determine what the price was. How is it being froogle if you can't easily figure out which is the cheapest?

  14. Re:Googlebot is not very aggressive on internal li by Eric+Giguere · · Score: 4, Informative

    The Yahoo! crawler (Slurp) is definitely more aggressive than the Googlebot. It comes knocking on my door several times a day, especially the blog pages. Google is more conservative and keeps things in a sandbox, too.

  15. Google needs to become mature like Yahoo by ShatteredDream · · Score: 2, Insightful

    If Google wants to survive in the long run, they will need to stop playing favorites based on political ideology. They give, IMO, too much lee way for their adsense and google news people to restrict access. One blogger I know of was rejected as a "racist" because she questioned whether Nelson Mandela really should be called a hero. The irony of it is that my blog is far more politically incorrect than hers and AdSense for some reason accepted me. I wrote a letter to Google about the behavior of their AdSense policies and News development team, but they did the customary Google response which was "we don't care."

    The thing that Google needs to wake up and realize is that they have been caught doing genuinely evil things like letting Hamas use AdSense to promote their recruitment and training centers, and Yahoo has survived enough big companies attacking them to make them a longterm threat. The real war is between Google and Yahoo, not Google and MSN, and Yahoo understands clearly how being apolitical is necessary to really become a hub for finding and accessing data online.

    Don't be surprised if in a few more years of broadband development, that Yahoo is able to position itself as an alternative to many cable TV providers. Expect them to start providing premium content alone or in conjunction with Apple. If that happens, Google is actually going to be screwed because the market for that sort of media is huge and the amount of money that Yahoo will have will dwarf Google. Sooner rather than later, Google's stock price will crash down to maybe $20-$30 a share unless they really do some death-defyingly radical things every so often over the next several years that the market likes. In fact, I'd wager that if Yahoo can get deep into providing on-demand TV services, that in five years they'll be able to buy Google in cash unless Google really does become the "Microsoft of search services."

    1. Re:Google needs to become mature like Yahoo by Anonymous Coward · · Score: 3, Funny

      In other news, some guy wrote a Slashdot post and linked to his own blog. Woah, shocker. 9_9

  16. Several interpretations by Alomex · · Score: 2, Interesting

    I read this in one of two ways:

    a) Yahoo crawler is not as discriminating as google, collecting loads of garbage and mirrored sites

    or

    b) Google is finally falling behind the Web. In the past every snazy search engine eventually got overwhelmed by web growth and fell beihnd. Has that time arrived for Google?

    On a different note I've heard a rumor that Google's total CPU count across all its server sites is fast approaching a million. If this is true, talk about barriers to entry! Anyone out there who can confirm or deny this?

    1. Re:Several interpretations by Alomex · · Score: 2, Informative

      Indeed. It is interesting to note that the new MSN engine crawling like mad had a hard time matching Google's count, much less surpasssing it. Out of the blue comes laggard Yahoo with a much larger count. Pardon me, but I'm somewhat skeptical.

      Say using the old technique of searching for typos I just tried Yahoo and Google. Yahoo reports five matches versus Google's five. However out of the five Yahoo matches three of them are spurious!

      Some other searches with their actual count:

      Yahoo 1, Google 1.
      Yahoo 0, Google 1.
      Yahoo 1, Google 5.
      Yahoo 26, Google 36.

      This reminds me of an old Altavista crawl, where they discovered that nearly 10% of their pages where non-standard 401 pages.

  17. It's true, and easy to check... by NotQuiteReal · · Score: 4, Interesting
    I did a search for "a" on both Google and Yahoo.

    Results:

    Google: "1-10 of about 3,120,000,000 .06 sec"
    Yahoo: "1-10 of about 11,300,000,000 .08 sec"

    Top yahoo hit - some punk band. Top Google hit, apple .com.

    Gee, who do you think will make more money with those results... ;-)

    --
    This issue is a bit more complicated than you think.
    1. Re:It's true, and easy to check... by aaarrrgggh · · Score: 2, Insightful

      People being willing to buy and people wanting to sell are different incentives. Google used to be great for researching a topic. Now if I want to buy (say) a solar cell, it is very hard to find anything but storefronts.

      The value originally afforded by the web was the fact that I could find things out about different technologies, efficiencies, lifespans, etc, prior to making the purchase. This gave me an advantage over traditional information gathering techniques.

      If in the "real world" I want to buy a few hundred solar cells, I have to talk to a manufacturer's rep, where I am unlikely to find any unbiased information. The "real world" equivalent of some of these search results are trade magazines in highly specialized areas. Most of the ads are for unknown companies or companies trying to push into a new market.

      For Google and Yahoo to keep ad revenue up, they are going to need to make sure people continue to get helpful information segregated from empty shells.

    2. Re:It's true, and easy to check... by DavidD_CA · · Score: 2, Interesting

      Does this mean that the other 9,000,000,000 pages on the internet don't have the letter 'a' in them?

      I tried searching for "a OR e OR i OR o OR u OR y" but actually got about 1/3 less results on each. How does that happen?

      --
      -David
  18. Sorry for the "self reply"... by NotQuiteReal · · Score: 2, Funny
    But I find the 11.3 Billion hits for Yahoo an odd coincidence, given that the /. story mentions 11.3 billion for Google ...

    Maybe the editors need to check on something, or we all ought to count 11.3 billion as the "new 42".

    --
    This issue is a bit more complicated than you think.
  19. Big Increase - Simple Explanation by ndansmith · · Score: 5, Funny

    The increase can be explained by Yahoo adding Slashdot dupes to their index.

  20. My own - albeit anecdotal - experience... by mosel-saar-ruwer · · Score: 5, Interesting

    I've spent the last few days doing some very important searching - we're thinking about launching a new product in a rather arcane field, and I wanted to be absolutely certain who the potential competition might be - hence I decided to search both Google & Yahoo!.

    Guess what? Yahoo! search beats Google search, hands down. Not even close.

    Two thoughts:

    1) While everybody was oohing and ahhing about Google's IPO, Yahoo! very quietly went about purchasing some excellent search engine/caching outfits, like Inktomi and AllTheWeb, and, owing to the great dot-com bust, paid only pennies on the dollar in acquiring some outstanding talent and intellectual property.

    2) I think Google's been reading too many of their own press releases, and has been resting on their laurels for a few years now. And it doesn't help matters that their CEO, Eric Schmidt, is the same fella who damn near drove Novell to bankruptcy.

    1. Re:My own - albeit anecdotal - experience... by coflow · · Score: 5, Insightful

      I do think this is interesting to note, but I have to ask you as a business man, what matters more to you, the quality of the search or the number of people using the search engine. From anecdotal evidence, I can tell you that I maybe know of 3 or 4 people who use yahoo to search, and pretty much everybody else uses google or has firefox search toolbar set to google.

      I can make a better hamburger than McDonald's can, but you're probably better off investing in them than you are in me.

    2. Re:My own - albeit anecdotal - experience... by Sancho · · Score: 4, Informative

      Multiple search engines are probably the way to go, honestly, but here's some counter-anecdotal evidence.

      Search for:
      super mario world hacks

      on each of Yahoo and Google, and check the first hit. Google takes it hands down, with an entire page devoted to SMW hacks, vs. Yahoo's page on SNES hacks.

      I routinely try other search engines, and while another one occasionally trumps Google, the big G tends to come out on top overall.

    3. Re:My own - albeit anecdotal - experience... by iceanfire · · Score: 2, Informative

      "Especially since they have the minimalist interface which doesn't suck." Last time I checked, Google used that interface... infact Yahoo copied it from them.

    4. Re:My own - albeit anecdotal - experience... by xtracto · · Score: 2, Interesting

      Just FYI, here is a nice comparison of your search query between the two engines.

      --
      Ubuntu is an African word meaning 'I can't configure Debian'
    5. Re:My own - albeit anecdotal - experience... by Darkman,+Walkin+Dude · · Score: 3, Informative

      I can only agree here. A couple of interesting points, yahoo will index your website whether or not any site in the world is pointing a link to it, and yahoo actually pays attention to the the meta tags at the top. Now while I'll be the first one to observe that meta tags have been abused horribly, in a lot of cases they do in fact represent the content of the site well. Its no more of a risk than any of the other criteria used to index websites, really. The quality of google's search and image search has declined quite a bit in the last few months, the question is whether or not they recognise that.

  21. Re:Why Google ain't all that by daniel_mcl · · Score: 3, Insightful

    Nonsense.

    Search: Google's Pagerank concept radically changed the way that search engines determined which results were relevant. While previous services were based on human rankings or on how many times a particular word was listed on the page, Google put out an automated system which was able to deliver more relevant results when confronted with normal sites and, by its very design, much harder to exploit with SEO techniques. Further, Google continually tweaks the parameters of their search -- if you can go to one of Norvig's talks about the sorts of stuff they do, it's amazing.

    Maps: That interface -- scrolling, markers, and all -- is done entirely in javascript. No plugins, no flash, no helpers. Nobody thought that that sort of thing was even possible.

    GMail: I don't use it, so I can't comment. But I do have around 1 GB of email on my primary account. When you use email for serious work, it can add up.

    Google Groups: It's my group reader. I like it because it shows the discussions in thread format from the top and supresses the quoting that can make USENET discussions turn into pages and pages of greater-than symbols.

    As to your assertion that Google hasn't ushered in a new age, I disagree. Ten years ago, when someone wanted information they went to a library, an encyclopedia, or maybe a CD-ROM. Now, any time anyone wants to know anything, they go immediately to Google and chances are that the information will come up on the first page.

    Lest you've forgotten, it was Napster and Winamp that 0popularized mp3's, not the iPod, and COBOL, not Oracle, that popularized the database. So I'd respond to you, "Stop the misinformation campaign."

    --
    I used to read Caltizzle. I was a lot cooler than you.
  22. In Soviet Russia... by Anonymous Coward · · Score: 2, Funny
    from google:
    Results 1 - 10 of about 3,930 for "In Soviet Russia" slashdot.org

    from Yahoo!:
    Results 1 - 10 of about 11,300 for "In Soviet Russia" slashdot.org - 0.38 sec.

     

    Looks convincing to me, comrades!

  23. If anyone can do it... by brunes69 · · Score: 4, Insightful

    ... how come no one is?

    Where else can I find the likes of Y! Calender / Mail / Address book, all integrated, for free? Point me there and I might jump ship.

    GMail is great for email, but it's address book is a POS, and there is no calendering whatsoever. Meanwhile, over at Y!, I have a calender that not only shows me the weather forecast for the week embedded into it, but it also issues me reminder notices via Y! IM for important dates.

    Not to mention the vast usefulness of other Y! services like Launch! and Y! Photos.

    Google may be leading the way as far as search, maps, and email goes, but for other services, *they* are the ones playing catch-up. For example, see their "Customized" home page, which http://my.yahoo.com/ had beat about 3 years ago.

    1. Re:If anyone can do it... by rpdillon · · Score: 2, Informative

      Try MyWay. They're ad-free too. I've been using them for years. They don't have some of the newer stuff, but they have a notepad, calendar, address book, news, email, search, games, etc.

    2. Re:If anyone can do it... by AoT · · Score: 2, Funny

      That is funny, I just read it as "why not?"

  24. Frigidaire effect by fishbowl · · Score: 2, Insightful

    Even when you use "Yahoo" to search for something,
    you're still googling it. Just like xeroxing on a Canon, or putting food in the frigidaire (even if it's a Kelvinator.)

    Google has this kind of brand identity, for good or for worse. This is a status that both Napster and Tivo almost acheived, but fizzled just in time to escape the phenomenon.

    --
    -fb Everything not expressly forbidden is now mandatory.
  25. Google vs. Yahoo crawl frequency by flinxmeister · · Score: 3, Interesting

    I've noticed that Yahoo's crawl visits my site more frequently...but Google's crawl seems to be intelligent about how often it crawls.

    If I update alot, google crawls more. Yahoo doesn't seem to care.

    So all these folks talking about yahoo being better may be off the mark. Why crawl all the time when you can only crawl when necessary?

  26. Re:And in other news... by Alcimedes · · Score: 2, Insightful

    However, the article said nothing about Yahoo becoming better, just Bigger. There's a world of difference between the two.

  27. What on earth? by mcc · · Score: 4, Insightful

    So.. Yahoo is mature and Google is not because Google's news service reprints many and varied websites-- but not some of the "blogs" you like-- and Yahoo's news service reprints Reuters? I'm not entirely sure what's going on here but it sounds like you are misinterpreting some kind of personal poor experience with Google's sales department as an actual problem.

    Google and Yahoo news do not even offer remotely the same kind of service, nor are the services equal in importance. Yahoo News is almost closer to the core of Yahoo's service than even the search; Google News is more auxiliary from Google's perspective, and I don't think they're even getting much money off of them.

    Anyway, frankly IMO "blogs" shouldn't be on google news anyway. Period. If I wanted a blog aggregator, I'd go to a blog aggregator. Google News is a news aggregator. The difference may mostly be only in terms of what the aggregated sites choose to identify themselves as, but that's enough of a difference for me.

    As for AdSense, the categories based on which things can get classified as inappropriate for AdSense are extremely broad and if you're expecting close attention paid to border cases, I think you're expecting things of the service that the service never intended. And if the person your complaint here concerns is Michelle Malkin...? Well, from what I've read of her stuff, if you're trying to defend her against accusations of racism then some article about Nelson Mandela would be only the tiniest part of the problem.

    Don't be surprised if in a few more years of broadband development, that Yahoo is able to position itself as an alternative to many cable TV providers.

    Wait, wasn't this exact same prediction being batted around, like, five to seven years ago? And didn't it fail to work out then either? Hm, you are a blogger, aren't you.

  28. I've got Results as to why I prefer Google: by Ralph+Spoilsport · · Score: 5, Interesting
    OK: I did a brain fart search on both engines. The word? Kyzyl. It's the capital of Tuva. Tuva is an obscure little suburb of Mongolia. Yep. When you think your stupid relatives who bought a place in Indiana live in the middle of Nowhere, you're wrong. Tuva Is The Middle Of Nowhere.

    So, In Firefox tab A, I have Google and tab B is Yahoo. Both searched on Kyzyl.

    Results (pleae pay attention because htmling this was a pain...):

    Yahoo's first 5 entries:

    * All Russia Hotels All Russian Hotels - We offer discount hotel reservation services online in Moscow, St. Petersburg, Kiev, Russia, Ukraine, CIS and Baltic. www.allrussiahotels.com

    * Tuva Travel Kyzyl city is the capital of Tuva Republic (Russia) Kyzyl city is positioned right in the center of Asia, which is proudly claimed by a local monument specifically dedicated to this fact. www.sokoltours.com

    WEB RESULTS

    1. Wikipedia: Kyzyl
    Open this result in new window
    Wikipedia Free Encyclopedia's article on 'Kyzyl' en.wikipedia.org/wiki/Kyzyl

    - More from this site - Save - Block

    2. Weather Underground: Kyzyl, Russia Forecast
    Open this result in new window Find the Weather for any City, State or ZIP Code, or Airport Code or Country. Email. Password. Maps. United States. International. Information. Refinance Rates. GoTo Meeting. Kyzyl Singles. Hosting Companies. Online deals! Vitamins. Internet Mall ... Updated: 8:00 AM KRAST on August 02, 2005. Observed at Kyzyl, Russia (History) Elevation: 2064 ft / 629 m ... Coming soon: Flash Stickers. Kyzyl, 63 F / 17 C ...
    www.wunderground.com/global/stations/36096.html
    - 64k - Cached - More from this site - Save - Block

    3. AllRefer.com - Kyzyl (CIS And Baltic Political Geography) - Encyclopedia
    Open this result in new window

    3. AllRefer.com reference and encyclopedia resource provides complete information on Kyzyl, CIS And Baltic Political Geography. Includes related research links. ... By Alphabet : Encyclopedia A-Z - K. Kyzyl, CIS And Baltic Political Geography ... Kyzyl or Kizil[both: kizil'] Pronunciation Key, city (1989 pop ...
    reference.allrefer.com/encyclopedia/K/Kyzyl
    More from this site - Save - Block

    Now, for the first five Google Results on Kyzyl:

    Kyzyl'-administrative center of Republic of Tuva, Russia Kyzyl' Republic of Tuva,
    |Central-Chernozemny| ... Republic Capital:, Kyzyl. Capital Population:, 91000( at 01/01/94) ...
    members.tripod.com/~argun/kyzyl.htm
    - 5k - Cached - Similar pages

    Kyzyl on Encyclopedia.com
    Kyzyl or Kizilboth: kzl, city (1989 pop. 85000), capital of Tuva Republic, S Siberian Russia, on the Yenisei River. It services motor transport and has ...
    www.encyclopedia.com/html/K/Kyzyl.asp
    - 47k - Cached - Similar pages

    Kyzyl Travel Information. Photos, Stories and Diaries about Kyzyl
    Sustainable Tourism for independent travellers (travelers) and backpackers. www.worldsurface.com/browse/location.asp?locationi d=5654
    - 59k - Cached - Similar pages

    Kyzyl, Tuva, Russia current local time
    Kyzyl, Tuva, Russia - before placing a telephone call or making travel plans for a flight or hotel, get the current local time provided by ...
    www.worldtimeserver.com/current_time_in_RU-TY.aspx ?city=Kyzyl
    - 17k - C

    --
    Shoes for Industry. Shoes for the Dead.
    1. Re:I've got Results as to why I prefer Google: by good-n-nappy · · Score: 2, Insightful

      How about a search for mortgage on Google. Hmmm, this looks familiar. The two top results seem to be sponsored links instead of real results. Does "this [infer] that commerce puts people above the law"" on Google?

      --
      Never underestimate the power of fiber.
  29. Re:And in other news... by sound+vision · · Score: 2, Insightful

    lolololol pwned He's right, you know. The Anonymous. And the moderation system is useless, it pushes a mental monoculture and dissenting opinions are modded as trolls or flamebait. I have both set to add +5 points instead of subtract to combat this. I have to sift through some crap posts, but if that's what it costs, so be it.

  30. The Search Engine Size Game by vicaya · · Score: 3, Informative

    For popular search terms (queries with millions of hits) index size doesn't matter much. Yahoo, google, ask, msn etc all produce pretty similar results (that tend to favor established sites/pages.) For rare terms or combinations, which contribute to the Long Tail of web search, index size is very important. Both Yahoo and Google report estimated (often inflated) hits for popular terms and exact numbers for rare terms, which still include dups. You need to go to the last result page to find out the exact non-dup number, which sometimes can shrink the de-dup'ed hits by a factor of 10. Let's see how the new yahoo fairs against google with a few queries I picked randomly:

    • "Acid Brass" stockport - yahoo:20 google:24
    • "anetan district" - yahoo:17 google:15
    • "chunder blunder" - yahoo:25 google:27
    • "information theoretical death" - yahoo:45 google:46
    • kliningan juru - yahoo:27 google:47
    • "phylogenetic organisms" - yahoo:5 google:10
    • zibelthiurdos thrace - yahoo:9 google:4

    Yahoo used to consistently underperform google on rare terms, it seems they indeed have caught up. But it has NOT really exceeded google in terms of useful size (Yahoo has more dups.) Still, it's a worthy engineering effort. Congrats!

  31. Let me explain by Moraelin · · Score: 4, Insightful

    The problem is the difference between raw data and useful information.

    When you look through a list of restaurants (or the list of anything in the yellow pages), you're looking at something put together based on _semantics_. Some human put that list together and made sure the _meaning_ is what you'd expect there: you can actually drive to one of those locations and order food.

    Search engines, on the other hand, just look at the words and have no bloody clue of semantics.

    If someone ever put together a list of restaurants, it would just be a list of all people who ever said the word "restaurant". Including everyone who ever said "I hate chinese restaurants" or "I took my gf to a restaurant" or "I went to see a new apartment, but it was above a restaurant" or whatever. Needless to say, driving to most of those locations would be a bloody useless exercise.

    Adding another 20 million people to that kind of indexing would just raise the noise-to-signal ratio, not actually produce anything useful.

    --
    A polar bear is a cartesian bear after a coordinate transform.
  32. Tuva != middle of nowhere by TeknoHog · · Score: 2, Informative
    AFAIK, Tuva is part of Russia, not Mongolia.

    Besides, it's hardly the middle of nowhere, as it has become famous for its traditional throat singing. One of the people who made it famous was Richard Feynman; I first learned of Tuva as I was searching for stuff on Feynman. It shouldn't be news to any fan of Feynman that he was into obscure music.

    If you're looking for less well known parts of the world, you might have a look at the other 'autonomous republics' within Russia, such as Komi or Mari.

    --
    Escher was the first MC and Giger invented the HR department.
  33. I am not surprised. by mrjb · · Score: 3, Informative

    Google refuses to index pages that aren't linked to by at least a gazillion other sites, submitted or not.
    My site, for example, has been up and running for nearly two months, submitted a few times and actually linked to by a few pages that are indexed by Google but it still doesn't appear *at all* in Googles index, not even far in the bottom.

    Even if you enter site:www.....com in the search bar directly, it just says it doesn't know it. At least Yahoo has got it in there, never mind high ranked or not.

    --
    Visit http://ringbreak.dnd.utwente.nl/~mrjb/growingbettersoftware to download your free copy of the book
  34. Re:Google's sitemap helpers by SebastianX · · Score: 2, Insightful

    The number of 8 billion searchable pages on Google's home page wasn't touched for a long time. Usually they do an update when another engine claims to have a bigger index. Also, this number does not include images etc., Yahoo's number does. I agree that Google's sitemap helpers will dig out a lot of stuff from the hidden Web. Most probably Google's index contains way more than 8 billion pages, perhaps even more than 20 billion objects.

    --
    http://sebastianx.blogspot.com/
  35. A faint voice by evilviper · · Score: 2, Funny

    Just then, a faint voice could be heard comming from Yahoo's headquarters...

    "I'm not dead yet..."

    --
    Slashdot gets worse every day... Pipedot: News for nerds, without the corporate slant