Slashdot Mirror


The Un-Google - The Search Competition

WinEveryGame writes "The Economist is running an article on the state of the competition for Internet Search. While Google clearly dominates, and continues to have positive momentum, its leadership is still vulnerable. The search-engine battle is not over yet." From the article: "In terms of momentum — mass times velocity — Google's lead indeed looks daunting. It has by far the most mass, with an American market share of 43% as of April, which reaches 50% counting AOL, an internet property that uses Google's search technology. This compares with 28% for Yahoo!; 13% for MSN, which belongs to Microsoft; and 6% for Ask, which is owned by IAC/Interactive Corp, a conglomerate of about 60 online media brands. Google also has velocity: its market share grew by 17% in the four quarters to this spring, whereas Yahoo! and MSN both lost share. Only Ask has more velocity — its share grew by 35% — but then again it has little mass."

36 of 141 comments (clear)

  1. The Many New Possible Fronts by eldavojohn · · Score: 5, Interesting
    The search-engine battle is not over yet.
    Of course it isn't.

    There are some customers (government/military included) that are aware of the two concepts of precision and recall. Before you groan and skip this post because you recall those words from all classifying algorithms, you should take note that there are two stages we have yet to meet in this respect.

    One is simply improving precision without sacrificing recall. When I search for 'horn' in Google, how many of those searches are relevant? I was thinking about a French horn (instrument) and the first link brings me to a society about them. The next three links, however, do not. You might say, "Well, gee, you should have put 'French' in your search" but is this really necessary? So there is some money to be made in "learning" search engines that tailor themselves to the user or perhaps the results could be displayed intuitively in domains of knowledge (a la Clusty). So that I can select a node that applies to the correct searching term and see all results returned below that. Have you ever wished to view your search results in a format other than a linear display of ranked results? The documents are related in more than one dimension, you know. As computing power increases, I suspect there will be room to display them in two dimensions (heat/area mapping, nodes & vertices on a plane) and three dimensions (spatial 3D engines with nodes & vertices in space).

    The second stage is giving the user the power to adjust precision versus recall. Even a graphical interface that shows the F-measure relationship between precision and recall would be helpful to consider in the search engine wars. Say you give the user some control through a slider AJAX interface of a threshold ß. But the threshold isn't simply the "Google score cut off" or even a term frequency cutoff. Instead, it's applied to be a "relevance" threshold. You would score relevance by fingerprinting frequency, specificity, clustering and other useful tools by using a domain ontology or taxonomy.

    Another big thing that is missing is identifying what kind of data you are searching. Social data? Scientific data? Historical data? etc. Perhaps I'm only interested in who's who to Stephen Hawking. I'd search for him and flip through nodes of separation from him to other people.

    The current search sites also only tend to favor key-word regular expressions. What about searching with raw text or entire paragraphs? If you want to see an interesting demo of this, visit Collexis' Demo Site which alludes to a whole new kind of searching.

    The key to entering the market as a competitor with Google is to pick up Google's slack and to try to pose yourself as a complimentary service to Google. Google is terrible at closed domain searches but amazingly efficient at open domain searches. You don't want to compete with them so fill a different part of the market. Google benefits from simple design, so go to an advanced flashy complex design. Most people aren't looking for that but the people that are have nowhere to go.

    The Economist is alluding to potential leadership problems inside Google. Who cares? That's not going to be Google's downfall. Google's downfall will be an new intuitive way to search and the only thing that will prevent their downfall is if they buyout the company or bone up on the technology.

    The search-engine battle hasn't even hit its stride.
    --
    My work here is dung.
    1. Re:The Many New Possible Fronts by Alex+P+Keaton+in+da · · Score: 5, Funny

      Googling "Search Engine" returns 2,190,000,000 results.
      The top 5 results?
      1.Dogpile Web Search Home Page
      2.Search Engine Watch: Tips About Internet Search Engines & Search ...
      3.Lycos
      4.MetaCrawler Web Search Home Page - MetaCrawler
      5.Mamma Metasearch search engines on the Internet.
      Google is 8th!!!!
      Granted DogPile and Mamma include Google results, but it appears Google is no longer relevant. In today's world, you are judged by your Google ranking. Google is 8th when you google Search Engines. They are a has been....

      --
      And All I Ask is a Tall Ship And a Star to Steer Her By
    2. Re:The Many New Possible Fronts by kiwimate · · Score: 2, Insightful

      Very good post, my congratulations.

      Yep. These are some of the same problems that present themselves in the document management / knowledge management / ECM market. What I want is a taxonomy. That way I can drill down when I search for horn and see the results categorized appropriately.

      But how do you present a taxonomy? It's not easy, and you really need to know the context. Example: what is a generator? Enter that word into Google and the first page is entirely links for products to generate ASCII art or banners for your church. Interestingly, the entire list of links consists of electrical generators.

      What I'm thinking about, however, has many different contexts in the electrical industry. If I'm an engineer, then a generator is something that produces electricty. If I'm in customer service, however, a generator is a company that generates power and connects to the interconnection grid.

      This also presents a problem when discussing searching on raw text or paragraphs. Again, you need to understand the context of the industry that you're working inside. You need a combination of good Natural Language Processing (NLP) and something like an expert system for your particular industry. This could be particularly important in any industry with a highly technical verbiage, such as IT, legal, medical, musical...well, you get the point.

    3. Re:The Many New Possible Fronts by Nexus7 · · Score: 2, Interesting

      > So there is some money to be made in "learning" search engines that
      > tailor themselves to the user

      I for one, wouldn't want a search engine that customizes its output to that extent for me (not the way it looks, or the ordering, but the content itself). I'd like to see results for all types of horns even though I've only been interested in air horns in my previous searches. That'd be like only watching a particular news channel that is known to be slanted one way, you'd end up thinking to world is just as they say it is.

      > or perhaps the results could be displayed intuitively in domains of
      > knowledge

      That would be helpful. For example, the biggest online auction site will show you all the results for say, "xenon lamps", but they also generate a menu in the left frame containing the main categories, such as "Automobile", "Business & Industrial", "Sporting Goods", etc. However the database of auctions is very well ordered and makes this possible. Generalizing this to contexts in real-life would be considerably harder. Classifications would be possible along many axes... a problem worthy of the company that would tackle it.

    4. Re:The Many New Possible Fronts by McWilde · · Score: 3, Insightful

      "Google is 8th when you google Search Engines. They are a has been...."

      It should be perfectly clear from your own post: noone uses the term "search engine" anymore, it's called a Google these days. Just like Xerox and Kleenex. I just googled "Google" and Google still dominates the first ten pages of results. Stanford is on page 4, Slashdot on page 8. I didn't feel the need to page on until I found Yahoo or MSN in there.

      --
      Maybe
    5. Re:The Many New Possible Fronts by Momoru · · Score: 4, Interesting

      Despite the joke, it goes to show how Google isn't that great at returning the "Best" result anymore. It needs to change its algorhythm now that all the SEO's know how to game the system.

      I still prefer Clusty.com 's way of doing search...much easier to find the most relevant thing when you type in a term used for multiple things like "Paris" Does one want the porn star or city? or "Cold Fusion".. the technology, or the programming language? Google doesn't know, and for really common terms it fails. Clusty can tell the difference, Google can't.

      Even better is Yahoo's beta search that allows you to filter results of sites that are more sales oriented or research oriented. If I want to find out about the new Trek Mountain Bike, Google hits me with tons of sales sites, when really I want reviews, or vice versa. If someone could combine all those, and then maybe a Digg like system of users rating relevance, they might have something.

    6. Re:The Many New Possible Fronts by yohan1701 · · Score: 2, Interesting

      If you put in just "search" 16,210,000,000 results
      1. Google
      2. MSN
      3. Lycos
      4. Exite
      5. Search Engine Watch

    7. Re:The Many New Possible Fronts by Anonymous Coward · · Score: 2, Interesting

      Shall the listing begin, of all the horns that are musical and not considered "French Horns"? Yes it would be nice for you, if you could teach a search engine that whenever YOU look up horn, YOU would get back results that only pertained to "French Horn". Personally, I think that is one of the worst Ideas I've heard come down the pike. What if I am searching for a car horn? Or a bicycle horn? Or rhinoceros horn?

      Let's see. It might go something like this...
      ME: Computer, I'd like to do a search...
      COMPUTER: Ooh, a search? Then you would like these results based on your last searches...
      ME: Cancel. I did not even tell you what I was searching for...
      COMPUTER: But I learned what you look for, and I have already provided it for you...

      Silly, but the point is simple. If you are searching for information about a "French Horn" you search for "French Horn". If you are looking for information about horns in general, and all the different kinds of horns there are, then you search for Horn.

      Sure, it's a great idea to have search sites offer different ways of sifting through the results of a search. Nothing at all wrong with proposing new navigation techniques, and new ways to list and cross reference what you looked up. But to think that the search engine should get to the point where it attempts to anticipate what you want, and provide it for you BEFORE you ask... that's a little over the top.

      In your example, where the engine learned you only EVER use the word Horn when the word French preceeds it, and any results related to "horn" are ignored by you if they do not pertain to the French Horn, then sure, it would learn quite nicely, and would only provide "French Horn" results for a "horn" search. WHat happens when you need to search for information on changing your "Car Horn"? By your argument, you should not need the additional qualifier of "Car" to preceed the word "Horn" and the search engine should still know that you want "Car Horn" as the target of your search. Even though you trained it to equate the word "Horn" to mean "French Horn"...

      Which opens up the possibility that your search would display as a collection of results containing "Car French Horn"... since your search engine takes the word Horn and replaces it with "French Horn" automatically after you have trained it to do so.

      But the idea of taking any search result, and breaking it down statistically and catagorically BEFORE displaying the linear results... that's good stuff.

    8. Re:The Many New Possible Fronts by miyako · · Score: 2, Informative

      One engine that has a really interesting way of dealing with this is ahref=http://www.kartoo.com/rel=url2html-10505http ://www.kartoo.com/> which is a meta search engine. Its written in flash, but uses an interesting concept of creating a "map" of different pages.
      I just did a search on "Paris" and it brings up a list of pages- you can mouse over the pages to get a preview of the site. It also gives you a list of categories such as "City of Paris" "Accomodations" and "Entertainment". These categories are listed on the background of the map, and the pages are grouped according to their category. Lines are drawn between pages showing links and the category that connects them.
      It's kind of hard to explain but actually a really neat concept. I don't use it as my primary search engine (I rarely use it at all) but it can be very useful for people who either, A: aren't familiar enough with search engines to play the "add and subtract various keywords and add quotes until I get the results I'm looking for" game or B: only have a general idea of what they are looking for, and want a quick way to narrow down their results

      --
      Famous Last Words: "hmm...wikipedia says it's edible"
  2. But since we are trying to compare quantities... by everphilski · · Score: 2

    It should really be m*v^2. (m*v is a vector equation, m*v^2 is a scalar)

  3. Does google really dominate? by EmagGeek · · Score: 5, Interesting

    It seems more and more when I try to find something on google all I get are a bunch of link farms. This morning I was trying to find a bike jersey for a friend of mine and on the first page of results, it took getting to the second page to find any actual results. I did much better using Yahoo and found what I was looking for on the first page of results.

    This is just one example, but it happens constantly...

    1. Re:Does google really dominate? by rainman_bc · · Score: 4, Interesting

      It seems more and more when I try to find something on google all I get are a bunch of link farms.

      Totally agree. It isn't that Google is great; it is IMO quite far from it.

      Personally I only stick to Google because of their usenet search. I honestly think their web search is crap thanks to google spammers, or those self titled Search Engine Optimizers.

      I actually thing Yahoo's search engine is somewhat superior... I kinda like search.yahoo.com

      --
      09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0
  4. Google groups is the killer app for me. by deragon · · Score: 2, Insightful

    The reason I mainly uses Google is Google groups; whenever I have a computer related problem, the Usenet archive is often helpful. I do not think that there is any other archive of Usenet like this out there (available for free). If there are, please share the links.

    And how many slashdotters find Google Groups useful?

    --
    Remember the year 2000? They promised us flying cars. They delivered the PT Cruiser...
    1. Re:Google groups is the killer app for me. by gowen · · Score: 3, Insightful

      Google Groups is great, and gives the lie to the idea that Usenet is dead.

      Having said that, dejanews had a much better interface. (Simple is good: you'd expect Google to know that).

      --
      Athletic Scholarships to universities make as much sense as academic scholarships to sports teams.
  5. google cant find me by JCOTTON · · Score: 2, Informative

    Actually, I have as small personal page for people who are looking for me to find me. It is here. I am probably the only person in the world with a page that lists my name and my elementary school name. If you do a search on these terms "JOSEPH COTTON SEABREEZE" in google, you will not find my page. If you do a search in yahoo, then there it is at the top. So Google is not king, by any marker other than market share.

    1. Re:google cant find me by JCOTTON · · Score: 2, Insightful

      Ironically, by posting this on slashdot, I will get a higher rating on google.

    2. Re:google cant find me by LnxAddct · · Score: 2, Informative

      Relevance is key, and your site isn't very relevant (no offense). If you put your name in quotes, it turns up a better result (1st or 2nd). Apparently there are more important websites out there that contain joseph, cotton, and seabreeze. Get over it.
      Regards,
      Steve

  6. Google sucks by Anonymous Coward · · Score: 4, Interesting

    Well, actually, almost all current search engines suck. There is waaaaaay too much noise in the results they return. Let's say I'm doing a search for "product X", search for it in Google and what do you get? Several links to ebay (which may or may not be current), tons of links to various "rate it" sites such as epinions/nextag/msn/etc, and maybe a few smatterings of other sites mixed in. Typically the manufacturers own site won't even appear in the first couple of dozen results!

    So basically, I agree with the general position of the article, that there is still a TON (actually several tons) of work to be done and room for someone else to move in with a truely superiour solution. While it's great that Google is tinkering with lots of other technologies, I wish they'd actually make some real advances in their core business (and actually, I'm slowly starting to come to grips with the fact that their core competency may not be searching, but really it's in creating low latency widely distributed computing infrastructures). For all the years and the massive sums of money, my search experience is not significantly better than it was 5 years ago.

  7. MS Uses Google :) by gasmonso · · Score: 2, Interesting

    I was on the phone with some engineers at MS the other day and even they admitted that they use Google. It's just better... for now.

    http://religiousfreaks.com/
  8. How do they calculate market share? by truthsearch · · Score: 3, Interesting

    Does anyone know how they calculate these market share values? AFAIK they don't all publish traffic statistics.

  9. Numbers don't match my stats by Betabug · · Score: 3, Informative

    Which world do these numbers come out of? This month on my private site so far I got 1400 incoming links vom Google and 30 from MSN (the next runner up), 27 from Yahoo. Maybe it's just that Google loves my site for some strange reason, but I can't imagine my own little sample of web hits is statistically so "off" from their numbers. Other sites I admin for have similar numbers.

    The numbers of pagehits by spiders from those search companies are much more on an equal basis. Sometimes one of them is on top, sometimes the other, but they all spider like crazy.

    Much more interesting are little search engines like gigabot, which never ever gave me one incoming link but still spider like it's going out of style. Somehow makes me think they must live either off warm air or spam. What reason to be do they have?

    1. Re:Numbers don't match my stats by Imsdal · · Score: 2, Interesting
      This month on my private site so far I got 1400 incoming links vom Google and 30 from MSN (the next runner up), 27 from Yahoo

      I think your numbers are less representative than most, but even so, I find Google "only" having 50% to be strange. On our site and for June only: Google 75.5%, MSN 11.8%, Yahoo 4%, Kvasir 3.1%, Google (Images) 2%, Altavista 2%, everyone else 0.2% or less.

      Since we are based in northern Europe, Kvasir (a Norweigan search engine) is obviously having a much higher share than for most other sites, but my gut feeling of Google at 75% seems reasonable.

  10. The Real Stats by Nutmegan · · Score: 2, Funny

    I don't know how they measured the dominance of these search engines, but I know how its supposed to be done: by the number of hits it gets on Google. In that respect, Google: 9,630,000,000 Yahoo: 5,240,000,000 MSN: 4,220,000,000 Ask: 2,140,000,000 Clearly, though, the most dominant search is the word "search" itself. It gets 16,670,000,000 hits.

  11. Not over yet? by russ1337 · · Score: 2, Interesting

    "This compares with 28% for Yahoo!; 13% for MSN, which belongs to Microsoft; and 6% for Ask, which is owned by IAC/Interactive Corp, a conglomerate of about 60 online media brands"

    This isn't over, simply due to lack of certainty in net neutrality. If media companies get leverage to control bandwidth to the big search companies (Google), it goes without saying that that these figures will change significantly. For Google, it could be death by a thousand cuts...

  12. Not war, market by 12ahead · · Score: 2, Insightful

    It's not a war, it's a market. Markets change. That's a good thing. Right now most people are happy with google. If something better comes along, I am sure nobody would stop embracing it just because they love Google so much. For shareholders and Google's owners it's probably different, as they want to keep on top and gain maximum market share to increase profits (or whatever). But as a consumer I couldn't care less if the best search engine is called google yahoo or msn. If a search engine searches well, I'll use it. If it doesn't - well, game over.

  13. Momentum? by Intron · · Score: 2, Funny

    The article totally fails to take Lorentz-FitzGerald Contraction into account.

    --
    Intron: the portion of DNA which expresses nothing useful.
  14. cult of personality by Anonymous Coward · · Score: 4, Insightful

    This is were Google truely dominates, in cult of personality.

    It seems more and more when I try to find something on google all I get are a bunch of link farms. This morning I was trying to find a bike jersey for a friend of mine and on the first page of results, it took getting to the second page to find any actual results. I did much better using Yahoo and found what I was looking for on the first page of results.

    I speak of the same experience in another thread. Simple fact is, Google appears to be doing nothing to make searching truely better. I give them some benefit of the doubt by saying "appears" since they may have "the next great thing" in the wings and we just haven't seen it yet. But lets face it, searching today is a time wasting experience. What's worse is that several years ago, it was time wasting due to the large number of seemingly random links. Well, those links aren't so random anymore, AAMOF, those links are geared to simply make Google money. I've said it before and I'll say it again, anyone who buys into their "do no evil" marketing fluff can give me a ring, because I have a nice bridge to sell them. How are they any different that M$ in this regard? They have tons of money, some of the brightest minds in the valley, and yet a simply search for a product gives me pages of utter crap. Hell, if a manufacturer makes the mistake of naming their company after their flagship product, you won't even be able to find their main website in a search until the third page!?! But Google still collects their bucks. How is this helping? How is this not evil?

    Anyway, off my soap box. Here's to hoping someone can come along and actually do some good here (and yes, it may even be Google themselves).

  15. obligatory SW parody by oni · · Score: 2, Funny

    Obwan: that search engine is our last hope
    Yoda: no, there is another

    (later)

    Yoda: google... page rank is strong with you... pass on what you have indexed... there... is... another... search engi(ugh)

  16. velocity, acceleration? by MADnificent · · Score: 2, Funny
    Shouldn't it have to be:
    m = mass -> $$$
    v = velocity -> users
    a = acceleration -> growth

    This way you can see how much energy a company has, and thus it's importance (m*v*v/2), growth can end *very* fast.
    The energy a company gains is m*a*v, that shows that google is the best growing company...

    etc.
  17. Re:Resting on Laurels by teknomage1 · · Score: 2, Insightful

    Simple for Ask, experts are those that best further the marketing goals of their backers.

    --
    Stop intellectual property from infringing on me
  18. the climbing ask.com by op12 · · Score: 2, Informative

    About a month back I did a comparison of some searches, based on ask's claims of understanding concepts. They were able to outperform Google in this functionality: http://www.nirajsanghvi.com/stories.php?sid=318&ti d=55

  19. The competition will never be over by blueZ3 · · Score: 2, Insightful

    and that's a good thing.

    The barrier to entry in search is moderately high (you have to be able to afford the hardware to do your indexing) but there will always be people willing to invest in search. It's easily monetized (love that word, eh?), there's no cost for users to switch to new competitive product, and there's no magic bullet that gives you both accurate results and the ability to weed out aggregators and shady SEOs. As long as developers can come up with new search algorithms that give better results, there's the chance that the "next Google" could be launched.

    I like and use Google, but that's because the results are usually valuable to me and the ads are minimally intrusive. Currently, the one issue I have is Google's inability to prevent aggregators from showing up in search results. I've never found anything useful through aggregator pages, and I'd like Google to filter them out.

    Anyway, the ability of new companies to explore search is something that's good for SE users. New search startups can be launched and attempt to improve search. Google is forced to innovate. Where's the downside?

    --
    Interested in a Flash-based MAME front end? Visit mame.danzbb.com
  20. Where is my 'remove this site' button? by Unxmaal · · Score: 4, Informative

    At some point within the last few months, Google removed the "Remove this site from search results" button. I made heavy use of it, perma-banning resultspammer sites like ExpertSexchange.com and its ilk.

    Sadly, this button is now gone from Personalized Search, and the resultspammer sites are steadily reducing Google's usefulness to me. Where I could once search for specific tech terms and get a good batch of reference resources, now I'm getting junk portal pages for the top five results.

    Sure, I could report a link as spam, but that's a lot more time-consuming than the button, and it doesn't appear to have any immediate results for my searches.

    This makes me sad. I've loved Google since I first met her, but I can't be with her if she's going to continue mainlining spam.

    --
    http://unxmaal.com
  21. Why Google? by urdine · · Score: 2, Interesting

    The reason everyone still uses Google isn't because they have the best ranking of results anymore, which is usually encrusted with spam sites designed to beat Google, but because they have the most COMPLETE results. When you search for something rare, Google most likely will return results no other basic engine has. So people have gotten accustomed to checking Google first out of habit more than anything else.

    To me, I think the future of search isn't necessarily a better Google, but something different. The problem with Google is the same as its strength - its simplicity. There is very little control on Google for more complicated searches, such as searching only company websites, or searching only encyclopedia content. It's just a big kludge for them to add stuff like travel info or weather or movie info without knowing the intent of the searcher beforehand. Searchers have to get savvier, not just the algorithms. I think search aggregating sites like Seaurch.com which has 200 engines but still uses a simple interface, is a great idea. Sites like Clusty.com also take an interesting approach towards understanding the searcher's intentions.

  22. Re:Resting on Laurels by billstr78 · · Score: 2, Informative

    There are no actually subject specific "experts" at Ask. They use computer algorithms which are generic enough to work with any subject group, not just ones that a single person might know a bit about. The term "Expert" is figurative and not literal.

  23. Re:cult of personality - antispam search tips.... by iamcf13 · · Score: 2, Insightful

    The parent AC is correct.

    Big business turned the Internet (the WWW part principaly) into little more than 'online TV'. Because of all the $$$ at stake, we have probably the best search engine around, Google, drowning in ad driven/cash driven search engine spam ('spamdexing').

    If you are searching for something in Google, add site:.edu to your search. By doing that. that should lead you to .edu sites who are there principally to educate rather than sell you something. If you have to do general search, add
    -shipping -visa -mc -amex and the like to your search terms to block the 'ad pages'

    Good Luck!

    P.S. The best search engine would just index the single homepage page served up by the webserver at all 4,294,967,296 possible IPv4 addresses (minus the reserved/private/unused ones). The rationale is that if you are paying for webspace and a unique IP address, chances are good you have worthwhile content there. This would eliminate 'spamdexing' in all its forms in one bold stroke. The drawback is that 'online communities' would only be listed by the main IP webserver address and not by the URL which is why Google is in the 'mess' they are in.

    If you are doing worthwhile research and want to avoid as much e-commerce as possible in the process, here again are some helpful tips....

    add site:.edu in Google searches (mentioned earlier - restrict search to non-profit educational sites)

    add -shipping -visa -mc -amex and the like to Google searches (mentioned earlier - helps eliminate product 'ad pages' and leave behind product review pages

    search well-known 'info sites' like http://citeseer.ist.psu.edu/ or http://www.wikipedia.org/ If you can't find the info you want at those two places for free, the content you seek is either not on the Web or (more likely) you have to pay for it to get access. We all know how Net-savvy people hate to pay for anything they find on the Web....

    As a last resort, search Google's Usenet interface at http://groups.google.com/ Helpful info can be found there but be prepared to really dig for it! =/