Slashdot Mirror


Could IBM's Watson Put Google In Jeopardy?

theodp writes "Over at Wired, Vashant Dhar poses a provocative question: What If IBM's Watson Dethroned the King of Search? 'If IBM did search,' Dhar writes, 'Watson would do much better than Google on the tough problems and they could still resort to a simple PageRank-like algorithm as a last resort. Which means there would be no reason for anyone to start their searches on Google. All the search traffic that makes Google seemingly invincible now could begin to shrink over time.' Mixing supercomputers with a scalable architecture of massive amounts of simple processors and storage, Dhar surmises, would provide a formidable combination of a machine that can remember, know, and think. And because the costs of switching from Google search would not be prohibitive for most, the company is much more vulnerable to disruption. 'The only question,' Dhar concludes, 'is whether it [IBM] wants to try and dethrone Google from its perch. That's one answer Watson can't provide.'"

48 of 274 comments (clear)

  1. Google, really? by nospam007 · · Score: 4, Funny

    Watson? Jeopardy?

    Does that mean we have to enter the answer an he gives us the question?

    42?

    1. Re:Google, really? by Samantha+Wright · · Score: 3, Funny

      More importantly, was this headline conceived specifically to foil machine-learning methods for inferring meaning from context? This seems profound in its poignancy.

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    2. Re:Google, really? by davester666 · · Score: 5, Insightful

      Of course Watson will be able to do this.

      Obviously, Dhar did a bunch of research, and determined that even though it took a massively powerful computer to answer one question at a time that has a predetermined single answer, over several seconds, it can trivially scale to support millions of simultaneous queries, which may have zero, one or multiple answers.

      --
      Sleep your way to a whiter smile...date a dentist!
    3. Re:Google, really? by RenderSeven · · Score: 2

      Watson? Jeopardy?

      Jeopardy, as in "I'll try 'Silly Unsubstantiated Conjectures' for $400"

    4. Re:Google, really? by Samantha+Wright · · Score: 2

      If you remember the blatant IBM marketing material that was interspersed with the Jeopardy episodes, they had planned from the start for Watson to be scalable (one of the first applications was medical literature search)—and if you remember the episodes themselves, you'd realise that it's a tad silly to suggest it couldn't handle cases with zero or multiple answers, since it performs Bayesian reasoning on a huge pool of possible hits and simply announced the best one.

      But Watson isn't, wasn't, and would never make sense as a simple search engine like Google; it's more like Ask Jeeves in its intended use; the internet is its source text. And do keep in mind that TFS even suggests performing a Google-style search as a fallback mode. (You did actually read something about this before posting, right?)

      --
      Bio questions? Ask me to start a Q&A journal. Computer analogies available for most topics!
    5. Re:Google, really? by oodaloop · · Score: 2

      In all fairness, Watson has been used in medical contexts where it ranks its suggested diagnoses and treatments (multiple answers, not just one right answer). It also runs on 1 server now, instead of 75. But no, I don't think it's going to scale up to beat Google anytime soon. If/when IBM says they are doing so, that's another story.

      --
      Tic-Tac-Toe, Global Thermonuclear War, and relationships all have the same winning move.
    6. Re:Google, really? by Anonymous Coward · · Score: 2, Insightful

      Even if they can scale Watson up 1 billion times, that would solve nothing. Searching documents on the Internet is not that hard, in fact it's largely speaking a solved problem; especially when you add website browsing data (Google toolbar etc.) and click rates on the SERP to the traditional page-rank information derived from the link structure.

      What is fiendishly hard is maintaining good relevance of your results in the face of a human, adapting and immensely rich adversary: the SEO industry. In that regard, Google algorithms are less about science and more about proprietary voodoo that only work if no one knows how they work. Kind of like the stock market, the search industry is infinitely reflexive and you can make a profit if you know what everybody thinks everybody else is thinking (and so on).

      It would take just about a week for the SEO industry to find out what makes Watson tick and compel him to spew Viagra links for any search query.

    7. Re:Google, really? by aaronb1138 · · Score: 4, Interesting

      It would make more sense to make a Page-rank (Google style) search the default and make an intelligent Watson answer system a premium micro-payment based ($2-20 yearly for 1k queries or so). Then use to Watson derived answers to boost the page-rank result quality.

      This is more or less, what Wolfram is already trying to sell, though their parsing and indexing engine is weak compared to Watson and Google respectively.

    8. Re:Google, really? by Aighearach · · Score: 2

      If we assume that google would be able to match them server for server, there is no reason to believe that Watson is any better in the aggregate. We only know about how it scales vertically. Just because Watson is better at Jeopardy and medical research doesn't mean they'd even be competitive at internet search; unless this is a thought experiment where we pretend IBM spends all their money on supercomputers for search, and Google doesn't change or respond.

    9. Re:Google, really? by omnichad · · Score: 2

      one of the first applications was medical literature search

      And IBM being IBM, they would probably not venture into advertising-supported search. They'd rather have some other risk-taker pay them to implement it. Or have every person that wants to do this sort of search pay them for their own private setup.

    10. Re:Google, really? by Albanach · · Score: 3, Interesting

      Medical literature is a tiny world - much smaller even than the one that shapes Jeopardy questions. Medical journals are numbered in what, the hundreds? Many don't even publish monthly. I'd be willing to bet Google indexes more new content in a minute than there is new medical literature in a year.

      On top of that you have a market for the research that can and will pay per query. Combine revenue per query with a small world to search and you can dedicate an enormous amount of processing power and time compared to that which Google can offer.

    11. Re:Google, really? by Anonymous Coward · · Score: 3, Funny

      Dude, like it or not, Watson requires lots of costly hardware and power to operate. If you want to give the masses access to that power, they're either going to pay for it, or he's going to recommend you drink Coca Cola (tm) to cure your AIDS.

    12. Re:Google, really? by theshowmecanuck · · Score: 2

      First they will try to figure out how to offshore it while getting the U.S. government to give them more tax breaks while doing so.

      --
      -- I ignore anonymous replies to my comments and postings.
    13. Re:Google, really? by Jeremiah+Cornelius · · Score: 2

      Good morning, Professor Forbin...

      --
      "Flyin' in just a sweet place,
      Never been known to fail..."
    14. Re:Google, really? by dimeglio · · Score: 2

      The difference being that you can have a conversation with Watson. Not so with Google. Google takes you to information. Watson provides information.

      --
      Views expressed do not necessarily reflect those of the author.
    15. Re:Google, really? by pepty · · Score: 2

      Searching isn't as hard (anymore), interpretation (what Watson aims to do) still is. Given the right query using symptoms and lab test results as keywords Google will give you a list of articles and websites, some of which will have information on the correct differential diagnosis protocols to use to come up with a diagnosis. Watson will give you a ranked list of diagnoses and the evidence for each and/or further tests to run. I agree with you about Watson's vulnerabilities: Watson was raised on a pretty well behaved neighborhood of the internet.

  2. References? by Anonymous Coward · · Score: 4, Insightful

    The difference is, that Google does not tell the answer. It just gives you a link to the answer. So if the answer is wrong, you cannot blame Google.
    What about Watson?

    Vajk

    ps: also the last thing I would say about pagerank is being simple

    1. Re:References? by mcgrew · · Score: 2

      The difference is, that Google does not tell the answer.

      It does on an android. Ask it the current temperature and it will tell you -- in centigrade, which is pretty dumb since it knows I'm in Illinois and we use Fahrenheit for air temperature here. You have to specify Fahrenheit. When they had the floods in Colorado I pulled out my phone and asked the elevation of Colorado Springs and it told me. I asked it how far it was to Bellville and it said "94 miles" which surprised me; I was expecting kilometers since it answers temperature with Centigrade.

      It has trouble with "fur lined gloves", it thinks you're saying "for lined gloves" with laughable results.

    2. Re:References? by h4rr4r · · Score: 3, Insightful

      Save for that fact that it makes sense. It is based on water a common material we are all familiar with and uses a nice 0-100 scale.

      Yeah, other than that there is nothing better about it.

    3. Re:References? by Grishnakh · · Score: 3, Insightful

      Yes, and when do you ever see outdoor temperatures over 50 or 60 on the Celcius scale? (never) And when do you see negative temperatures on the Celsius scale (all the time unless you live in a hot area)?

      Fahrenheit has better resolution and scale for human temperatures. If it's over 100 or under 0, the weather is "extreme". Not so with Celsius. And it has roughly double the resolution.

    4. Re:References? by beatljuice · · Score: 2

      While I agree the metric system is almost always better, I disagree in this instance. Celsius is much less granular. If I set my thermostat (granted, it's a cheep one) using Celsius it will keep the correct temperature, but I'm much more likely to be hot or cold because there can be as much as a 3 degree (Fahrenheit) difference. within the Celsius measure. Yes, thermostats can be designed that use fractions (most probably are) but mine's not.

      --
      Look for a reason to smile you jaded #*^ *(%$
    5. Re:References? by h4rr4r · · Score: 2

      You realize that the temperature can have decimal places added right?

      Farenheit like the mile needs to go.

    6. Re:References? by Pope · · Score: 2

      Every digital thermostat I've ever used had 0.5 degree increments for Celsius.

        I seriously doubt the majority uses theirs to move temperature by 1 degree increments in either scale.

      --
      It doesn't mean much now, it's built for the future.
    7. Re:References? by Arcanazar · · Score: 2

      As a Canadian, the Celcius scale has a number of useful quantities:

      • - Where I live, the temperature ranges from approximately -40 C to 40 C (Winter to Summer). Essentially, temperatures further from 0 are more extreme.
      • - Room temperature is about 20 C (although some may prefer up to about 25 C). This is a nice round number to remember.
      • - If I'm driving and the temperature is near 0 C then road condition is typically poor (slush or water on top of ice).

      During my day-to-day activities, many of the temperature numbers that I use are convenient. YMMV.

  3. License tech to Google by HockeyPuck · · Score: 4, Interesting

    It's probably much more profitable for IBM to license the technology to Google/Yahoo/MSFT/whoever than it would be for IBM to build search infrastructure.

  4. Silly question by mcgrew · · Score: 3, Insightful

    Of course not. It's an IBM machine, they'll sell Google as many as Google wants to buy. Of course, so can Microsoft but they don't have a good track record at all.

    I wish Google did have a Watson. This morning I asked my Android "where can I buy a good pair of fur-lined leather gloves" and it thought I said "where can I buy a good pair of for lined leather gloves" and returned no useful results at all. The programmer was a southerner, I guess? "How much does them go fer?"

    Amazing what it does get right, but Google, buy a few Watsons!

    1. Re:Silly question by Merk42 · · Score: 2

      That has to do with speech recognition, something Watson (at least at the time of the Jeopardy filming) didn't have at all.

    2. Re:Silly question by H0p313ss · · Score: 3, Funny

      Tried this with Siri,not only did it find people selling fur-lined leather gloves, it also found photos of models wearing skimpy dresses with gloves on.

      I'm sure there's a deeper meaning here.

      --
      XML is a known as a key material required to create SMD: Software of Mass Destruction
  5. Better searches no good if they're too slow by jandrese · · Score: 4, Insightful

    Watson was a supercomputer answering basically one question at a time. You can't apply that level of compute time to every single query without bankrupting yourself on hardware costs. With time computer cycles will become cheaper and this will be more realistic, but today's technology just isn't there.

    --

    I read the internet for the articles.
    1. Re:Better searches no good if they're too slow by arth1 · · Score: 2

      Indeed. Watson cannot answer millions of queries per second.
      Google is deliberately simple so it can.

    2. Re:Better searches no good if they're too slow by Trepidity · · Score: 5, Informative

      It's also a completely different problem from information retrieval in a messy domain like "all documents on the internet". Watson is built mainly out of more structured data: dictionaries, almanacs, atlases, Wikipedia infoboxes, etc. It turns this into a huge database of knowledge, and then does inference on that database to try to answer Jeopardy-style questions posed in natural language. But this doesn't even try to tackle the other side of the natural language problem, which is parsing not only a natural-language query, but the entire contents of the internet.

      In short, Watson might compete in the Wolfram Alpha space, of retrieving structured knowledge from databases, but not, at least not without a major overhaul, in the general document search space.

    3. Re:Better searches no good if they're too slow by swillden · · Score: 2, Interesting

      And you can be certain that Google is working on their own 'Watson' tech, so when the hardware is ready, they'll be able to do it.

      Not just working on... has deployed, in a small way, to the degree that current capabilities can support on a massive scale.

      That's what Google's Knowledge Graph work is about, and its work on natural language processing of search queries (including spoken queries). Google web search doesn't have Watson-level understanding, yet, but it has already moved well beyond the string matching + page ranking that the article supposes and is continuing to progress. Indeed, that progress is the source of many of the complaints about Google search here on slashdot. It has gotten much smarter, which makes it more effective in general, but means that it's less effective when what you want is a simple string search (though you can turn on verbatim mode to fix that).

      (Disclaimer: I work for Google but not on anything related to this article or thread.)

      --
      Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
  6. Is this what Wired does now? by MikeTheGreat · · Score: 3, Informative

    Well, I guess I'm glad to see it's not just /. that's past it's prime :/
    I mean, seriously - "What is someone else made a better search engine? ALL TRAFFIC WOULD GO THERE AND GOOGLE WOULD DIE" just seems so.... speculative.
    (Maybe Wired has added a Creative Writing section since I last read it?)

  7. No by Anonymous Coward · · Score: 2, Informative
  8. Sick of 'smart' searches by RandomUsername99 · · Score: 5, Insightful

    I'm getting a bit sick of 'smart' searches... or, rather, not being able to disable the 'smartness.' More often than not, I really don't want a search engine making assumptions about what I meant, rather than just taking what I enter completely literally, and I *never* want it to insert results that don't contain all of my search terms because it scored exponentially better with the other items in the query. Chances are, I added in the term they were ignoring, specifically, to drastically reduce the number of results I got, because I wanted to *narrow it down*.

    Maybe I'm a curmudgeon, but I would rather tweak the search to narrow down crap results than try to outsmart the 'smartness' any day of the week. I understand that this isn't necessarily what John Q. Internetuser is looking for in search, but at least having the option there would be a big help. Google used to have a very straightforward syntax to help you modify your search results in specific, predictable ways... while much of that syntax is still valid in google searches, now it seems like everything can be arbitrarily overridden by what google thinks you 'should' have meant, rather than what you told it you meant. Very frustrating.

    1. Re:Sick of 'smart' searches by 93+Escort+Wagon · · Score: 2

      Try DuckDuckGo.

      --
      #DeleteChrome
    2. Re:Sick of 'smart' searches by squiggleslash · · Score: 2

      True story. Not so long ago (interestingly it appears to be fixed now), I did a Google search for CP/M submit (I was trying to remember whether submit was the command to run batch files in CP/M, for some pathetic joke that's lost in history. Anyway...)

      The list of results was apparently enormous, but none of the pages that initially came up had anything whatsoever to do with CP/M. There was nothing to suggest they'd changed my search parameters in the usual annoying way (no "Searching for "Something really common", click here to search for the obscure thing you actually were trying to find stuff about" message), indeed there weren't even any suggestions. Then I looked a little closer, and noticed that the word "COM" was highlighted in each search excerpt. Usually like this: "Click here to get a subscription for 99c to nytimes.com"

      Google had, bizarrely, changed, without even bothing to tell me, "CP/M submit" into "COM". And needless to say, with almost EVERY PAGE ON THE INTERNET containing the word "COM", usually after a dot, which in turn would follow the NAME OF THE SITE, Google had found a kagillion (or maybe a googleplex?) of search results.

      They've fixed it now. Indeed, it even comes up with relevent search results on the first page, which is a fair miracle, but I'm still trying to work out the logic that would have Google's code convert C, P, a slash, M, and the word "Submit" into "COM". And to change this search without thinking it's a big enough change that, you know, maybe it's worth warning the user and providing them with a link to a more literal search.

      I keep switching to Bing. I keep switching back because Bing is just as bad, and Google's the default setting for the three search boxes my browser provides me with whenever I open a new tab.

      --
      You are not alone. This is not normal. None of this is normal.
  9. Adventures in surmising by Empiric · · Score: 2

    Which means there would be no reason for anyone to start their searches on Google.

    Sure there is. Start with what led Google to dominate search in the first place--interface minimalism. Google has become very good at returning results based on a minimal number of keywords about the desired topic; forming a question around the topic of interest is slowing one down in terms of keystrokes. And, generally, an answer to a specific question is not at all what one actually wants. What is sought is sources of information about a topic, for which a listing of highly-relevant links is superior to a single "the" answer. Is there anyone who doesn't type topic keywords over literal specific questions at, at minimum, a 20-to-1 ratio? No one I know.

    ... Dhar surmises, would provide a formidable combination of a machine that can remember, know, and think.

    But to simplify the issue, we have this. "Surmising" that Watson, or any known technology, can do any of the above disqualifies him from any commentary on any such technological issue or endeavor.

    --
    ~ Whence do you come, slayer of men, or where are you going, conqueror of space?
  10. You don't understand Google by T.E.D. · · Score: 4, Insightful

    This betrays a very basic misunderstaning about how Google got where it is, and how it stays there.

    Yes, pagerank is a great idea, and it was perhaps an improvement over what was being done before. But that wasn't why people abandoned the likes of Lycos and Yahoo(!) for Google back in the late 90's. Back then all the other search engines had gone to practices that were quite frankly user-abusive. Adds were placed all over the place, including an indeterminate amount of the top hits on your search. The search screens themselves also existed mostly to pump ads at you, and were really clunky, with a large amount of confusing options right there on the main search page.

    Google, by contrast, had a main search page with no options whatsoever. Just a text box and a couple of buttons. "Breath of fresh air" doesn't even begin to describe how wonderful to use this was compared to what we were used to. On top of that, the search results were clearly delineated from the ads, so you could trust the results. The "don't be evil" motto was obviously infused into the whole effort. Every competitor was just a giagantic pain to use by comparison. "Page rank" or whatever wifty algorithim used for all this was something that nobody but extreme techies (and marketers) really ever gave a crap about.

    So if you've got something that you think competes with Google, you'd better be talking about how nice and clean the interface is by comparison, how much easier it is to find real results without having to wade around ads, and how trustworthy the provider is wrt not allowing marketing weasels to buy their way into my search results. If you aren't talking about any of that, frankly nobody gives a crap.

    1. Re:You don't understand Google by mbone · · Score: 2

      You do realize that those days are long gone?

      I don't normally use google for search, but I fired it up and entered "fur lined leather gloves" (see above) as a trial.

      On the first page, "above the fold" (i.e., what I can see without scrolling), there are

      11 ads
      1 dialog box for some new feature I don't care about and
      1 actual search result, for Amazon.

      As I am aware that Amazon is a company that sells many different things, and as ads are not search,
      Google search actually returned nothing interesting to me at all. As always, YMMV, but this seems entirely typical to me in the new world of search.

    2. Re:You don't understand Google by omnichad · · Score: 4, Informative

      No. It was poor results from other engines. I seriously don't think you remember how bad search results were in the late 90's. It was like a light was switched on and suddenly the best of the Internet was illuminated. Before, AltaVista barely cut it. It was far too easy to game the simple keyword system.

    3. Re:You don't understand Google by mythosaz · · Score: 2

      This is perspective.

      Google isn't a search engine, finding places for you to surf to. It's already an ANSWER engine bringing you the results that you want.

      "Fur lined gloves" returns for me two yellow sponsored ads, and another ad block with five options for buying fur lined gloves. ...but there's a good chance that's what someone typing FLG wants -- they want to buy some gloves, so here's some gloves for sale.

      Searches for movies don't link immediately to movie reviews or quotable quotes. They first get you movie tickets, because nerds aside, that's what most people are actually looking for.

    4. Re:You don't understand Google by SleazyRidr · · Score: 2

      How badly did you want those fur lined gloves? You've posted about them at least three times, so you must be pretty mad about it. Imagine how people used to get their fur lined gloves before we had the internet...

  11. Re:Except this is IBM by mbone · · Score: 2

    And, if you malform a search string, you will get an error along the lines of

    AEKJ6952 : Illegal Register Overflow

  12. not with net neutrality by globaljustin · · Score: 2

    Watson will never displace Google, because of agendas, not technology.Google will spy on the people. Watson will manage their placement with FEMA.

    This is not so when the internet (including browsers & carriers) is open and interoperable.

    Right now, switching from Google to Duck Duck Go, Bing or Watson is litterally one click on most browsers.

    I may agree that IBM **probably** won't develop Watson in this way, but they very well could and it would work as long as they gave users marginally better privacy with the same features.

    The fact is, in a free market it is a CERTAINTY that someone will eventually exploit Google's vulnerabilities.

    Any company can use their capital to lobby politicians to make laws favorable to their revenue stream...but in open competition, Google is highly vulnerable.

    Facebook.com moreso :D

    --
    Thank you Dave Raggett
  13. What search is, and PageRank by brunes69 · · Score: 2

    A couple of points.

    First of all, RE PageRank... If people think PageRank is still all Google uses to process search results, they are living in a reality distortion bubble. PageRank is 20 years old, and everyone knows how it works. Google's algorithm relies on a lot more than PageRank, in fact PageRank is probably a very minor factor nowadays in what decides the top 50 results of a search.

    Second, search is about a lot more than answering questions, and this is what I think people still don't fully understand about why Google is what it is and why they own this field so much. Search is just as much about FINDING the actual question, as asking the question itself. When people go to Google they often don't actually KNOW the question yet, all they have is something they want to know about. The real questions come later.

    1. Re:What search is, and PageRank by Sardaukar86 · · Score: 2

      When people go to Google they often don't actually KNOW the question yet, all they have is something they want to know about. The real questions come later.

      Well spotted - this is the sharp edge of Google's usefulness for me but I doubt I'd have ever put my finger on it, yet it was there all along. With that in mind I'm chuckling that one of my favourite quotes hasn't been correct for a long time now:

      Computers are useless. They can only give you answers.
      Pablo Picasso

      --
      ..Mullah or Pope, Preacher or Poet, who was it wrote: "Give any one species too much rope and they'll fuck it up"?
  14. Outsourcing their specialty would be stupid by raymorris · · Score: 2

    > I guess Google doesn't want to license the technology, they want to own it.

    Absolutely. They can license or outsource payroll software, backup systems, communications - anything that is similar to what other companies have. Outsourcing SEARCH, the thing that makes them special, would be making themselves into an easily replaceable retailer for IBM.

    Schwinn learned that the hard way. They were the dominant bicycle company. First they outsourced the manufacturing of parts, and that was fine. They designed and built bicycles, not gears. Then they outsourced assembly, and that worked okay - dealers had been doing some of the assembly for a long time anyway. When they taught their Chinese partner Schwinn's design secrets, they made themselves irrelevant. The company they outsourced too, Giant, just started selling the bikes without involving Schwinn.