Slashdot Mirror


Google's Search Copying Accusation Called 'Silly'

itwbennett writes "Google's Bing sting, reported in Slashdot just days ago and subsequently denied by Microsoft, is now being called 'silly' and 'petty' by search industry analysts and execs. The reason: it would be impossible for Microsoft to use the copied results to reverse engineer Google's search algorithms. And in fact it is more likely that Microsoft was conducting competitive research. Charlene Li, founder of technology research and advisory firm Altimeter Group, saw Google's actions as a misguided response to a real threat from a competitor in its core search business. 'Google isn't used to having competition. You look at this incident and you wonder why they are doing this. It feels amateurish in a way, a kind of 'they're not playing fair' attitude,' she said."

14 of 380 comments (clear)

  1. Seriously? by Anonymous Coward · · Score: 5, Insightful

    They don't have to copy an algorithm if they are just copying search results. This response is amateur.

    1. Re:Seriously? by ILuvRamen · · Score: 5, Insightful

      I agree totally. What "research" includes looking for an already searched term on Google and then looking at what results come up...then slapping them into your own live result list for the general public? Bing's cheap algorithm is some search and crawling technology from like 2007 mixed with marketing, marketing, MARKETING! Oh, and flashy features that don't really work. So it's not that shocking that they're ripping off other people's results because their product is pretty hollow to begin with.

      --
      Google's Super Secret Search Algorithm: SELECT @search_results FROM internet WHERE @search_results = 'good'
    2. Re:Seriously? by Korin43 · · Score: 5, Insightful

      Hey, looking at the test next to mine isn't cheating. It's not like I could reverse-engineer the other students algorithm by looking at his test!

    3. Re:Seriously? by NeutronCowboy · · Score: 4, Insightful

      No kidding. I'm used to nonsense from "industry analysts", but this takes the cake. It's a complete non-sequitur. This never was a question of reverse-engineering. It's a question of straight-up ripping off results.

      On a related note, what's with all the Google-bashing recently? First the idea (which has now turned into a meme) that Google's search result are not the gold standard for search anymore, and now the idea (probably soon to be turned into a meme) that Google can't handle competition and is resorting to FUD?

      Yes, Google is no saint, it's not perfect. No shit, Sherlock. But if all I did was read "industry analysts" and various websites, I'd think that Google was about to fall apart, what with search sucking and all other products completely falling flat on their face. There's either a general search for the same story going on (Look Ma! I broke the news of Google sucking first!), or some grade A bullshitting is taking place.

      --
      Those who can, do. Those who can't, sue.
    4. Re:Seriously? by Maestro4k · · Score: 3, Insightful

      They don't have to copy an algorithm if they are just copying search results. This response is amateur.

      You can certainly make the case that Google setting up the "sting" operation was "silly", or "petty", but Microsoft's response to the whole thing has been quite enlightening. I think it's Microsoft that's got issues with having a real competitor, and it shows. Google's kinda just rubbing salt into the wounds, which isn't very professional, but MS needs to respond better. Trying to deny it, and at the same time accuse Google of committing "click fraud" to setup the sting (something which has a very specific meaning that's mostly criminal and has not a damn thing to do with Google's "sting" operation) comes across as... desperate at best.

      Personally I think the whole thing is silly on both sides, but MS's response has done a lot to wipe out the little bit of trust they'd gained in past years for behaving somewhat better. MS's response, and not the whole "sting", is making me even less likely to use Bing in the future as well. Both of these are outcomes I suspect MS didn't want to cause with their reaction. In a nutshell, Google won this little fight when MS started responding with denials and attempts to make Google look like they'd done criminal stuff.

    5. Re:Seriously? by ThePromenader · · Score: 4, Insightful

      I also agree - Bing is cheating. Never mind Google, they're second-sourcing ~everyone's~ results without giving them credit.

      Every search engine has its own search methods and data-parsing algorithms (down to the lowest in-site-search php code), and it is these algorithms that provide the 'top results' that bing toolbar (and/or IE) users are clicking on. Never mind the Bing toolbar user; what if the owner/creator of a search engine doesn't want any data generated by it to be sent to Bing - where does ~he~ opt out of MS' data-sculling program?

      Bing's tactics are distasteful for many reasons, but mainly a) because they exploit (toolbar) users to scull data from competitors and b) because Bing uses this data to provide 'top results' that it obviously values above those provided by its own algorithm. This is borderline - if not outright - industrial espionage.

      --

      No, no sig. Really.

      ThePromenader
    6. Re:Seriously? by uglyduckling · · Score: 3, Insightful

      No, you totally missed the point.

      User A types some words into a text box, then clicks on a link that takes it to a different domain. Toolbar B records those words, together with the destination URL of that link that was clicked, and uses those words to slightly bias the results of search engine C.

      Toolbar B does this for every website that user A visits with the express permission of user A.

      The owners of search engine D get annoyed, because when they deliberately insert completely unique [text string -> URL] mappings to their search engine, and have their engineers click on those links, it shows up in search engine C.

      Note that the only reason search engine D were aware of this in the first place is because bizarre mis-spellings of words in their search engine later turned up in search engine C.

      So, there is a logical way to connect page 3 with content X - someone at some point entered X into a text box, then clicked on a link that lead to page 3. In this case it was some Google engineers. The issue here is that Google (along with a lot of Slashdot posters) are thinking of the web in a static sense: 'how could X possibly link to page 3??' - Bing stole that data. Microsoft are dynamically looking at what users do, and the text string -> URL -> click interaction is seen as a relationship between a phrase and a page that they want to take into account with their search results.

    7. Re:Seriously? by malkavian · · Score: 3, Insightful

      The problem, of course, being copyright, and claiming work as their own.
      Google create a false entry, accessible only through their own site. This is a work that is intended only to determine whether someone is actually stealing their results (i.e. taking those results, and passing them off as MS's own).
      By all means, index non-search sites. That's what search engines are for, but you can't possibly convince me that Microsoft didn't know they were looking at Google's search results.
      That really is akin to writing a dictionary by seeing what people read, then saying "Well, lots of people read this other dictionary, so I'll just lift entries verbatim from it, and claim they are my own"..
      Yes, search engine tweaking is a very fine art.. It's easy to pick up the wrong signal by mistake. if MS had confessed, and said "Ooops, programming/design error in our browser, this is how it happened, and we're now going to remove all search engine sites from our allowed input", weight of opinion may have been behind them more, rather than blithely saying "It's all Google's fault we're ripping them off".
      The root of this is that they're building a dictionary by directly reading a competing dictionary. This isn't creating a diverse, resilient ecosystem. It's parasitism.
      Everyone screws up, and things always go wrong. That's a fact of life. What isn't a fact is that strange need to point fingers and say "It's everyone else's fault but mine". Especially when it blatantly is your fault.

    8. Re:Seriously? by horza · · Score: 4, Insightful

      With tweets like "At Microsoft Productivity Council mtg on future of Office" and worse "Ribbon Hero which teaches how to use MSFT Office better. Making work (gasp!) fun", Charlene Li is obviously blatently dishonest in her representation of her position.

      "Charlene Li, founder of technology research and advisory firm Altimeter Group" - and as sortius_nod says, now paid shill.

      Phillip.

    9. Re:Seriously? by nstlgc · · Score: 4, Insightful

      First of all, I don't think Google has a copyright on any of the content they index, do they?

      Second, as I understand it, clickstream data points are only a small part of the equation. Notice how Google could only reproduce this by using totally bogus keywords, ensuring that the data they fed to Bing through the toolbar were the only data points being considered for those keywords?

      Bing tracks when users search for something, and what sites they visit as a result. I'd almost be offended intellectually if this was not part of their game to provide me with better search results.

      Disclaimer: I use Google almost exclusively. Bing can suck it, but this debate is ridiculously biased.

      --
      I'm Rocco. I'm the +5 Funny man.
  2. "Competitive Research" by Anonymous Coward · · Score: 5, Insightful

    I don't see this phrase going down well in any other industry. If you copy a map or a book or the design for a car from a different company in the same field, you wouldn't get out of it by calling it "competitive research". Microsoft doesn't need to reverse engineer google's algorithm if they can just steal their results directly; in fact, it's simpler this way because it cuts out the middle part where they even bother to figure out how it works.

    1. Re:"Competitive Research" by grantek · · Score: 4, Insightful

      It's not "research" if the leeched data appears on your production site automatically and without review...

  3. Re:Microsoft is responding with misdirection by martin-boundary · · Score: 3, Insightful

    They seem to be dancing around the core charge of copying what were nonsensical search results that, if not copied from Google, should not have returned any results.

    Uhm, how would that work, exactly?

    Let's say you have a search engine toolbar that looks over a user's shoulder to see what webpages they go to. Presumably, the links that leave those web pages carry information on said user's interests (eg if the user reads slashdot, then the links point to things like other people's comments, and also the site which carries TFA, etc). So the text of that page and the links would be automatically connected by the search engine.

    Now if a user goes on a webpage that happens to be a google results webpage, then the links on that webpage will be search results. If one user types in a weird query, then the toolbar will think that user likes those kinds of weird queries, and maybe that other people would like those, too.

    So when another user now types exactly the same query to prove the "sting", then the search engine will think it has found another user who likes weird queries, no? So it should show the connections it has learned from the previous webpage.

  4. Re:I think this article says everything... by hellop2 · · Score: 3, Insightful

    Are you serious? Clickstream, Surfstream, Searchstream. Fancy words for a keylogger. From your link:

    “We’re not copying but watching users,” Shum said.
    Weitz added, “The word ‘copy’ has a very specific connotation, and it’s wrong. We get the clickstream. We’re going to see it. We may choose to show it or not.”

    It doesn't matter if you call it watching instead of copying. It's still copying. Bing shouldn't be "watching" google's results, or "copying" the user's click behavior. That's like google's trade secrets. An analogy would be an online newspaper who copies articles verbatim from a competitor, and then justifying it by saying, "We didn't copy the article, we just monitored the user's eyestream and discovered this article. But it's ok because we copy everybody's articles."

    From your link: "Bing can also examine how people click on its own results that it lists in response to that search." No shit? It's like it's listed as an afterthought. Of course Bing should be paying attention to their own clicks... and not scraping their competitor's data. But instead, they're trying to justify it using PR words, and creating a convoluted argument that they are merely, "showing the surfstream" rather than "creating a reproduction of an original work", i.e. copying.

    It would be like a dating site copying a fake profile from a competitor. "We didn't copy that profile, we're just showing the datestream."

    --
    How many more years will slashdot have an off-by-one error on your Score in your profile?