Slashdot Mirror


MSN Search Engine Favors IIS

Scud writes "It appears that if you want to rise up in the rankings over at the MSN search engine you would do well to host your page on IIS. Ivor Hewitt has done a study and it appears that by using IIS, you are likely to increase your odds of a higher listing by several percent."

27 of 565 comments (clear)

  1. If you only have 20% of the market by Safety+Cap · · Score: 2, Informative

    ...you gotta do something to pump up your buggy, non-mainstream, insecure webserver.

    --
    Yeah, right.
    1. Re:If you only have 20% of the market by badriram · · Score: 2, Informative

      Duh, 20% of market share does not make a product mainstream. And if you think 2 security advisories is a lot compared to Apache's 24.

      If you were joking or being sarcastic well you went right over my head....

  2. Mirror site: by Bananatree3 · · Score: 3, Informative
  3. Not a controlled experiment by Anonymous Coward · · Score: 5, Informative

    To be conclusive, it needs to be a controlled experiment with the same text and same outgoing/incoming links.

    Just the webserver alone changing. This can happen by taking a popular site and then changing what it reports to the MSN search robots.

    But until such an experiment is done, the data is open to too many interpretations.

  4. Re:Would it even be worth it? by Peeteriz · · Score: 3, Informative

    MS software installs MSN as the default search engine in many places, and there are an awful lot of users who don't bother to change anything, so when they hit 'Search', it's going to be MSN.

  5. Re:Top MSN Rankings by enigma48 · · Score: 1, Informative

    Unlikely.

    Either MS has the world's best pattern recognition software by far (facial recognition software requires a lot of cpu work and under ideal circumstances, only accurate slightly more than 50% of the time) or they'd have to have someone sit and go through every picture ever indexed, then press a "Put this site to #1" button.

    With Google indexing 8 billion pages, even spending 1/100 of a second per page would make indexing a multiple-year task and pattern rec. should take longer than that. Things like "is_site_running_IIS()" can be done far, far, far faster.

    Maybe I've been trolled. Oh well. Hope someone learned something.

  6. Re:Why would i want to do that? by Anonymous Coward · · Score: 1, Informative

    Eh, sorry to burst your bubble there, buddy, but MacOS X isn't a webserver.

    Apache is a webserver. Zeus is a webserver. IIS is a webserver. thttpd is a webserver. MacOS is an OS.

    So, you just lost any credibility with that mindless-drone-MacOS-X-troll. Compare apples to apples, please.

    Besides, MacOS X is really not the best platform to run webservers off of -- the hardware is expensive, the OS is expensive. Try again. (hint: FreeBSD, which is really the heart of MacOS X, and runs on dirt-cheap x86{,-64} hardware)

  7. That explains it by guru42101 · · Score: 2, Informative

    One of our clients has a web app hosted on an IIS box and their main website hosted on apache. The web app ranks higher than the main website when doing a search for them.

  8. Re:Absolutely by Jim_Maryland · · Score: 2, Informative
    Running your query without quotes returns:
    (You do mean this link(San Andreas Radio), right?)

    MSN - 3rd

    Google - 3rd
    With quotes around it:

    MSN - 2nd

    Google - 1rst
    With a + (plus sign):

    MSN - 2nd

    Google - 1rst
    I'm not sure where #7 came from (unless others have done repeated searches on this too). Do we know what Rock Star Games is using for their web server then to know if the IIS preference comes into play in this case?

  9. Re:Would it even be worth it? by sh00z · · Score: 5, Informative
    Hre's the stats for my site, accumulated since April 1 (average 3000 unique visitors per day):
    - Google 7873
    - Yahoo 3163
    - MSN 199
    - AOL 65
    - Dogpile 44
    - Unknown 41
    - Earth Link 28
    - AltaVista 16
    - Excite 14
    - A9.com 9
    - Others 77

    ...which comes out to about 2% MSN.

  10. Updated report by orv · · Score: 2, Informative

    Tha article actually links to an older smaller version of the analysis. There's a more comprehensive wordlist at: http://www.ivor.it/goog

  11. Re:Top MSN Rankings by TobascoKid · · Score: 2, Informative

    God why the hell does everyone cry troll the minute someone makes an obvious joke

    Because jokes are never obvious to everyone. Different senses of humour, cultural differences and the fact that some people think in different ways from other people (like people with Aspergers - and there are probably one or two people who read /. who have Aspergers) mean that what's an 'obvious' joke to you is a troll to someone else.

    --
    At some point, somewhere, the entire internet will be found to be illegal.
  12. Re:Does the MSN robot have a signature? by ebonkyre · · Score: 3, Informative

    "msnbot/1.0 (+http://search.msn.com/msnbot.htm)"

    --
    "Time is an abstract concept devised by carbon-based lifeforms to monitor their ongoing decay." - Thundercleese
  13. Re:I laugh at Microsoft. by Pecisk · · Score: 2, Informative

    Google is a company which publicy trade only a small tiny part of their shares. Most of them still belongs to founders, as they don't seem to even try to sell them.

    So... No. It is not that way you state it.

    --
    user@ubuntubox:~$ stfu This server is going down for shutdown NOW!
  14. Conspiracies of one by coyote-san · · Score: 3, Informative

    You can have a "conspiracy of one" if that person acts in multiple roles.

    As an example, let's say that one person is a company's bookkeeper and CFO. (This isn't uncommon in small companies.)

    As a bookkeeper she cooks the books to cover her embezzlement.

    As CFO she prepares false financial documents for her company and its investors.

    One person, criminal acts in two roles, so in many states she can be charged with conspiracy in addition to embezzlement.

    BTW, this isn't a "conspiracy" in the legal sense since it's not a crime to give preferential service on the basis of web server. It's sleazy unless it's fully disclosed, but it's not a crime unless they actually sell the search engine as an unbiased tool.

    --
    For every complex problem there is an answer that is clear, simple, and wrong. -- H L Mencken
  15. Re:I laugh at Microsoft. by mattmentecky · · Score: 2, Informative

    I don't know what your point is, but yes, it is like I said.
    Google is a company, in which shares of the company are traded publicly, a 'publicly traded company' --> public company. You are trying to split hairs over what portion founders/insiders own, but this is mearly called retaining 'creative control'.
    As of Jan 2005 53% of the stock is owned by insiders, and I would not call 47% a 'small tiny part of their shares'.

  16. Re:Do we see a significant effect? Is it just chan by Swanktastic · · Score: 4, Informative

    Believe it...

    First off, I looked at the difference in means for Apache rankings in MSN and Google. 61.5% (MSN) vs. 64.3% (Google) for 970 observations Right there, you ought to be able to eyeball it and see significance. But, to make sure, here are the results of a t-test which checks the likelihood that two matched sets have different means (forgive the crappy formatting):

    M G
    Mean 0.615061856 0.642948454
    Variance 0.01100624 0.008740111
    Observations 970 970
    Hypothesized Mean Difference 0
    df 969
    t Stat -10.51551356
    P(one-tail) 7.26569E-25
    t Critical one-tail 1.646427658
    P(two-tail) 1.45314E-24
    t Critical two-tail 1.962415113

    As you can see, the P is 1.45 x 10^-24, which at least makes us think the results are not pure coincidence. I don't intend on speculating on the causality, though...

  17. Re:IIS/Apache - No diff by Lukey+Boy · · Score: 2, Informative
    Funny, I get this from Redhat:
    HTTP/1.1 200 OK
    Date: Tue, 26 Apr 2005 15:46:19 GMT
    Server: Apache
    Set-Cookie: Apache=206.116.84.10.2859511145303800; path=/; expires=Sun, 25-Apr-1
    0 15:46:19 GMT
    Cache-Control: max-age=21600
    Expires: Tue, 26 Apr 2005 21:46:19 GMT
    Last-Modified: Wed, 20 Apr 2005 22:38:11 GMT
    ETag: "fdd9d-37b1-4266d9d3"
    Accept-Ranges: bytes
    Content-Length: 14257
    Content-Type: text/html
  18. Re:Silly, silly boys (and girls) by Anonymous Coward · · Score: 1, Informative

    configure his web server to report itself as IIS in the headers it returns. That's the only real way to know what a web server is running, unless you want to parse server-created error messages, or exploit vulnerabilities in the server itself.

    This isn't true. IIS and Internet Explorer don't work according to spec. when closing down TCP connections in order to speed up subsequent connections between the two pieces of software. This can be used to differentiate between IIS and other servers.

  19. Re:IE bias too - RTFA by Anonymous Coward · · Score: 1, Informative

    RTFA! Google and MSNS aren't the only search engines he tested - they are just the only ones worth talking about. So you would have to conclude that /everybody/ is discriminating against IIS.

  20. Sweets for the sweet, lies for the liars by AndroidCat · · Score: 4, Informative
    MSN Search should be banned for being dishonest.

    Add something like this pseudocode to your server:
    if $Browser = "MSNSearchBot" then $Server = "Microsoft-IIS/6.0"

    --
    One line blog. I hear that they're called Twitters now.
  21. Bogus Stats by Illserve · · Score: 1, Informative

    Slow down a second.

    First, anytime you see P 10^-24 for a sample of only 1000 and means differing by 3% you should be suspicious immediately.

    And 969 degrees of freedom?!

    Buddy, your stats are waaaaay off. There is just 1 DF in this data.

    My guess is that you tried to feed categorical data to a t-test thing and it barfed (as it should), so you fiddled with it until it claimed to produce a successful result (by somehow misinterpreting each of the 970 observations as a degree of freedom) and now you've got P 10^24, which is lunacy.

    thumbs down on your methodology.

    1. Re:Bogus Stats by Illserve · · Score: 2, Informative

      Mod Me down!

      I hadn't read his F'ing Link and thought 1000 were each individual webservers for a particular search term.

      He meant 1000 different searches, which is a sensible way to do it.

      His stats may be fine.

  22. Re:IE bias too by cortana · · Score: 2, Informative

    > Maybe it is Google discriminating *against* IIS, not Microsoft for.

    He said that the distibution of servers from Google's results matched those published by Netcraft (http://news.netcraft.com/archives/web_server_surv ey.html) very closely.

  23. iTunes? Never heard of it. by meatball_mulligan · · Score: 4, Informative

    Try typing "online music".

    On Google the top two references are iTunes and iTMS. On MSN you'll have to go through a few pages before you'll see anything about iTunes.

    Yeah, I trust Microsoft to provide unbiased search results. Sure I do.

    m.m.

  24. Re:IE bias too - RTFA by orv · · Score: 3, Informative

    Yeah the trouble is that the person who submitted the story linked to the old "original" result set i.e. the "/orig/" in the url rather than the more complete more recent results at: http://www.ivor.it/goog

    I guess the "MSN against Google" report is more attention grabbing.

  25. Re:IIS/Apache - No diff by bedessen · · Score: 2, Informative

    You are wrong, but not for the reason you think.

    When you telnet to port 80 and type "GET /" you are sending a HTTP 0.9 request. That was the first version of the protocol and is all but extinct. That version of the protocol had no such thing as headers, so if the server follows the HTTP 0.9 spec, you will never get any headers. Apparently IIS does not speak HTTP 0.9 very well.

    However, if you use a valid HTTP 1.0 or 1.1 request: "GET / HTTP/1.1\r\nHost: example.com\r\n\r\n" you will get all the headers.

    So, I'm sorry, but your point is completely wrong. Apache sends just as many headers as IIS, as the other poster points out. You should really stop using telnet and start using curl or wget. If you do use telnet, please get a clue about how HTTP works.