Ask Dan Kusnetzky About Linux Server Counts
How many people run Linux as a desktop OS? How many servers run Linux? Is the Linux server market share 8.6 percent or 24 percent or somewhere in between? Dan Kusnetzky is a heavily quoted analyst at IDC who wrestles with questions like these for a living. This is your chance to find out how analysts come up with all those numbers -- and why analysts seem to disagree with each other so often. One question per post, please. We'll send 10 of the highest-moderated ones to Dan, and post his answers as soon as we get them back.
Hi Dan,
According to IDC figures, the Linux desktop market share was 5% and now it's less then 2%. Gartner numbers are of course differently (way lower if I'm not mistaken)
Yet, when I look at developments of projects like XFree, KDE, Gnome, Linux kernel - or when you get the daily list from freshmeat, or even talking to the the ISP who host mirrors of ISO images of Redhat, Mandrake etc - then you see that linux get FAR more then 2%. Hell - if it was 2% and you account the developments of Linux - then each developer works 25 hours per day on a porject!
So, as you can see - the numbers here are definately wrong here - and those numbers are actually hurting the Linux community. If an ABCD company wants to make a software for the Linux desktop and they see those IDC figures - then they will say something like "oh, 2%? no thanks - we'll make it for Mac - they are %5+", and we'll loose..
Comments?
Hetz (Heunique)
Is there a relationship between company size and linux use?
Do we have evidence of any companies with multi-million dollar revenues that rely on linux solutions for their servers or do most companies that use linux servers do it because they lack the money? (choice/only option)
-S
Scott Ruttencutter
We Apprentice Developers and Designers
Dan,
How does IDC determine if the answers they are getting are valid? What I mean by this is how does your organization insure that the people they are surveying being truthful in their responses? I can see scenarios where a well co-ordinated group of people could be swayed by a vendor to influence the results of a survey, especially in a field as specialized as the IT industry.
Also, if you suspect that the results of a survey are tainted, do you publish it anyway (with caveats included, of course) or redo it (at your time and expense)?
If "disco" means "I learn" in Latin, does "discothèque" mean "I learn technology"?
In studies of market share(or studies in general), we often hear quotes about who funded them. This seems to somehow imply that those who funded the study had some influence in how the data was gathered/interpreted.
Even if the funder does not infulence the questions asked they may well control if the "results" see the light of day.
I have often wondered how biased polls are based on the questions asked, the demographics of the people polled, etc. When results about polls are made public, is it also possible to obtain information about how the poll was conducted in a simple, by request method?
It's quite trivial for methodology to affect the results. That is why in any kind of scientific publication the method is considered at least as important as any conclusion.
Not only is no methodology available "conclusions" are being called "results".
Hi Dan,
This is probably going to be a tough question for you (if you ever get it, that is - given the bias on Slashdot, this probably won't be modded up).
A big problem I see with your methodology is that you probably overcount Linux server shipments. From what I understand (I may be incorrect here) you count each sale of Linux as a server shipment. However, many of these copies may be tested but never used, a great many are used in home computers or development workstations. The problem is, unlike with Windows or OS/2, there is no ``client version'' vs. a ``server version''. There is no easy way to tell if a copy of Linux is used in a server environment or a client environment by sales figures alone. So my question is: given total Linux sales figures, how do you estimate the number used in server environments?
Secondly, is it possible that the descrepancy between your numbers and Gartners' is due to a problem in the way you estimate Linux server sales?
---
/bin/fortune | slashdotsig.sh
The technical approach of talking to publicly accessible servers and attempting to fingerprint the OS is fraught with methodological problems.
The naive approach of asking hardware vendors how many units of hardware ship with what OS is obviously flawed.
So why not just do the dumb brute-force thing? Dial phone numbers at random and ask people what they use? Too expensive?
How much is accurate data worth, anyway?
Do you have any insight about trends that can be gleaned from these surveys - that is, regardless of what number one uses for the percentage of users that adopt Linux, is it static, taking off, steadily increasing, or what?
Toronto-area transit rider? Rate your ride.
At what point does a computer become a server? Many Linux desktops have ftp, telnet & http ports open, so do they count as severs too?
I just remembered an Idea that a friend of mine suggested:
What if IDC could work with the Linux distributions (RedHat, Mandrake, SuSE, TurboLinux, Debian) to add a small program which will run after the first internet connection has been succsessfull..
When this program runs - it will ask the user to "register" his copy of the distribution. If it has been purchased from one of the distributors - then the user can add his serial number. Some other questions like will this distribution be used as a server, a workstation, or combination of the 2, or a development workstation. The survey SHOULD be annonymous (unless the user wants to give some details about himself)
By that way - the distributors can give the numbers back to IDC - and IDC can publish a report which will tell that the number of Linux installations - and that number is X. X is combined of Y free download version and Z purchased copies of Linux.
What do you think, Dan? what the slashdot readers think about it?
Hetz (Heunique)
Do you base your data mostly on marketing analysis or do you actually go a pay a consultor to scan machines on the net? If there are scans involved, how do you pick the IP blocks to be scanned and what's the uncertainty associated with such a method (and how is this uncertainty guessed)? If there are no scans involved, why not? If this is "maket analysis", can you defined that for me? Which factors are involved? And a different question: who's the target market for this kind of study? How much does such a thing cost?
- Ask Gartner their methodology,
- Document it so that it can be reproduced;
- Make sure and get Gartner to say "yep that's how we did it".
- Dan&Co reproduces the methodology and compares the numbers
Might be overly scientific. Maybe consider it an "Open Source" version of market research because of the peer review and verification. Some may think this approach has no place in the market research area.But, I (for one) think it would be interesting to call Gartner's Bluff (if indeed that's what this is). I personally place them (and ZD, etc) into the Shill category.
Thoughts anyone?
Don't sweat the petty things. But do pet the sweaty things.
Might he have the stats on the BSD's. People from the BSD community are curious. :)
TIA
If it is possible or already happened, do analysts in general (and you in particular) find it a worrisom possibility, and if so, are there any attempts/ideas to deal with the issue?
Thanks,
DVK
"The right to figure things out for yourself is the only true freedom everyone shares. Go use it"-R.A.Heinlein
In studies of market share(or studies in general), we often hear quotes about who funded them. This seems to somehow imply that those who funded the study had some influence in how the data was gathered/interpreted.
My question is: Do those who fund a study influence how the study turns out (ie. Microsoft studies show higher MS market share). Or is it that these corperations only decide to fund groups who they know will most likely return results in thier favor.
---Lane
What's the point of moderating?!
I have a question on the area of predictions in general. For example, fellow IDC analyst Jill House has been severaly negative on Palm over the years, with regard to the Win CE operating system and devices. A sample quote from her in Feb. 2000 read, "If I was Palm, I would be beside myself with panic."
The issue is, that over the last 3 - 4 years she's been predicting the demise of Palm and the rise of Win CE, a claim that has never materialized. Who verifies the reliability of these predictions and keeps the analysts accountable. With the frequent sound bytes and one-liners that they give to the press, these analysts have significant influence over public perception of the issues. But how is policing done when the analysts don't analyze very well?
Sincerely,
Daniel McCarty
Palm OS Developer
Have fun: Join D.N.A. (National Dyslexics Association)
Do these analyses factor in the so-called third world? Most of these analyses are US-centric or some times do include the continent across the pond but what about Africa and Asia? Do these analyses *really* take inputs from these continents?
Thanks for your time,
- Brad Heintz
--
http://www.bradheintz.com/
- updated
I have often wondered how biased polls are based on the questions asked, the demographics of the people polled, etc. When results about polls are made public, is it also possible to obtain information about how the poll was conducted in a simple, by request method? Now if the answer to that question is, no, how much can we rely on polls, since we have no way of verifying if the questions asked and the people interviewed were heavily biased to favor one outcome over another? (Such as in the recent large discrepancies of the 8% vs. 24% use of Linux as a server results that we've seen on Slashdot recently).
When People magazine does an issue devoted to "what's hot" in fashion, do they interview Jane Doe from Des Moines, Iowa? No.
So why are OS numbers reported with equal rating? Not all users are equally suited to *choose* an OS, therefore not all users *choices* are equally interesting. I'd really like to see a breakdown of OS by user-type (levels of education, field of degree if applicable, occupation, etc). Keep in mind this applies just as much to business. A technology company presumably put more informed thought into their choice of server than an art supply house or whatever.
--
324006