Slashdot Mirror


Fedora Metrics Help Whole Linux Community

lisah writes "When Fedora released Fedora Core 6 late last year, the team decided to track the number of users with unique IP addresses who connected to yum in search of updates for a new installation of FC6. According to the data they collected, FC6 crossed the one-million user mark in just 74 days. Fedora Project Leader Max Spevack says that while it's great to use metrics to better understand what users want, the real value lies in its ability to encourage hardware vendors to offer more Linux-oriented goods and services. Spevack told Linux.com: '[W]e always say we wish hardware vendors had more [Linux-capable] drivers. Well, if you can go to them and say, "Hey, there's millions of people using this," then maybe they will listen. In the real world, you need data to prove your case. Well, here it is.'" Linux.com and Slashdot are both owned by OSTG.

34 of 132 comments

  1. But.... by CaymanIslandCarpedie · · Score: 4, Funny

    Doesn't collecting data make you evil?

    --
    "reality has a well-known liberal bias" - Steven Colbert
    1. Re:But.... by Florian Weimer · · Score: 2, Insightful

      Doesn't collecting data make you evil?

      Only if you call the process "activation" instead of "metrics".

    2. Re:But.... by spevack · · Score: 5, Informative
      the day they start requiring registration or creating GUIDs is the day I give the shove to Fedora -- and I've been an RH user for 8-9 years.

      As the "Fedora Project Leader", the Fedora buck stops with me, so to speak.

      And I promise you that I will NEVER require anyone to "register" Fedora in order to download updates, or stuff like that.

      Neither I, nor the Fedora Board, which is Fedora's governing body, will allow some sort of "required registration" in order to get the full Fedora experience.

      Download. Install. Update. If that's the extent of a person's interaction with Fedora, fine by me. We hope, of course, that there will be a fourth step, that being: Contribute

  2. 1,000,001 I can't switch but would like to by quixote9 · · Score: 3, Interesting

    I have legacy hardware, and too little knowledge, so I'm too afraid to switch from Core 3 to 6. God only knows what would break, and I sure don't know enough to work around it. But if I could get 6, I'd be in their statistic too. There's bound to be more people like me, who can't get 6 for some reason. So that number is a low estimate!

  3. Saddly... by DrYak · · Score: 4, Interesting

    Sadly, this metric will be attacked very quickly because of all the users whose broadband connections change IP address every 24 hours.

    Maybe counting how many different IPs downloaded *1* given critical update will be more precise (based on the assumption that even users with non permanent IP will download the patch once to secure their machines, and then won't download it again).
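
    A rough sketch of that counting idea in Python, assuming the mirror keeps Common Log Format access lines. The sample log, the addresses, and the file name critical-fix.rpm are invented for illustration and are not taken from Fedora's servers.

```python
def distinct_ips_for_path(log_lines, wanted_path):
    """Count distinct client IPs that requested one specific file."""
    seen = set()
    for line in log_lines:
        parts = line.split()
        # Common Log Format: ip ident user [date tz] "METHOD path proto" status size
        if len(parts) < 7:
            continue  # skip malformed lines
        ip, path = parts[0], parts[6]
        if path == wanted_path:
            seen.add(ip)
    return len(seen)


# Hypothetical sample data (documentation-range addresses, invented file names).
SAMPLE_LOG = [
    '192.0.2.10 - - [01/Jan/2007:10:00:00 +0000] "GET /updates/6/critical-fix.rpm HTTP/1.1" 200 1024',
    '192.0.2.10 - - [02/Jan/2007:10:00:00 +0000] "GET /updates/6/other.rpm HTTP/1.1" 200 2048',
    '198.51.100.7 - - [02/Jan/2007:11:00:00 +0000] "GET /updates/6/critical-fix.rpm HTTP/1.1" 200 1024',
    '192.0.2.10 - - [03/Jan/2007:09:00:00 +0000] "GET /updates/6/critical-fix.rpm HTTP/1.1" 200 1024',
]

print(distinct_ips_for_path(SAMPLE_LOG, "/updates/6/critical-fix.rpm"))
```

    With the sample data this prints 2: the address that fetched the file twice is counted only once. A machine whose address changed between two fetches would still be counted twice, so this narrows the dynamic-IP problem rather than removing it.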

    But even if it lacks precision, it is still a good indicator that Linux *IS* in fact popular and much more widespread than people think.
    It just lacks sales figures to prove it. ...

    Especially when compared to the many "Vista didn't get a warm welcome" reports we read these days.

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
    1. Re:Saddly... by whoever57 · · Score: 2, Insightful

      Sadly, this metric will be attacked very quickly because of all the users whose broadband connections change IP address every 24 hours.
      All users? I don't think my cable IP address (dynamically assigned) has changed in over a year.
      --
      The real "Libtards" are the Libertarians!
    2. Re:Saddly... by spevack · · Score: 5, Informative

      Actually, the Fedora folks address that very point. Quoting from the Fedora Project wiki, and its page on Statistics:

      "Accuracy of metrics

      We believe it is reasonable to equate a "new IP address checking in" with "a new installation of FC6", with the following caveats:

      1. Users who have dynamic IP addresses will likely be counted multiple times, which inflates the number by some amount.

      2. Users who are behind NAT, corporate proxies, or who rsync updates to a local mirror before updating will not be counted at all.

      The anecdotal evidence that we receive from different groups, companies, and organizations makes it quite clear that group (2) is significantly larger than group (1). As such, we believe that the true numbers in the field are higher than the numbers on this page."
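
      The two caveats pull in opposite directions, which a toy Python model can illustrate. The machine names and addresses below are hypothetical; this is not how Fedora computes its figure, only a sketch of why distinct IPs and real installations can diverge.

```python
def compare_counts(checkins):
    """checkins: iterable of (machine_id, public_ip) pairs.

    Returns (distinct_ips, distinct_machines).
    """
    checkins = list(checkins)  # allow generators to be passed in
    ips = {ip for _, ip in checkins}
    machines = {machine for machine, _ in checkins}
    return len(ips), len(machines)


# Hypothetical check-ins; machine names and addresses are invented.
dynamic_ip_user = [
    ("laptop-a", "192.0.2.1"),
    ("laptop-a", "192.0.2.2"),
    ("laptop-a", "192.0.2.3"),
]
nat_office = [("desk-%d" % n, "198.51.100.9") for n in range(5)]

print(compare_counts(dynamic_ip_user))               # one machine, three addresses
print(compare_counts(nat_office))                    # five machines, one address
print(compare_counts(dynamic_ip_user + nat_office))  # both effects combined
```

      In this made-up mix the IP count (4) ends up below the machine count (6), which is the direction the wiki page expects for the real numbers. Whether that holds in practice depends on the actual sizes of the two groups.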

    3. Re:Saddly... by Lord_Slepnir · · Score: 2, Informative

      The numbers will be inflated, but also deflated by places like the one where I work that have multiple FC6 hosts behind the same router.

  4. Distinct, not "unique" by ColonelPanic · · Score: 4, Insightful

    IP addresses are necessarily unique ("one of a kind"). You mean "distinct" here.

    --
    "Skill shows through where genius wears thin." -Wittgenstein || Religion: uniting aviation and architecture.
  5. People != Computers by RyoShin · · Score: 2, Insightful

    Well, if you can go to them and say, "Hey, there's millions of people using this,"
    Actually, it's a million computers using this (that's actually at least a million computers, as multiple PCs may be behind one public IP). Especially amongst the more computer-oriented people (of which the Linux community has many), it's not uncommon to have more than one computer running the same OS. I myself have three computers, two of which run Windows (the third is being put together). While these are tied to one DSL line, one of them, a laptop, may travel to other wireless networks and thus change IPs, so I could be recorded under two unique IPs but be only one person.

    Not saying there isn't a vast number of Linux users (I'm sure there are well over a million individual Linux users - that's a third of 1% of just the American population), just that numbers from data like this can be skewed.
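
    The "third of 1%" figure can be checked with a few lines of Python, assuming a US population of roughly 300 million (a round figure assumed here, not one given in the thread).

```python
users = 1_000_000
us_population = 300_000_000  # assumed round figure, not from the thread

share_percent = users / us_population * 100
print(round(share_percent, 2))  # about one third of one percent
```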
    1. Re:People != Computers by bky1701 · · Score: 2, Insightful

      You have it backwards. Are you going to download 3 fedora CDs because you have 3 computers? Maybe if they are differing archs... but that's not normally the case. Thus, the number would be LARGER than the one they gave, because many people use the same CD for more than one install, give their CDs away after using them, etc.

    2. Re:People != Computers by RyoShin · · Score: 2, Informative
      You are quite correct; one person would not download the CD three times.

      However, that's not how they're collecting the data:

      the team decided to track the number of users with unique IP addresses who connected to yum in search of updates
      While you need only one CD to do multiple installs, it is my understanding that each machine has to run YUM itself. They've also thought of what you mentioned.

      According to Spevack, it's not enough to simply count how many times the distribution has been downloaded
      Now, the article does go on to say

      Cacti tracks the number of unique IP addresses that connect to yum with a new installation of FC6
      So, if YUM/the tracking software can differentiate between a fresh install and a regular update, then the number of connections better correlates to new users.
  6. Impossible to install without connecting by currivan · · Score: 5, Informative

    I just installed FC6 on a machine yesterday, and they made it impossible to do anything without connecting to their server. I'm keeping the machine off the network, but apparently there's no way to install packages from the DVD without first downloading the update lists from their mirrors.

    The Add/Remove gui (and yum) crashes if DNS isn't available. After some research, I was able to hack the yum .repo files to point to the DVD instead of the internet, but it still crashes with mysterious errors about media uris. I finally gave up and installed Ubuntu instead. So no, this doesn't help the whole Linux community. We'd be furious if Microsoft imposed this sort of requirement on new installations.

    1. Re:Impossible to install without connecting by unchiujar · · Score: 2, Informative

      What about yum --disablerepo=* localinstall or rpm ?

      --
      Shakespeare poems - infinite monkeys with infinite time. Computer tech support - a few trained ones working from 9 to 5.
    2. Re:Impossible to install without connecting by LnxAddct · · Score: 4, Informative

      I just installed FC6 on my macbook pro over the weekend, and I had no internet connection at all during the entire process (I regularly work offline). It worked fine, so I can only assume that your case is an isolated incident.
      Regards,
      Steve

    3. Re:Impossible to install without connecting by jon_burgess · · Score: 2, Informative

      Unfortunately the repodata provided on the CD & DVD is not useable by yum but creating a local yum repository is quite easy once you know how.
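
      As a hedged illustration of what a local repository definition might look like, the Python sketch below builds the text of a yum .repo stanza with a file:// baseurl. The repo id, name, and directory are hypothetical, gpgcheck=0 is an assumption made for a quick local test, and the directory is assumed to already contain the RPMs plus metadata generated by the createrepo tool.

```python
import configparser
import io


def local_repo_stanza(repo_id, name, directory):
    """Return the text of a yum .repo stanza pointing at a local directory."""
    config = configparser.ConfigParser()
    config.optionxform = str  # keep option names exactly as written
    config[repo_id] = {
        "name": name,
        "baseurl": "file://" + directory,
        "enabled": "1",
        "gpgcheck": "0",  # assumption: signature checks off for a quick local test
    }
    buffer = io.StringIO()
    # Write key=value without spaces around the delimiter.
    config.write(buffer, space_around_delimiters=False)
    return buffer.getvalue()


# Hypothetical repo id, name, and path, chosen only for illustration.
print(local_repo_stanza("local-dvd", "Local FC6 packages", "/srv/fc6-local"))
```

      The resulting text would be saved as a file ending in .repo under /etc/yum.repos.d/.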

      Installing packages from the original media is great just after you've loaded the system, but remember the good old days when you would be given a prompt like: to complete this change you need to insert disk 3 of the installation media. Good luck finding the original disks a year or two after installing the PC.

      I believe the majority of people are happy that yum is preconfigured to download and install the most up to date version of a package from the internet whenever they need to install something new.

  7. RH response to Ubuntu's 8 million number? by Locutus · · Score: 4, Informative

    Given the numbers coming out, I'd think that it sure can't hurt for these guys to post the number they are.

    Here (2nd page) Mark Shuttleworth mentioned Ubuntu having 8 million active users:

    http://redherring.com/PrintArticle.aspx?a=20497&sector=Briefings

    Now what are the hardware vendors waiting for? Permission from Microsoft?

    LoB

    --
    "Anyone who stands out in the middle of a road looks like roadkill to me." --Linus
    1. Re:RH response to Ubuntu's 8 million number? by spevack · · Score: 5, Informative

      The key difference, IMHO, is that in Fedora we are trying to demonstrate *where* our numbers are coming from, as opposed to just giving a number with no context.

      It's also important to realize that this metric is just for Fedora Core 6, not "all instances of Fedora 1-6".

    2. Re:RH response to Ubuntu's 8 million number? by Locutus · · Score: 2, Insightful

      I wasn't interested in a "my numbers are bigger than yours" discussion and obviously, there are more TOTAL Fedora users than the number of Fedora 6 users.

      And yes, it's a big deal having data and the technique for getting those numbers. Shuttleworth didn't state where the numbers came from but also wasn't asked. My guess is those numbers came from their date servers since I've seen default Ubuntu installations setting /etc/default/ntpdate to point to ubuntu.com servers.

      Anyway, it is great these numbers are getting out there and even better when they can be validated.

      Now, the problem will be dealing with the bitch-slapping hardware vendors are going to get from Microsoft for even saying the "L" word.

      LoB

      --
      "Anyone who stands out in the middle of a road looks like roadkill to me." --Linus
  8. Why only now? by Pecisk · · Score: 4, Insightful

    Personally I don't understand the shyness/lack of will/underrating of ourselves in this case. Look at Firefox, they built a whole PR campaign around those numbers! And as for whether they matter... THEY DO. They are true numbers that can be verified, checked, compared, etc.

    I think most of the problem with using the meme "look at the numbers, the user count is huge, man" is that there are a lot of geeks who simply don't see this argument as valid (those numbers can't be wrong, etc. etc.). They would rather convince hardware developers that they MUST get those damn specs out (by some hidden moral or simple common sense, which, I agree, exists in this case too) than try to wow them over to the community side (presentations, numbers, proof of concept (you don't have to care about the driver, etc.)).

    We need more actions like SpreadFirefox, period. Done right, they just work.

    --
    user@ubuntubox:~$ stfu This server is going down for shutdown NOW!
  9. No by DrYak · · Score: 4, Insightful

    Collecting non-personally identifying data that would be logged anyway during normal server operation (httpd/ftpd daemons will log connections anyway, whether or not the FC owners choose to do something with them) and publishing only the compiled form (the total number, as opposed to the complete obfuscated [rot5 scrambled?] list, AOL-style) ISN'T EVIL (it is just like the "number of visitors" counters back in the old Web 1.0 days).

    Collecting data in an opt-in manner, as http://counter.li.org/ does, to compile statistics ISN'T EITHER.

    Collecting data that doesn't necessarily need to be collected for technical reasons (IP address vs. Pentium serial number), without telling the user first and without asking the user's permission first, THAT IS EVIL (and regularly done by Microsoft and other objects of hatred of the /. crowd).

    --
    "Sufficiently advanced satire is indistinguishable from reality." - [Tips: 1DrYakQDKCQ6y52z6QbnkxHXAocMZJE61o ]
  10. Read the summary, at least by ukatoton · · Score: 2, Insightful

    the team decided to track the number of users with unique IP addresses who connected to yum in search of updates for a new installation of FC6.

    It's how many new systems are checking in for the first time, and most people probably aren't reinstalling it constantly and downloading updates, so there's very little attacking you could do to these figures.
  11. best effort + transparency by spevack · · Score: 5, Informative

    I'm the guy who actually maintains that Statistics page on the Fedora wiki.

    The real "story" here is a couple of things:

    THING 1 -- We're making the best effort that we can at showing the world how many installations of Fedora Core 6 we know about.

    THING 2 -- We're being upfront about the assumptions and caveats that go along with that number. Quoting:

    "Accuracy of metrics

    We believe it is reasonable to equate a "new IP address checking in" with "a new installation of FC6", with the following caveats:

    1. Users who have dynamic IP addresses will likely be counted multiple times, which inflates the number by some amount.
    2. Users who are behind NAT, corporate proxies, or who rsync updates to a local mirror before updating will not be counted at all.

    The anecdotal evidence that we receive from different groups, companies, and organizations makes it quite clear that group (2) is significantly larger than group (1). As such, we believe that the true numbers in the field are higher than the numbers on this page."

    THING 3 -- We're also being upfront about how that number is generated.

    I'm not trying to spin the data in any way. I'm just putting it up there, and trying to do so as objectively as possible. Anyone can draw their own conclusions, or compare it to data from other distributions, if you can find similar reporting.

    1. Re:best effort + transparency by spevack · · Score: 2, Informative

      Well, we'll keep doing what we currently are doing. In addition, the idea currently under consideration is an OPTIONAL screen in firstboot where a user can choose to let us know more about their hardware and/or installed package set.

      KEY POINT TO MAKE: If a user says "no, go away and leave me alone", we will respect that.

      To anyone who wants to be part of the discussion, feel free to follow the Fedora Infrastructure list.

      http://www.redhat.com/mailman/listinfo/fedora-infrastructure-list

  12. Re:Sweet by quantaman · · Score: 4, Funny

    I'm ready to move. So where should we put the city? Well, I was thinking Canada or Europe, heck, why not Luxembourg? With a population of only 465,000, we'd make up a majority of the population and be able to form a government.

    Welcome to Fedoraland!
    --
    I stole this Sig
  13. Re:1,000,001 I can't switch but would like to by Intron · · Score: 4, Informative

    So just fire up a live CD with a recent kernel and try it out. You don't have to upgrade if it doesn't work. Hardware drivers are in the kernel, so just testing the right kernel on your system will tell you whether it works (mostly).

    FC3 uses kernel 2.6.9
    FC6 uses kernel 2.6.18

    --
    Intron: the portion of DNA which expresses nothing useful.
  14. Re:1,000,001 I can't switch but would like to by FudRucker · · Score: 2, Informative

    you need to learn to use Slackware, it is the best distro for old hardware...

    --
    Politics is Treachery, Religion is Brainwashing
  15. Using IP addresses for marketing? by Kelson · · Score: 3, Insightful

    Except collecting the IP addresses then using them for marketing purposes is not necessary

    How are they using the IP address for marketing purposes? They're using the number of IP addresses. No one can take the information they've released and determine that a computer at x.x.x.x is running Fedora. (And the information they have, they would have had anyway -- just like Slashdot knows the IP address you posted from.) As the GP said, it's no different from a website processing its server logs and reporting that it had X unique visitors during period Y.

    Come to think of it, since yum fetches data over HTTP, it is a website processing its server logs and reporting the number of unique visitors.
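
    For what "X unique visitors during period Y" means in practice, here is a minimal Python sketch that groups Common Log Format lines by day and counts distinct client addresses. The log lines are invented, and the request path is only an example of a file a yum client might fetch.

```python
from collections import defaultdict


def unique_visitors_per_day(log_lines):
    """Map each day string (e.g. '01/Jan/2007') to its number of distinct client IPs."""
    ips_by_day = defaultdict(set)
    for line in log_lines:
        parts = line.split()
        if len(parts) < 4:
            continue  # skip malformed lines
        ip = parts[0]
        # parts[3] looks like '[01/Jan/2007:10:00:00'; keep only the date part.
        day = parts[3].lstrip("[").split(":", 1)[0]
        ips_by_day[day].add(ip)
    return {day: len(ips) for day, ips in ips_by_day.items()}


# Hypothetical access log (documentation-range addresses).
ACCESS_LOG = [
    '192.0.2.10 - - [01/Jan/2007:10:00:00 +0000] "GET /repodata/repomd.xml HTTP/1.1" 200 951',
    '192.0.2.10 - - [01/Jan/2007:18:30:00 +0000] "GET /repodata/repomd.xml HTTP/1.1" 200 951',
    '198.51.100.7 - - [01/Jan/2007:19:00:00 +0000] "GET /repodata/repomd.xml HTTP/1.1" 200 951',
    '203.0.113.5 - - [02/Jan/2007:08:00:00 +0000] "GET /repodata/repomd.xml HTTP/1.1" 200 951',
]

print(unique_visitors_per_day(ACCESS_LOG))
```

    Only the per-day totals leave the function; the addresses themselves stay in the server's own logs, which is the distinction being drawn here.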

  16. Re:1,000,001 I can't switch but would like to by Znork · · Score: 4, Informative

    Personally, I rsync from a mirror and have a local repository, so I have a whole bunch of machines that don't get counted. Stuff like that will result in the numbers being a bit off.

    "so I'm too afraid to switch from Core 3 to 6."

    If you upgrade that rarely, I'd suggest you take a look at CentOS. CentOS 4 will be a far smaller leap (RHEL4 is close to FC3/FC4), and you'd be on a maintained platform again.

  17. Re:Sweet by gclef · · Score: 4, Funny

    Fedoraland? Bah. Tuxembourg!

  18. Even if it were MS, it wouldn't be evil. by Kadin2048 · · Score: 3, Insightful

    There might be an outcry if Microsoft did that, just because people hate Microsoft and think Microsoft is evil, but that wouldn't mean that doing it would be evil. (So, Microsoft may in fact be evil, but not necessarily everything they do is evil, and moreover, just because they could do something, doesn't make it evil.)

    There's nothing wrong with saying "x people accessed Windows Update this [year|month|day]." That's no different from the hit counters that used to exist on every web site. (And which were tacky, and I thank God that people finally realized this.)

    What would be evil, and the temptation they need to avoid, is to take their server logs and start mining them for data that can be sold or used for malicious purposes; i.e. personally identifying information about what users are using what versions of Windows, or even how often they're updating, etc.

    Aggregate information about hits is something that HTTP servers and their operators do all the time. Where it gets evil is when you have cookies tracking particular users across multiple sites, etc.

    --
    "Ladies and gentlemen, my killbot features Lotus Notes and a machine gun. It is the finest available."
  19. Re:1,000,001 I can't switch but would like to by chrwei · · Score: 2, Interesting

    I had no idea people still had these kinds of problems; they are what drove me from RH/Mandrake years ago. I moved to Debian Sarge (before it was even "stable") and even did a dist-upgrade from sarge to ubuntu on one system. "apt" upgrades are rarely a problem even when the system is "live" and not booted off a CD, and never an issue if done from the console so that when upgrading libs the X server doesn't crash on you.

    The oldest system I do this with is a 486DX2 50 MHz with 32 MB of RAM and there's never a problem. It's actually an HP Network Scanjet 5 with an Ubuntu "command line" install and the Enhanced scripts to run the interface; no idea where else a 486 with linux would be all that useful to maintain, though I'm sure there are some out there.

    --
    - Disclaimer: Information in this post deemed reliable but not guaranteed.
  20. Huh? by EXTomar · · Score: 2, Informative

    I just did a retro-fit upgrade and an install on two machines and neither went to the "yum" repository mirrors to do an update till after they finished their first reboot where I had to activate the update manually (and get the gpg keys installed).

    - I remember that "install" at some point gave me an option to install against latest package in the "yum" repositories, which I do not do for speed.
    - I remember the "upgrade" and "install" screens from Anaconda being different. The "upgrade" never asked me to update against the "yum" repositories.

    "pup", which is the graphical tool analog to "yum", handles rotating through the mirrors properly as far as I remember where it just fails over to the next if the current one can't be reached. I've had my Internet die while trying to do this, I don't recall it ever crashing on me and this is doing many installs and upgrades across every version of Fedora.

    I don't blame you for switching to something else given these problems. I'm just stumped how you got these problems.

  21. Re:NAT is even more significant. by Karzz1 · · Score: 2, Informative

    From the article:
    "We believe it is reasonable to equate a "new IP address checking in" with "a new installation of FC6", with the following caveats:
    1. Users who have dynamic IP addresses will likely be counted multiple times, which inflates the number by some amount.
    2. Users who are behind NAT, corporate proxies, or who rsync updates to a local mirror before updating will not be counted at all.

    The anecdotal evidence that we receive from different groups, companies, and organizations makes it quite clear that group (2) is significantly larger than group (1). As such, we believe that the true numbers in the field are higher than the numbers on this page. "

    --
    Beware of he who would deny you access to information, for in his heart he dreams himself your master.