Slashdot Mirror


Why Standard Deviation Should Be Retired From Scientific Use

An anonymous reader writes "Statistician and author Nassim Taleb has a suggestion for scientific researchers: stop trying to use standard deviations in your work. He says it's misunderstood more often than not, and also not the best tool for its purpose. Taleb thinks researchers should use mean deviation instead. 'It is all due to a historical accident: in 1893, the great Karl Pearson introduced the term "standard deviation" for what had been known as "root mean square error." The confusion started then: people thought it meant mean deviation. The idea stuck: every time a newspaper has attempted to clarify the concept of market "volatility", it defined it verbally as mean deviation yet produced the numerical measure of the (higher) standard deviation. But it is not just journalists who fall for the mistake: I recall seeing official documents from the department of commerce and the Federal Reserve partaking of the conflation, even regulators in statements on market volatility. What is worse, Goldstein and I found that a high number of data scientists (many with PhDs) also get confused in real life.'"

312 comments

  1. Statistics add plausibility - maybe not meaning. by Anonymous Coward · · Score: 1

    And, in a word full of highly numerate simpletons, one must never forget this.

  2. So you want to retire a statistical term... by Anonymous Coward · · Score: 5, Insightful

    ...because people use it incorrectly in economics? Get bent. The standard deviation is a useful tool for statistical analysis of large populations.

    1. Re:So you want to retire a statistical term... by Fouquet · · Score: 5, Insightful

      +1 this. The problem here is the author's impression that "social scientists" and economists are scientists. The groups that he excludes in the first paragraph (physicists) are scientists. Anyone attempting to implement a statistical model designed for a large (and Gaussian) data set on a small number of data points (as the article's example does) should expect to get an answer that is at best marginal. Any scientists who ever received even the most basic of statistics and/or data analysis training knows this. Understand the problem first, then take enough data points, then carry out your statistical analysis & formulate conclusions.

    2. Re:So you want to retire a statistical term... by Anonymous Coward · · Score: 0, Flamebait

      Let me guess - you're one of these painful people that orders disciplines by "most scientific to least scientific" with no doubt physicists as the master race at the top.
      The author of the article used 5 data points in their example because it was a simple, illustrative example. I doubt they were planning to use the small dataset as input to a complex analysis. In fact, none of your points have anything to do with the main argument of the article!

    3. Re:So you want to retire a statistical term... by The_Wilschon · · Score: 1

      He's rather requesting the people start using a different statistical measure of spread, the mean *absolute* deviation, rather than the square root of the mean *squared* deviation (the standard deviation). I'm not familiar enough with it's particular characteristics to say whether or not this would be an improvement in any rigorous sense, but I'd be surprised if it were. So "Get bent." is probably still the right attitude.

      --
      SIGSEGV caught, terminating

      wait... not that kind of sig.
    4. Re:So you want to retire a statistical term... by Anonymous Coward · · Score: 0

      Right, so all 'scientists' understand statistical methods, because they've received training in them. But economists don't. Right, and your evidence for this is? In case you're missing my subtle point, it's that there are plenty of scientists who don't really understand statistics, notwithstanding their training. This is understandable because most scientists become scientists for reasons other than a desire to master statistics.

    5. Re:So you want to retire a statistical term... by Livius · · Score: 1

      Don't economists use every mathematical or scientific concept incorrectly?

    6. Re:So you want to retire a statistical term... by NemosomeN · · Score: 1

      No, they don't.

      --
      I hate grammar Nazi's.
    7. Re:So you want to retire a statistical term... by mysidia · · Score: 5, Informative

      the mean *absolute* deviation, rather than the square root of the mean *squared* deviation (the standard deviation).

      The mean absolute deviation is a simpler measure of variability. However....

      The algebraic manipulation of the standard deviation is simpler; the absolute deviation is more difficult to deal with.

      Further, when drawing a number of samples from a large population --- the standard deviation of their mean deviations is substantially higher than the standard deviations of their individual standard deviations; that is to say, the standard deviation of a sample provides an estimate that is more in-line with the whole.

      That is to say.... there are cases where the Standard Deviation may be better, AND, much of statistics is using standard deviation as its basis.

      Fisher, R. 1920 Monthly Notes of the Royal Astronomical Society, 80, 758-770:

      the quality of any statistic could be judged in terms of three characteristics. The statistic, and the population parameter that it represents, should be consistent , The statistic should be sufficient, and the statistic should be efficient -- e.g. the smallest probable error as an estimate of the population. Both the standard deviation and mean deviation met the first two criteria (to the same extent); however, in meeting the third criterion -- the standard deviation proves superior.

    8. Re:So you want to retire a statistical term... by Anonymous Coward · · Score: 0

      Gaussian is the reason it should be banished. Almost nothing is gaussian, except in astronomy and physics experiments. Almost everything is closer to a beta. standard deviations suck for everything that isn't physics and astronomy. Seriously. Show me something that's truly gaussian outside those scales.

    9. Re:So you want to retire a statistical term... by jessetaylor84 · · Score: 1

      Exactly. Economists, psychologists, and sociologists just pretend like they are doing "science" when what they are really studying is social theory. The reason the mathematics are so misused here is that these are fields that don't lend themselves to mathematical analysis. Standard deviation is a useful mathematical concept for real sciences (physics, chemistry, computer science, etc.)

    10. Re:So you want to retire a statistical term... by hawk · · Score: 1

      And *you* seem to be under the impression that economists are "social scientists," or act similarly.

      I have a B.S. in Physics, and my Ph.D. is jointly in Economics and Statistics--including classes taken in Pearson Hall (yes, the same Pearson--statistics as a field comes largely from the Iowa State Statistics Lab).

      Again, with a Ph.D. in economics, I still don't understand what the so-called "social sciences" are--they seem to be primarily an claim to an exemption from the scientific method (which has a lot to do with economists not getting along with social scientists).

      Mainstream modern economists are indeed scientists, although there are definite problems with gathering data.

      hawk, aka dochawk

    11. Re:So you want to retire a statistical term... by Fouquet · · Score: 5, Informative

      That actually was part of my point. In my day job (and night job and weekend job, and, oh god I need a vacation) I'm an astrophysicist. I have more data sets that I can recall, and the number of problems for which I'm confident that the errors are Gaussian is at most 2 or 3. We're finally in an era where computational power facilitates forward modeling & Bayesian techniques that can provide good estimates of true uncertainties. But I (and many of my colleagues) barely understand how they work. Any expectation that most researchers are willing to invest the time to understand anything beyond Gaussian statistics is unrealistic.

    12. Re:So you want to retire a statistical term... by Fouquet · · Score: 1

      No, my original statement was unclear. I was not attempting to classifly economists with social scientists. I was stating that they are both a long way from physicists. Perhaps 'fundamental science' is a better term for what I'm willing to call science. FWIW, that excludes large chunks of stalwarts such as biology & geology.

    13. Re:So you want to retire a statistical term... by khallow · · Score: 1

      The average of a large number of independent data points with the same distribution has a near Gaussian distribution. And that's where standard deviations get used.

    14. Re:So you want to retire a statistical term... by bingoUV · · Score: 1

      Ok, that is interesting. I read a lot of "economists" and I don't get the impression that they are doing what I would call science. My impressions, compressed in a sentence, amount to the lack of consensus among premier economists about non-trivial, falsifiable and reproducible hypotheses.

      Physicists have that consensus about of lots of such hypotheses, "economists" that I read don't. Would I not get such an impression if I were reading "real" economists as defined by you? Or do you disagree that this lack of consensus is a serious obstacle in economics becoming a "science" ?

      --
      Bingo Dictionary - Pragmatist, n. A myopic idealist.
    15. Re:So you want to retire a statistical term... by globaljustin · · Score: 2, Interesting

      I always hated frequentist statistical methods and the Gaussian Distribution.

      I see below an Astrophysicist echos your claim.

      I'm happily surprised to learn I am not the only one who thinks the whole 'gaussian' should be banished.

      I come from a Systems Science and Research Methodology background in this area. One of my favorite parts of grad school was a 4 hr one on one tutoring session every week I did for a semester with my large State Univ. Research Director who is the person faculty/staff go to for questions about this stuff. All across the disciplines faculty, post-docs, PhD candidates come to these guys b/c it is their job to know & I was doing work for a prof who got me this and it was really cool.

      He explained how each discipline used statistics in their published research & rules for PhD candidates.

      One personal thing I took away was a deep mistrust of anything Gaussian, beyond some astrophysics & math stuff.

      Without getting too technical, IMHO it's bullshit. It's a scientist assigning what ammount to, not random, but factor analysis that is accurate only because of non-quantifiable expertise-type decisions in how to define the research question, how to test, and what kind of statistics to expect.

      I'm not saying they don't make good choices, sometimes these PhD's come up with excellent work across the academic disciplines, I'm just saying that you can piss in a jar and call it statistical significance...

      --
      Thank you Dave Raggett
    16. Re:So you want to retire a statistical term... by sFurbo · · Score: 2

      Assuming that the distribution the data points have is reasonably well-behaved.

    17. Re:So you want to retire a statistical term... by epine · · Score: 1

      How did you like this bit from Sean Carroll suggesting that "falsifiability" is the scientific concept overdue for retirement?

      Modern physics stretches into realms far removed from everyday experience, and sometimes the connection to experiment becomes tenuous at best. ... The cosmological multiverse and the many-worlds interpretation of quantum mechanics posit other realms that are impossible for us to access directly. Some scientists, leaning on Popper, have suggested that these theories are non-scientific because they are not falsifiable.

      The truth is the opposite. Whether or not we can observe them directly, the entities involved in these theories are either real or they are not. Refusing to contemplate their possible existence on the grounds of some a priori principle, even though they might play a crucial role in how the world works, is as non-scientific as it gets.

      If you think the rift between economics and social science is deep, take a look at your roots.

      To the degree that the educated public knows and respects the scientific tradition, it's because of the adherence to falsifiability. That's the economic foundation of getting what we pay for. I wouldn't go so far as to argue that the multiverse (and the statistical landscape--bleh) is not actually physics, but it sure as hell isn't physics resting on the foundation that has conferred upon physics its esteem and respect as a hard science over the last four centuries.

      Let's reopen the question about why we're funding this kind of work on the public purse. He's smoking a crack pipe if he thinks he can flush falsifiability and still keep his cozy budget. Personally I celebrate renegade artists and crackpots like Garrett Lisi—Kepler was equally nutty and he punched through. I think Carroll should sleep in a van on a Hawaiian beach and get back to us when his awe-inspiring kaleidoscopic symmetries collapse to a waveform we can actually test (yes, I know Dyson took a pot shot at collapsing waveforms on that same forum). Perhaps Carroll will cough up on the beach a core idea for the next consolidation of physics beyond the standard model. I'll cheer for him every step of the way. Meanwhile, no falsifiability means no public funds. He can grovel at the knees of the Templetons who find this new kind of science somehow majestic.

      I'm not a physicist. I did study physics. I'm not an economist. I have listened to nearly every EconTalk dating back to 2005. I think his crush on Hayek is misguided. This doesn't stop me from tuning in, because most of his guests are smart.

      Even with economics, there's an enormous rift between mathematical economics, the study of models that barely reflect reality, and narrative economics, where people try to convince each other that a stimulus actually works or it doesn't work, and no reference back to the raw data ever settles the matter.

      Economic policy advisors are no better than sociologists.

      Taleb's primary point—and he's totally right—is that analytic models that are always right except during rare events are complete dogshit, though it you can get traction on it, you might make off with a lode for a short while. The mere presence of a nearly infallible player in an economic market precipitates rare events of the nastiest kind. If I were teleported back to Edge 2005 to write a little essay for What do you believe true even though you cannot prove it? that could be my starting point.

      You should read Assumptions Economists Make, which describes economics from the perspective of Jonathan Schlefer, a political scientist who got his hands dirty.

      In Economics, You Are What You Model

      I don't know why you think you need to look outside your own disciplines of physics and economics to find people playing fast and loose with the scientific method.

    18. Re:So you want to retire a statistical term... by martin-boundary · · Score: 2

      Most are. In engineering and science, datasets a either bounded, or explicitly truncated. Therefore central limit theory applies. Frankly, Taleb has a bugbear about heavy tails, when in fact heavy tails are often a symptom of badly thought out measurements in scientific disciplines.

    19. Re:So you want to retire a statistical term... by Anonymous Coward · · Score: 0

      [...] rather than the square root of the mean *squared* deviation (the standard deviation) [...]
      Hey now, don't bring RMS into this discussion, he's already everywhere else as it is.

    20. Re:So you want to retire a statistical term... by coastwalker · · Score: 1

      Its "so bullshit" that the entire manufacturing output of the planet uses the Gaussian distribution, standard deviation and Cpk to make the things that keep sprats like you alive. Everything that is manufactured is measured on control charts - 3D printed livers, Nuclear fuel rods, gate oxide thickness on silicon chips, bandwidth available on your shared DSL line.

      Now if some smart bastard like you can improve on 99.999% of the manufacturing worlds assumption that the central limit theorem will reduce everything to Gaussian then you my son are going to be very rich and famous. Let me be the first to congratulate you on deploying more sophisticated statistical tools worldwide and thereby increasing the efficiency and quality of every single factory on the planet.

      Of course you might just be spouting your anally retentive opinion that you are disgusted by the fact that people don't use statistics in a precise way.

      --
      Facts are history now plebs have politics for religion on social media.
    21. Re:So you want to retire a statistical term... by Lamps · · Score: 1

      "I'm happily surprised to learn I am not the only one who thinks the whole 'gaussian' should be banished."

      How exactly do you banish a probability distribution? Should the Central Limit Theorem be discarded along with it?

    22. Re:So you want to retire a statistical term... by khallow · · Score: 1

      "Reasonably well-behaved" is rather broad here. The distribution merely needs to have a finite variance which is common (includes bounded or finite distributions, for example). The Wikipedia link gives an example of distributions with infinite variance.

    23. Re:So you want to retire a statistical term... by Mashdar · · Score: 2

      Gaussian distributions are a natural phenomenon, like pi, e, etc. The distribution was not made up for convenience (ever tried to integrate it?). You can't will away the base behavior of the universe (or our number line).

      Sampling any random variable which itself is a sample of smaller events will trend Gaussian by natural law. Noise is also often the sum of a huge number of random events, and is often quite Gaussian.

      Like binomial, Poisson, Laplacian, etc, Gaussians make a lot of sense for certain situations. Gaussian is also an excellent estimator for large sample sizes on binomial and Poisson processes.

      I don't really understand a general objection to Gaussian distributions, or their use in analysis. That would be like objecting to the use of the number e.

      Maybe you are upset that people assume Gaussians without considering it might be a similar distribution with very different process implications? (I.e. you can map a binomial on to a Gaussian, but the idea of where your samples are coming from is very different.)

    24. Re:So you want to retire a statistical term... by Mashdar · · Score: 1

      BTW, standard deviation has nothing to do with Gaussian distributions, other than Gaussians being parameterized by sigma. Standard deviation is just the second order moment of a distribution. It exists for many distributions, and does not 'belong' to any particular distribution. Is your problem with Gaussians or moments?

    25. Re:So you want to retire a statistical term... by globaljustin · · Score: 1

      Gaussian distributions are not natural phenomenon. Numbers are not a natural phenomenon.

      If humans want to use a Gaussian distribution to get rid of noise in some signal from a WIMP detector, fine....that's not really what we're talking about here.

      Using a Gaussian distribution to determine how far from random your Likert scale test of whether video games make people feel more aggression...well that's ruining science.

      --
      Thank you Dave Raggett
    26. Re:So you want to retire a statistical term... by Lamps · · Score: 2

      Gaussian distributions are not natural phenomenon. Numbers are not a natural phenomenon.

      Gaussian distributions are an arithmetical description of a very common natural phenomenon. Sure, they are a man-made construct, but they are 'natural' in the sense that they describe a quality that's inherent in collections of things (although you can also argue that the notion of 'collections' or 'groups' may arguably also not be a natural phenomenon). Talking about a Gaussian distribution is about as natural as saying '1/3 of them have quality x' or talking about the circumference of a circle.

      If humans want to use a Gaussian distribution to get rid of noise in some signal from a WIMP detector, fine....that's not really what we're talking about here.

      You categorically said that the Gaussian distribution should be banished. Although your post focused on science, it wasn't implied in your quote that you were only talking about science.

      Using a Gaussian distribution to determine how far from random your Likert scale test of whether video games make people feel more aggression...well that's ruining science.

      The Likert scale thing is an example in which, in principle, the Gaussian distribution is not appropriate, and thus, should not be used. However, in practice, it typically works just fine. You can find plenty of literature for or against doing so. Here's an example in which the author argues that it's permissible.

      But even if you don't want to apply parametric methods to Likert scales, what's wrong with, let's say, using Gaussian distributions to analyze participants' response times on some perceptual test? How is that, from a statistical perspective, qualitatively different from "getting rid of noise in some signal from a WIMP detector"?

      What really would screw up science is the inability to utilize useful statistical tools to perform analyses - this would be the effect of "banishing the Gaussian distribution" from analyses used in scientific research.

    27. Re:So you want to retire a statistical term... by globaljustin · · Score: 1

      Hey...I'll respond after you contribute to the conversation.

      I said this...further down in the post you're responding to, re: Gaussian Distribution

      One personal thing I took away was a deep mistrust of anything Gaussian, beyond some astrophysics & math stuff.

      The fact that you say this in response: "You categorically said that the Gaussian distribution should be banished." is bullshit and shows me that you are either a) trolling or b) not engaging with the topic enough to justify my time typing.

      --
      Thank you Dave Raggett
    28. Re: So you want to retire a statistical term... by gzuckier · · Score: 1

      So analyze your data using small sample and nonparametric (I.e. non Gaussian) models if appropriate.
      Hell, when you come right down to it, nothing is truly Gaussian, we just can't deal with the extensively multivariate nature of reality.

      --
      Star Trek transporters are just 3d printers.
    29. Re:So you want to retire a statistical term... by Lamps · · Score: 1

      The fact that you say this in response: "You categorically said that the Gaussian distribution should be banished." is bullshit and shows me that you are either a) trolling or b) not engaging with the topic enough to justify my time typing.

      An exact quote from your post: "I'm happily surprised to learn I am not the only one who thinks the whole 'gaussian' should be banished".

      You had no problem justifying your time typing a reply to a post that you deem unworthy of your time, so I call bullshit. So far, you've put forth the impression (and I'm sure I'm not alone in this) that you've put forth a dogmatic point of view, and that you are unable to defend that point of view. My post raised a number of valid points, none of which you've attempted to address. I understand that that it's much easier to dismiss a post as a troll without basis than to engage in rational discussion, and others who read this thread will understand that as well.

      So far, I've offered a rebuttal to your anti-Gaussian point of view, and you've posted nothing of any substance. If it makes you happy, omit that phrase which you find contentious from my above post, and post your response. At this point, you're the one giving the impression of trolling (i.e. trolling those who have an understanding of the Gaussian distribution's utility in statistics and the sciences).

    30. Re:So you want to retire a statistical term... by globaljustin · · Score: 0

      you have to read the entire comment, man

      the fact that you tried to argue that because I used the word (from the Parent I was responding to btw) 'banished' then developed that thought including exceptions that somehow you aren't a pedantic shit

      you're a pedantic shit either way, but i'm right about your overly literal yet under examined reading of my comment and I'm right about the Gaussian Distribution

      read the entire fucking comment, respond on topic and we havce a discussion otherwise fuck off

      --
      Thank you Dave Raggett
    31. Re:So you want to retire a statistical term... by akozakie · · Score: 1

      That's nice if you're testing something you can recreate in a lab or have lots of time. Sometimes you need to attempt to draw some conclusions based on a limited set of data points. In that case he's right - people too often forget that some mathematical tools need more data than others. You need to know your limits. If you're only half-sure that what you see is a car, forget about guessing the type. If you only have enough data to estimate the mean (badly), forget about variance. Standard deviation for 5 samples? Are you joking?

      It gets better, I've seen one (rejected) paper taking 10 samples and trying to estimate... the Hurst coefficient. Come on! Even the mean is still blurry!

    32. Re:So you want to retire a statistical term... by Lamps · · Score: 1

      I read the entire comment. On which "word" am I overly focused? Is it this "word":

      I'm happily surprised to learn I am not the only one who thinks the whole 'gaussian' should be banished.

      Am I taking that statement, which was presented in its own paragraph, out of context? Or are you arguing that you went on to contradict that statement later in your post? You understand the difference between developing a thought and contradicting it, don't you?

      I've repeatedly given you the opportunity for discussion. So far, you haven't demonstrated the intellectual capacity for that. Instead, you've offered unsubstantiated assertions, accusations, and ad-hominem attacks. My post was on topic, and you're doing everything you can to avoid staying on topic, focusing instead on the fact that I reiterated a sentiment that you expressed. Instead of admitting that your post was poorly articulated or potentially self-contradictory, you offer unsubstantiated assertions that you're correct.

      So far, along with slinging mud, your level of discourse has been precisely this:

      I'm right about the Gaussian Distribution

      So I'll give you another chance to prove that you have the intellectual capacity for something other than repeating dogma that you haven't quite integrated or understood, and for personal attacks: go ahead and respond to the points I made in my post. Stop pretending that stuff which directly addresses what you said is off-topic. Otherwise, go ahead and keep perpetuating the impression of yourself (thick, argumentative, and with a superficial understanding of things; a fraud) that you created in this thread. Otherwise, if you maintain your present course, and continue to contribute nothing of meaning or value, you can go ahead and fuck off.

    33. Re:So you want to retire a statistical term... by khallow · · Score: 1

      Which are the "heavy tail" distributions that have been complained about as the other replier noted.

    34. Re:So you want to retire a statistical term... by acaila_edhel · · Score: 1

      Except that you don't really know the author at all. He spends a lot of time making fun of "social scientists," economists, and most of all, Harvard Professors. Go pick up his book, "Anti-Fragile"

    35. Re:So you want to retire a statistical term... by hawk · · Score: 1

      Physics has a several hundred year head start on Economics. In some ways, we're about where Newton was . . .

      And we have older folks who haven't progressed what was taught when they were in grad school (e.g., Krugman). Samuelson observed that the field progresses "one funeral at a time."

      And testing hypotheses in economics tends to take longer, and we aren't allowed to tinker with economies to take measurements, for some reason :) [Congress hating competition?]

      hawk

    36. Re:So you want to retire a statistical term... by bingoUV · · Score: 1

      I don't think physics has any headstart. Economics is at least as old as the time when our ancestors started living in groups / tribes. Physics is no older. If economics is the less "science" for it, it is fully deserved.

      Yes, old people not progressing is a problem but it is in physics too. If economists have it in a greater degree, economics is the less "science" for it and quite deservedly.

      Yes, testing hypothesis in macro - economics is somewhat like the problem astro-physics has. Only passive observation is feasible, not active experimentation. Actually economists have convinced governments to alter serious macro-economic policies, astro physicists have been able to do far far less active experimentation. My impression of lack of consensus in economists is that it is greater than that in astro-physicists. I could be wrong, though.

      --
      Bingo Dictionary - Pragmatist, n. A myopic idealist.
  3. Basic Statistics by TechyImmigrant · · Score: 4, Insightful

    The meaning of standard deviation is something you learn on a basic statistics course.

    We don't ask biochemists to change their terms because the electron transport chain is complicated.
    We don't ask cryptographers to change their terms because the difference between extra entropy and multiplicative prediction resistance is not obvious.

    We should not ask statisticians to change their terms because people are too stupid to understand them.

    --
    I should use this sig to advertise my book ISBN-13 : 978-1501515132.
    1. Re:Basic Statistics by Mr+D+from+63 · · Score: 5, Funny

      We should not ask statisticians to change their terms because people are too stupid to understand them.

      But doesn't that give an unfair advantage to statisticians? You have to give everyone a chance!

    2. Re:Basic Statistics by JoeMerchant · · Score: 1

      Actually, meaningful and readily understood labels are a considered a good thing, and beneficial to those who work in the field they apply to.

      Except programming, there, based on my experience, you should use whatever label happens to be laying around - never change it, even if it means the opposite of what it does.

    3. Re:Basic Statistics by Anonymous Coward · · Score: 0

      You've misunderstood. Please read the post again.

    4. Re:Basic Statistics by Anonymous Coward · · Score: 0

      We don't ask biochemists to change their terms because the electron transport chain is complicated.
      We don't ask cryptographers to change their terms because the difference between extra entropy and multiplicative prediction resistance is not obvious.

      We should not ask statisticians to change their terms because people are too stupid to understand them.

      Nuclear Resonance Imaging (NMR) was changed because people were afraid of word Nuclear despite it describing the process, unlike its replacement term.

    5. Re:Basic Statistics by ClioCJS · · Score: 1

      If only scientists were statisticians, your comment might have actually been actionable.

      --
      -Clio
      Karma: Bad (mostly from not giving a fuck)
      Blog: http://clintjcl.wordpress.com
    6. Re:Basic Statistics by ShanghaiBill · · Score: 3, Informative

      The meaning of standard deviation is something you learn on a basic statistics course.

      I took a statistics course in college. The statistics professor taught us to think of the standard deviation as the "average distance from the average". So if you know the average (mean) then any random data sample will be (on average) one SD away. That is simple, neat, and easy to remember.

      It is also wrong.

    7. Re:Basic Statistics by Mashdar · · Score: 3, Interesting

      Didn't you hear? Guassians are so 1893. And so are all of the other distributions with convenient sigma terms...

      And TFS calls standard deviation "root mean square error", which is only true if you assume the mean is a constant estimator for the distribution :(

      In any case, no one picked Gaussians because they are so easy to integrate... While we're at it, TFA should suggest we round the number e to 3, because irrational numbers are hard, and who cares what natural law dictates.

    8. Re:Basic Statistics by almitydave · · Score: 1

      And yet, everyone refers to the act of cooking in a microwave as "nuking," and no one seems to have a problem with that.

      --
      my, your, his/her/its, our, your, their
      I'm, you're, he's/she's/it's, we're, you're, they're
    9. Re:Basic Statistics by MobyDisk · · Score: 1

      We should not ask statisticians to change their terms because people are too stupid to understand them.

      The author didn't ask anyone to change any terms. They asked people to stop using the wrong statistic. Ex: Don't use mean if you needed the median.

    10. Re:Basic Statistics by gninnor · · Score: 5, Funny

      Then it would be the same as pi, and that would just be silly.

    11. Re:Basic Statistics by Fly+Swatter · · Score: 3, Insightful

      Someone should tell that to the lawyers!

    12. Re:Basic Statistics by Anonymous Coward · · Score: 0

      Generally speaking, I am not inside the microwave while it is in use.

    13. Re:Basic Statistics by boristhespider · · Score: 1

      Not at all. If e^pi = pi^e then a common interview question would be a hell of a lot easier to answer.

    14. Re:Basic Statistics by wonkey_monkey · · Score: 2

      Mmm, pi^e.

      --
      systemd is Roko's Basilisk.
    15. Re:Basic Statistics by Fouquet · · Score: 2

      Not in actual physics labs, it wasn't. The change to MRI is only in public facing applications.

    16. Re:Basic Statistics by TsuruchiBrian · · Score: 1

      All you need to know is SOCATOA, whatever the fuck that means.

    17. Re:Basic Statistics by fermion · · Score: 1

      I can't really get to the article right now, but one this that is true is that Standard deviations only make sense if a sample results in a normal distribution. Normal distributions has certain qualities, one is that the mean=median=mode. If this is not true then one can still have a skewed normal curve. Many distributions a skewed normal curves, which means that a standard distribution is not necessarily the best model. Yet they are still used. This can be a problem. Here is why standard deviation is so important as a statistic. If one has a normal distribution, and one knows, for example that that a part has a nominal length of 1 cm and historical data shows that the machine makes the part with a standard distribution of 0.05 cm, then one has some confidence that almost 70% of the parts will be between 0.95 cm and 1.05 cm. Such knowledge is critical when putting things together and warranty issues. Over time if the parts are out of this range, or if the average changes, then one knows the machine is broken. There are distibutions that are normal. There are many that are not, like distribution of wealth. As far as I can tell stating an average and a distribution in that is pretty silly.

      --
      "She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
    18. Re:Basic Statistics by Daniel+Dvorkin · · Score: 1

      The replacement the article proposes (mean absolute deviation or MAD) is also only particularly meaningful if you're dealing with a symmetric distribution, so it really doesn't address the problem you identify.

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    19. Re: Basic Statistics by scrote-ma-hote · · Score: 1

      Are you sure it wasn't because there was already a nuclear medicine department in hospitals, band that might cause confusion. Cause people don't seem so upset about radionucliotide scanning...

    20. Re:Basic Statistics by gninnor · · Score: 2

      Honestly, of the different things I have studied all had jargon that could have been explained in simpler terms, often in shorter common words. So much of it is a wall to the "stupid" people and their understanding.

      Other times there are specific concepts with only one word. These need to be simplified and taught to when it is being introduced in journals, but that would be work and very few people have been trained to speak to laymen.

      Even within the sciences some shorthand jargon means one thing in chemistry and another in in biochemistry.

    21. Re:Basic Statistics by Anonymous Coward · · Score: 0

      Well some do, there are those that think food cooked in a microwave oven is bad for you (and in a way not connected to the preservation and break down of various nutrients versus other cooking methods) and that water heated in a microwave will kill plants.

      Regardless, almost every use of NMR outside of medical imaging still calls it NMR.

    22. Re:Basic Statistics by OneAhead · · Score: 1

      It's Nuclear Magnetic Resonance that was changed to Magnetic Resonance Imaging. And, as others have pointed out, only in the medical field. </pedantic nitpick>

    23. Re:Basic Statistics by Anonymous Coward · · Score: 1

      And yet, everyone refers to the act of cooking in a microwave as "nuking," and no one seems to have a problem with that.

      I don't; I call it "microwaving."

    24. Re: Basic Statistics by ozydingo · · Score: 1

      To be fair, standard deviation has a meaning for any distribution, but there's a lot you can infer from alone only when dealing with Gaussians. Many people try to infer those same things with non-Gaussian data, and there lies the problem.

    25. Re:Basic Statistics by TheloniousToady · · Score: 1

      Except me. I believe the correct term is "Radaranging".

    26. Re:Basic Statistics by Anonymous Coward · · Score: 0

      Perhaps the issue here is that the educational system hasn't done a good enough job explaining the difference to students.

    27. Re:Basic Statistics by stoborrobots · · Score: 4, Informative

      ... think of the standard deviation as the "average distance from the average" ... That is simple, neat, and easy to remember... It is also wrong.

      In fact, it is wrong in exactly the way that TFA suggests: you're describing the mean deviation...

    28. Re:Basic Statistics by ebno-10db · · Score: 1

      SOHCAHTOA

    29. Re: Basic Statistics by ebno-10db · · Score: 1

      They also have a bad habit of assuming that any distribution that's roughly bell shaped is Gaussian, or close enough anyway. See Gaussian copula, risk analysis, financial crisis.

    30. Re:Basic Statistics by demonlapin · · Score: 1

      And because the medical use is for image generation, not for substance identification.

    31. Re:Basic Statistics by FriendlyStatistician · · Score: 1

      Most statisticians consider standard deviation to be a more meaningful/fundamental measure than mean absolute deviation. I agree with Nassim Taleb that mean absolute deviation is easier to understand, but I disagree that we should switch to using the mean absolute deviation.

      (I should note that, contrary to the summary, Taleb is not properly a statistician--he's an economist).

    32. Re:Basic Statistics by reve_etrange · · Score: 1

      The point of the article is not that the standard deviation is "harder to explain," but that if you want to convey, say, how much the temperature could be expected to change on a day to day basis, then MAD is the better statistic. I'm sure NNT is aware that STD is better in most other contexts (e.g. comparing fits).

      --
      .: Semper Absurda :.
    33. Re:Basic Statistics by camperdave · · Score: 5, Funny

      The phrase "orbital process" means entirely different things to brain surgeons and rocket scientists.

      --
      When our name is on the back of your car, we're behind you all the way!
    34. Re:Basic Statistics by Daniel+Dvorkin · · Score: 3, Informative

      I should note that, contrary to the summary, Taleb is not properly a statistician--he's an economist

      To be fair, economics has contributed a lot to the growth of statistics as a field of study. Due to various historical quirks, econometrics developed as almost a separate field from statistics for decades, and economists have often looked at statistical problems with a fresh eye, and had insights that people working in the mainstream of statistics and biostatistics might have missed. In my own work, biostatistics-flavored bioinformatics, I've often found myself referring to the econometric literature.

      I have no idea if any of this applies to Taleb, though. Certainly TFA doesn't strike me as a particularly profound example of statistical reasoning ...

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    35. Re:Basic Statistics by Ken+D · · Score: 1

      And how do you remember that exactly?

      I much prefer "Some Old Horse Caught Another Horse Taking Oats Away"

    36. Re: Basic Statistics by flargleblarg · · Score: 1

      To be fair, standard deviation has a meaning for any distribution, [...]

      False.

      Cauchy distributions have neither a mean nor a standard deviation.

    37. Re:Basic Statistics by Jane+Q.+Public · · Score: 1

      Oh, stuff it opposite your hypotenuse.

    38. Re: Basic Statistics by RightwingNutjob · · Score: 1

      Bzzt. Mathematically correct but practically wrong. Any real or simulated dataset from which you would want to compute a standard deviation will have the property that it will be a list of (most likely) double precision floating that is finite in size. This data defines a distribution that always has a finite first and second moment, so you will get a number that you can confidently call the standard deviation of the data. Even if it comes from physical process with a nonsense distribution like a Cauchy distribution, the standard deviation you compute will give you a bound on the spread of your data. If it's Gaussian, you can go back to your statistics class and say that 95% of the data will be within two SD's, etc. If it's not, you can use the Chebyshev rule (http://en.wikipedia.org/wiki/Chebyshev_inequality) to say that at least 75 percent of the data will be in two SD's, 89% will be within 3 SD's, etc, which is much coarser information, but is still reasonable to look at for worst-case analysis.

    39. Re:Basic Statistics by The_Wilschon · · Score: 2

      You might be interested in this: http://en.wikipedia.org/wiki/Chebyshev's_inequality The standard deviation tells you something important about *any* distribution.

      --
      SIGSEGV caught, terminating

      wait... not that kind of sig.
    40. Re: Basic Statistics by The_Wilschon · · Score: 1

      Careful. Chebyshev's inequality doesn't help you if you are sampling from a physical process with a Cauchy distribution. Be careful not to confuse the *sample* standard deviation with the *population* standard deviation. The former always exists. The latter is what you use with Chebyshev's inequality... *if* it exists. In the case of a Cauchy distribution, your sample standard deviation would mislead you into thinking that the probability to fall outside N sample standard deviations had some particular bound that it did not have.

      --
      SIGSEGV caught, terminating

      wait... not that kind of sig.
    41. Re:Basic Statistics by The_Wilschon · · Score: 1

      Clarification. Chebyshev's inequality is not going to help you with distributions that have no mean or standard deviation. Note also that the standard deviation mentioned in Chebyshev's inequality is the *population* standard deviation, and NOT the *sample* standard deviation.

      --
      SIGSEGV caught, terminating

      wait... not that kind of sig.
    42. Re:Basic Statistics by Ford+Prefect · · Score: 4, Funny

      Nuclear Resonance Imaging (NMR) was changed because people were afraid of word Nuclear despite it describing the process, unlike its replacement term.

      Also, if you arrived at a hospital saying you were there for an NMR, you might have received something other than what you were expecting.

      --
      Tedious Bloggy Stuff - hooray?
    43. Re:Basic Statistics by Anonymous Coward · · Score: 0

      There are two meaning of standard deviation: a mathematical definition and a distribution interpretation. There is no argument about its mathematical definition, but mathematical definition is arbitrary and meaningless. In reality, when we use standard definition, we quickly focus on its distribution interpretation: there are around 68% confidence within mean+/-std, or we quickly plot a bell shape inside our mind. But the latter meaning is only valid under normal distribution. In reality, often we are not dealing with normal distribution -- stock price is such example. This is exactly the author's point: the use of standard deviation give us wrong idea.

      The mean deviation also has two meaning: a mathematical one and a interpreted one. But in this case, it is not distribution based, it is a straight forward mean -- whose common interpreted meaning is exactly its mathematical definition.

    44. Re:Basic Statistics by mysidia · · Score: 1

      Actually, meaningful and readily understood labels are a considered a good thing, and beneficial to those who work in the field they apply to.

      They want to do the equivalent of telling the C standards designers and programmers... You need to stop using a function named main() or _start as the entry point to your programmers.

      From henceforth.... main() should be renamed to Program()

      Also, instead of returning an INT, it should return a string with text describing the exit condition, or NULL if the program exits with no error.

      Also, the order of the arguments should be reversed, and instead of the entry point using an array of pointers to char*, it will be a pointer to an array with 256 elements, and instead of an integer argCount, argCount will be an array with the lengths of each element; the last program parameter will have length of zero, the new signature of the renamed main() function will be:

      extern char * Program( char (*argumentData)[256] , int argumentLength[256]);

    45. Re:Basic Statistics by jessetaylor84 · · Score: 1

      I wouldn't have a problem with them renaming it to make the terminology more usable/accessible. Usability is important, and leads to more rapid progress. However, what the author of this piece is suggesting: stopping the use of standard deviation altogether, is just stupid. People not understanding something might be a reason to try to improve pedagogy, but it's certainly not a reason to stop using it.

    46. Re:Basic Statistics by hawk · · Score: 1

      This gives us two possibilities:

      1) You had a bad statistics professor, or
      2) You shouldn't have received your passing grade.

      hawk

    47. Re:Basic Statistics by EMeta · · Score: 1

      Which is why we should teach statistics to every high schooler. Mostly everyone in the US education system gets trig and not stats. WHY? All of them need to have a decent understanding of statistics to be a useful voter and citizen. Very few of them need trig, and it doesn't enhance their mind in some way that their geometry class didn't. If you don't have space for a stat class after Algebra 2, cut that & put statistics there instead. There is no more important mathematical skill for everyone to have.

    48. Re:Basic Statistics by Man+On+Pink+Corner · · Score: 1

      That'd depend on what the professor meant by "distance."

    49. Re:Basic Statistics by EngnrFrmrlyKnownAsAC · · Score: 1

      Excuse me, I think you're in the wrong class. Remedial trigonometry is down the hall.

      --
      Howdy howdy howdy
    50. Re:Basic Statistics by EngnrFrmrlyKnownAsAC · · Score: 1

      And still entirely different things to rocket surgeons.

      --
      Howdy howdy howdy
    51. Re:Basic Statistics by sourcerror · · Score: 2

      Check your math privilege!

    52. Re:Basic Statistics by bingoUV · · Score: 2

      Bruce Lee summed it up - "Before I started martial arts, a punch was a punch and a kick was a kick. When I started martial arts, a punch was no longer a punch and a kick was no longer a kick. When I understood martial arts, a punch was a punch and a kick was a kick."

      Most people are stuck at the second level - stuck in technicalities. Few people ever reach the third level - where a punch is a punch and a kick is a kick, not because of ignorance of technicalities. But because they have transcended the technicalities.

      --
      Bingo Dictionary - Pragmatist, n. A myopic idealist.
    53. Re: Basic Statistics by leaen · · Score: 1

      Bzzt. Mathematically correct but practically wrong. Any real or simulated dataset from which you would want to compute a standard deviation will have the property that it will be a list of (most likely) double precision floating that is finite in size. This data defines a distribution that always has a finite first and second moment, so you will get a number that you can confidently call the standard deviation of the data. Even if it comes from physical process with a nonsense distribution like a Cauchy distribution, the standard deviation you compute will give you a bound on the spread of your data. If it's Gaussian, you can go back to your statistics class and say that 95% of the data will be within two SD's, etc. If it's not, you can use the Chebyshev rule (http://en.wikipedia.org/wiki/Chebyshev_inequality) to say that at least 75 percent of the data will be in two SD's, 89% will be within 3 SD's, etc, which is much coarser information, but is still reasonable to look at for worst-case analysis.

      Yes but also useless when you have enough data to get reasonable mean estimate. You do not need Chebyshev inequality for getting confidence intervals, just compute appropriate 2.5percentile and 97.5percentile for 95% interval. By Glivenkoâ"Cantelli theorem these for measurable function converge regardless to distribution and are not sensitive to outliers.

    54. Re:Basic Statistics by Anonymous Coward · · Score: 0

      For the uninitiated, what is the common interview question you're referring to?

    55. Re:Basic Statistics by boristhespider · · Score: 1

      "How would you tell whether pi^e or e^pi is bigger?"

      "With a calculator" will get you a forced smile.

    56. Re:Basic Statistics by Anonymous Coward · · Score: 0

      I don't want my orbits processed. My eyes will pop out.

    57. Re:Basic Statistics by Anonymous Coward · · Score: 0

      Only in New England

    58. Re:Basic Statistics by Anonymous Coward · · Score: 0

      I was taught:
      Some Old Hippy
      Caught Another Hippy
      Tripping On Acid

      Always easier to remember...

  4. Issues by Edward+Kmett · · Score: 5, Informative

    On the other hand, you also need to use 2-pass algorithms to compute Mean Absolute Deviation, whereas STD can be easily calculated in one pass. And you still need standard deviation as it relates directly to the second moment about the mean.

    Also, annoyingly, Median Absolute Deviation competes for the MAD name and is more robust against outliers.

    --
    Sanity is a sandbox. I prefer the swings.
    1. Re:Issues by Animats · · Score: 2

      On the other hand, you also need to use 2-pass algorithms to compute Mean Absolute Deviation, whereas STD can be easily calculated in one pass. And you still need standard deviation as it relates directly to the second moment about the mean.

      Right. Some common measures in statistics date from the paper and pencil era, back when computation was really expensive. The same issue applies to least mean squares curve fitting, which is cheap to compute but overweights values far from the curve. This is well known, and was recognized decades ago. This is not something Talib "discovered", or even popularized.

      (If you want to annoy Taleb and his flunkies, ask hard questions about the actual performance of his funds in years other than 2008.)

    2. Re:Issues by Okian+Warrior · · Score: 1

      On the other hand, you also need to use 2-pass algorithms to compute Mean Absolute Deviation, whereas STD can be easily calculated in one pass.

      Okay, just to be clear: you're saying that we should use STD because (in part) it's faster and easier to calculate?

      Isn't that like the drunk looking for his keys under the lamppost - instead of where he dropped them - because the light is better?

    3. Re:Issues by Anonymous Coward · · Score: 1

      The same issue applies to least mean squares curve fitting, which is cheap to compute but overweights values far from the curve.

      Sadly, minimizing MAD leads to multiple solutions and that's why I normally use least means squares instead of least absolute mean.

    4. Re:Issues by Jane+Q.+Public · · Score: 0

      "On the other hand, you also need to use 2-pass algorithms to compute Mean Absolute Deviation"

      Since when?

      Pseudo-code:

      1) Start with S = 0 and I = 0.

      2) For each data point starting with the second:

      3) Add 1 to I. S = S + absolute value of (this data point - previous data point).

      4) When all the data points are collected, MAD = S / I

      Where's the difficulty?

    5. Re:Issues by Edward+Kmett · · Score: 1

      Sadly, the problem is your proposed algorithm doesn't work.

      Consider the sequence [1,2..100], The real mean absolute deviation is 25.

      Your algorithms yields er.. something around 1?

      Mean absolute deviation requires you to sum over a bunch of absolute values of differences to a number you don't know a priori.

      Unlike stddev's use of x^2, abs doesn't have a continuous derivative and can't be split out into calculations in terms of the moments around 0, and you can't borrow Chan's algorithm.

      Basically, to my knowledge, there isn't a sufficient statistic you can accumulate for MAD (either version of MAD) that takes less space than the original data.

      --
      Sanity is a sandbox. I prefer the swings.
    6. Re:Issues by wonkey_monkey · · Score: 1

      Data points: 1 2 10

      First pair: (1,2). S = 0+abs(1-2) = 0+1 = 1
      Second pair: (2,10). S = 1+abs(2-10) = 1+8 = 9
      MAD=9/2=4.5
      ----
      Data points: 1 10 2

      First pair: (1,10). S = 0+abs(1-10) = 0+9 = 9
      Second pair: (10,2). S = 9+abs(10-2) = 9+8 = 17
      MAD=17/2=8.5
      ----
      Two different results from the same data points. Have I misunderstood something?

      --
      systemd is Roko's Basilisk.
    7. Re:Issues by Edward+Kmett · · Score: 1

      Mean absolute deviation is a useful statistic, though I tend to actually prefer median absolute deviation.

      You can actually prove that.

      median abs dev = mean abs dev = standard deviation

      You can also prove that median abs dev will provide the minimal absolute deviation from any number, so in that sense mean abs dev is kind of a strange choice here.

      That said, we can say a few things about it.

      It is a pain in the ass to calculate. It also tends to favor solutions that let outliers run wild. Least squares provides a compromise that favors explaining the outliers over fiddling around trying to slightly better fit the rest, but doesn't go so far as to give all the weight to the outliers like higher norms.

      Often you can actually start from something based on L2 norm and 'improve it' to L1 norm, e.g. this is commonly done to refine a solution in terms of least squares in terms of least absolute deviation.

      Standard deviation is mostly used to get a feel for the 'shape' of how spread out your data is.

      I'm not saying mean abs dev it isn't a useful statistic, but like most things promoted by Nassim Taleb it isn't the panacea he purports it to be, and on sufficiently large data sets where calculating a proper median abs dev can be prohibitively expensive or nigh impossible, the mean abs dev isn't going to be enough "better" than the standard deviation for most purposes that it is worth the loss of ability to compute it online.

      --
      Sanity is a sandbox. I prefer the swings.
    8. Re:Issues by reve_etrange · · Score: 1

      The "mean absolute deviation" is the mean absolute difference between each point and the mean of the points. Not the mean absolute difference between each point and the following point.

      --
      .: Semper Absurda :.
    9. Re:Issues by Jane+Q.+Public · · Score: 1

      "Mean absolute deviation requires you to sum over a bunch of absolute values of differences to a number you don't know a priori."

      You are right... I did not subtract the mean, so what you say is correct: it is a "two-pass" operation. But it's still so easy to do, I wonder why that bothers you. As long as you already have the data, you don't need "as much space" as the original data more than the original data... you just re-use the original data.

      Using Taleb's example (daily temperature):

      First calculate the mean.

      Then, you can use exactly the same algorithm I mentioned above, except that you subtract the mean from each value rather than one value from the previous value.

      Yes, it's two passes (because you have to calculate the mean first), but it's dirt simple and as for storage space, the program only uses a couple of integers; there is no need to store any large data sets aside from the original.

    10. Re:Issues by gringer · · Score: 1

      Two different results from the same data points. Have I misunderstood something?

      I believe it should be mean absolute deviation from the mean, rather than from the next value in the list (this wasn't particularly clear in the summary or the article). So you have three numbers, mean = (1 + 2 + 10) / 3 ~= 4.3333, MAD ~= (3.333 + 2.333 + 5.666) / 3 ~= 3.778

      There's another MAD, the median absolute deviation from the median, so you have for this data set median = 2, MAD = median(1, 0, 8) = 1.

      --
      Ask me about repetitive DNA
    11. Re:Issues by aminorex · · Score: 1

      > you also need to use 2-pass algorithms to compute Mean Absolute Deviation

      Only naively. One pass will suffice.

      --
      -I like my women like I like my tea: green-
    12. Re:Issues by Jane+Q.+Public · · Score: 1

      Yes, I understand that... I left out the mean. That has to be calculated first, so it is actually a "two-pass" operation.

      Still very easy to do, though.

    13. Re:Issues by reve_etrange · · Score: 1

      Sorry, just thought the vague terminology could have been responsible for the 1-pass / 2-pass confusion.

      --
      .: Semper Absurda :.
    14. Re: Issues by Edward+Kmett · · Score: 1

      I am unaware of such an algorithm. Reference?

      --
      Sanity is a sandbox. I prefer the swings.
    15. Re: Issues by Edward+Kmett · · Score: 1

      Doing two passes means I cannot update a previous answer in response to small deltas of information. If I have 2 petabytes of data and 20 megs of updates come in, I have to go back to the well and touch all 2 petabytes, I can't just update my sufficient statistics. The incompressibility of this statistic was what I was referring to with that comment. It rules out many scenarios.

      If I am interested in just moments around the mean I can collect them as summary statistics in tree form, and usually compute them in log time. That use case is also gone. At the scale of data I routinely work with O(n) isn't in the cards and the incompressibility of either MAD works actively against it more so than the 2x factor would naively make it sound,

      There exist approximations to the mean abs dev that can be computed online that converge in the limit, but nothing that yields the correct answer exactly.

      --
      Sanity is a sandbox. I prefer the swings.
    16. Re:Issues by Edward+Kmett · · Score: 1

      Those were less than or equals that got eaten by slashdot. Bah.

      median abs dev is less than or equal to mean abs dev which is less than or equal to standard deviation.

      So while Nassim is correct that mean abs dev is always smaller than standard deviation, the median abs dev is smaller still.

      --
      Sanity is a sandbox. I prefer the swings.
    17. Re: Issues by Jane+Q.+Public · · Score: 1

      "Doing two passes means I cannot update a previous answer in response to small deltas of information. If I have 2 petabytes of data and 20 megs of updates come in, I have to go back to the well and touch all 2 petabytes, I can't just update my sufficient statistics. The incompressibility of this statistic was what I was referring to with that comment. It rules out many scenarios."

      I can understand why that would be annoying. But have you considered data management schemes? For example: let's say you have a known, dated dataset. You get your overall sum and number of data points, and from that calculate a mean and go back to get a sum of variances, and calculate your MAD. But then you store those values: overall sum, # data points, sum of variances. (The sums might well be very large numbers, but some languages will do arbitrary-precision math without a hiccup.)

      In that case, when you add more data, you only have to go through the new data to calculate a new overall sum, number of points, and sum of variances again. From those and the old values you can calculate a new mean and new MAD.

      Just a thought. But if your datasets are as large as you suggest, and getting larger, then it may be worth your while.

    18. Re:Issues by Jane+Q.+Public · · Score: 1

      You were correct. I had it wrong the first time.

    19. Re:Issues by mark-t · · Score: 1

      A whole lot faster actually. Mean deviation is O(n) for time and requires that all of the data be stored. Standard deviation calculation is O(1), and does not reequire access to the original data, so it can be used on ephemeral data sources or data streams that are unbounded in size, where the calculation needs to still be made regularly.

    20. Re: Issues by Edward+Kmett · · Score: 1

      The problem here is the mean abs dev for a larger dataset can't be composed out of mean abs deviations for the smaller subsets. e.g. Having the mean abs devs for a bunch of days is useless for computing it for the month.

      It just isn't a very good number to use for aggregations in that it is less useful than median absolute deviation for characterizing the data robustly and dramatically harder to compute than standard deviation.

      The article naively makes it out like standard deviation is some huge historical mistake, while holding out a mediocre tool as a universal replacement.

      Amusingly had he argued for the even harder to calculate median absolute deviation he'd have had a much stronger case, as it fits his overall "antifragile" narrative much better than mean abs dev in some senses and is even smaller, which he seems to like in the article for some reason.

      --
      Sanity is a sandbox. I prefer the swings.
    21. Re:Issues by mark-t · · Score: 1
      Incorrect... you need one pass to collect the data, and a second pass to compute the mean deviation. Both passes are O(n). You do not need to do a second pass to compute the standard deviation, it can be calculated in O(1) time based on data collected in the first pass.

      If you are only computing this once, doing two O(n)'s is just O(n), but if you are wanting to continually recalculate the mean as you add more elements to your data set, then the difference between them becomes much larger... mean deviation with data collection ends up being quadratic with the amount of data collected, while standard deviation with data collection remains linear with the amount of data collected.

    22. Re: Issues by Jane+Q.+Public · · Score: 1

      The problem here is the mean abs dev for a larger dataset can't be composed out of mean abs deviations for the smaller subsets. e.g. Having the mean abs devs for a bunch of days is useless for computing it for the month.

      No, you're right. Having given it more thought, I saw that it wouldn't work because your mean can change. Too bad, though.

    23. Re:Issues by leaen · · Score: 1

      ncorrect... you need one pass to collect the data, and a second pass to compute the mean deviation. Both passes are O(n). You do not need to do a second pass to compute the standard deviation, it can be calculated in O(1) time based on data collected in the first pass. If you are only computing this once, doing two O(n)'s is just O(n), but if you are wanting to continually recalculate the mean as you add more elements to your data set, then the difference between them becomes much larger... mean deviation with data collection ends up being quadratic with the amount of data collected, while standard deviation with data collection remains linear with the amount of data collected.

      Still incorrect, you need to know data structures for that. When you use a red-black tree where you in node maintain sum of element below node then you can compute sum of elements in arbitrary interval in O(log(n)) time. That cuts complexity from quadratic to O(n log n)

    24. Re:Issues by martin-boundary · · Score: 1

      Computation is still really expensive, for worthwhile problems. Hard problems today tend to deal either with huge datasets where the difference between one and two passes is measured in days or weeks. Other hard problems have realtime constraints where every microsecond counts, and you still can't afford two passes over one pass..

    25. Re:Issues by mark-t · · Score: 1

      Collecting the data alone is a log(n) step... and can be worse if you are trying to keep the data sorted while you collect it. How can you calculate the mean deviation at any time without revisiting all of the data points that you have collected so far? How can you calculate do it in any time better than O(n)? Calculating standard deviation takes O(1) and does not require reexamining the data at all if you've been keeping track of right things during data collection (which still takes O(n)).

    26. Re:Issues by leaen · · Score: 1

      Collecting the data alone is a log(n) step... and can be worse if you are trying to keep the data sorted while you collect it.

      Use red-black trees these keep data sorted with O(log(n)) worst case bound for insertion.

      How can you calculate the mean deviation at any time without revisiting all of the data points that you have collected so far? How can you calculate do it in any time better than O(n)? Calculating standard deviation takes O(1) and does not require reexamining the data at all if you've been keeping track of right things during data collection (which still takes O(n)).

      That is typical exam question for data structures class. You maintain a red-black tree and for node you keep a sum and count of elements of its subtree (you need to update these in rotation and thats it). As red-black tree has logarithmic height you easily find sum of elements greater than given number in logarithmic time. Just do binary search and sum values for subtrees whose smallest element is greater than searched element.
      Once you have that a mean absolute difference by following expression
      (sum_greater(mean) - count_greater(mean) * mean) + (count_less(mean) * mean - sum_less(mean))
      and you can get each term in O(log (n)) time.

    27. Re:Issues by mark-t · · Score: 1

      O(log(n)) for each and every insertion, yes... when you are doing n insertions, that becomes O(n log n). If you are trying to compute the mean deviation at every step as well, you are looking at O(n^2 log n), because you cannot compute the mean deviation without revisiting *every* element you've collected so far, regardless of how they are stored or sorted.

      But standard deviation calcuations don't even require that you even insert the data that you are monitoring into a data structure at all... it only requires that you see the data once, and you never need to look at it again, just like the mean calculation, which can be done in O(1). The total time, therefore, including data collection is just O(n), because that's the amount of time required to collect n data points without storing them all... only storing sums of data collected so far. (but with standard deviation, you need to also store the sum of the squares of all the data collected as well).

    28. Re:Issues by leaen · · Score: 1

      O(log(n)) for each and every insertion, yes... when you are doing n insertions, that becomes O(n log n). If you are trying to compute the mean deviation at every step as well, you are looking at O(n^2 log n),

      No, you just failed data structure class. Insertion takes O(log(n)) with bookkeeping needed to find mean standard deviation in O(log(n)) time which gives a O(n log n) total time. All you need to know to calculate deviation is sum and number of elements above mean and sum of elements below mean. I explained it in more detail in parent post, there is standard data structure that can calculate sum of elements in of elements in given range in O(log(n)) time and supports insertion in O(log (n)) time.
      A quick google query found following implementation: http://kaba.hilvi.org/pastel/pastel/sys/redblacktree.htm

      because you cannot compute the mean deviation without revisiting *every* element you've collected so far, regardless of how they are stored or sorted.

      Repeating a lie does not make it true.

    29. Re:Issues by mark-t · · Score: 1

      Explain how you can calculate mean deviation without revisiting every data point you collected to see what it originally was. It is my assertion that you cannot.

      Explain how you can revisit every element in a red-black tree that has been inserted into it in anything less than O(n) time.

      I do not dispute the access time of red-black trees that you've cited, what I am saying is that you apparently don't understand the actual scope of the problem of computing these values to think that such a data structure would even begin to make a program that needs to do something like compute standard deviation more efficiently unless it was implemented absurdly stupidly in the first place.

      I repeat, the calculation for standard deviation can be performed in O(1) time from aggregate data that uses O(1) space, which can be collected as you make an initial pass over the data (which is by itself an O(n) operation). Calculating the mean, the standard deviation, and other similar statistics can be directly computed in constant time without having access to the original data points collected, and only the aggregate data..

  5. That's not the problem. by khasim · · Score: 4, Insightful

    The problem is that people think they understand statistics when all they know is how to enter numbers into a program to generate "statistics".

    They mistake the tools-used-to-make-the-model for reality. Whether intentionally or not.

    1. Re:That's not the problem. by Deadstick · · Score: 4, Interesting

      Three characterizations of statistics, in ascending order of accuracy:

      1. There are lies, damned lies, and statistics.

      2. Figures don't lie, but liars figure.

      3. Statistics is like dynamite. Use it properly, and you can move mountains. Use it improperly, and the mountain comes down on you.

    2. Re:That's not the problem. by JoeMerchant · · Score: 5, Insightful

      The problem is that peoples' attention spans are rapidly approaching that of a water-flea.

      Up until the past 50 or so years, people who learned about Standard Deviation would do so in environments with far less stimulation and distraction. Their lives weren't so filled with extra-curricular activities and entertainments that they never sat for a moment from waking until sleep without some form of stimulus based pastime. When they "understood" the concept, there was time for it to ruminate and gel into a meaningful set of connections with how it is calculated and commonly applied. Today, if you can guess the right answer from a set of 4 choices often enough, you are certified expert and given a high level degree in the subject.

      Not bashing modern life, it's great, but it isn't making many "great thinkers" in the mold of the 19th century mathematicians. We do more, with less understanding of how, or why.

    3. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Ha! My squirrel ate your water-flea

    4. Re:That's not the problem. by khasim · · Score: 1

      Up until the past 50 or so years, people who learned about Standard Deviation would do so in environments with far less stimulation and distraction.

      They also did so in an environment where they had to do all the math by hand (or with a slide rule).

      The math is not difficult. But it is repetetive in the extreme. So unless you were a savant you learned to pay very close attention to the numbers and what they represented. For those of you who didn't take statistics, here's a link to show you how standard deviation is calculated. With only 6 items:
      http://www.wikihow.com/Calculate-Standard-Deviation

      Imagine doing that, by hand, with a hundred items. And that is just finding the standard deviation.

      Now you can get the "answer" with nothing more than copy-paste. And if that "answer" doesn't suit you then you tweak the input and get another "answer" a second later.

    5. Re:That's not the problem. by dcollins · · Score: 2

      In Soviet Russia, improper statistics puts you up on mountain.

      --
      We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
    6. Re:That's not the problem. by Nemyst · · Score: 1

      I really, really wish I could finish my Master's just by guessing the right answer on a multiple-choices exam. Sadly, it would appear that either you're wrong, or I've picked a subject where that gross oversimplification does not apply. Either way, I think you're being blinded by your nostalgia goggles.

    7. Re:That's not the problem. by TsuruchiBrian · · Score: 3, Informative

      Not bashing modern life, it's great, but it isn't making many "great thinkers" in the mold of the 19th century mathematicians. We do more, with less understanding of how, or why.

      The easier math problems are lower hanging fruit. As time goes on, the problems that are left become increasingly hard. Even when they get solved, average people can't understand what it means, and that makes it hard to care about, and hward for newspapers to make money covering that story.

      Also when you read about the history of mathematics, it's easy to feel like these breakthroughs were happening all the time, compared with now, when in fact they were very slowly, and the pace of discovery is probably higher now than at any point in the past.

      It's easy to say music was better in the 70's than now when you condense the 70's down to 100 truly great songs, forgetting all the crap, and compare it to whats playing on the radio today.

    8. Re:That's not the problem. by Anonymous Coward · · Score: 0

      tl;dr

    9. Re:That's not the problem. by JoeMerchant · · Score: 1

      True, I'm comparing today's "median" or perhaps "average" University student with the same "average" student from 50 to 80 years ago.

      We've got a lot more population, and probably more great thinkers alive today than in the entirety of the 1800-1950 timespan, more people with opportunity, means, etc.

      It's just the everyday UniGrad you meet that I'm lamenting.

    10. Re:That's not the problem. by JoeMerchant · · Score: 1

      I took the thesis option for my Masters', but I was in the minority, most preferred to take extra classes and just get the paper.

      If you select your institution, courses and professors carefully, I bet you can get a degree with mostly multiple choice testing determining the grades.

    11. Re:That's not the problem. by TsuruchiBrian · · Score: 4, Insightful

      I think it's also true that a larger percentage of people are going to university, so the average "intelligence" of people in university in terms of natural ability is probably lower now than when it was just the very best students attending.

      Most of the mediocre students today would have simply not gone to university in the past. I think the same principle holds when it comes to things like blogs. The fact that public discourse can sometimes make it seem as if people are getting dumber, when it is really just that more and more people know how to read and write and can now even be published, whereas in the past, there was a higher cost to publishing, and you were more likely to have something important to say before being willing to incur that cost.

    12. Re:That's not the problem. by OneAhead · · Score: 1

      My hypothesis is that Generation X-ers (such as me) and older - who didn't grow up with cell phones and internet (and the two combined into smartphones) - have an inherent disadvantage in coping with information overload because they weren't immersed in it as a kid. They see the younger part of Generation Y and Generation Z fluently switching between information sources and think they are unfocused, while in reality, their brain is just wired from childhood to be better at context switching / multitasking. If they really were all suffering from some tech-induced form of ADD, they wouldn't be nearly as successful in the workforce as they reportedly are.

      Disclaimer: I'm well aware of the stereotypes and overgeneralizations in this post. I'm talking on average. You know what I mean. And no, no standard deviation given.

    13. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Gen Y and Z are successful in the workforce? Citation needed. I was just reading earlier today that they are narcissistic, whiny bitches with entitlement complexes. Late boomers and Gen X built the current world and very much still run it.

    14. Re:That's not the problem. by phantomfive · · Score: 1

      My hypothesis is that Generation X-ers (such as me) and older - who didn't grow up with cell phones and internet (and the two combined into smartphones) - have an inherent disadvantage in coping with information overload because they weren't immersed in it as a kid.

      I'm curious, do you feel you have trouble coping with information overload?

      --
      "First they came for the slanderers and i said nothing."
    15. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Hey, don't be dissin the water-fleas. What have they done to deserve this!?

    16. Re:That's not the problem. by phantomfive · · Score: 1

      Well said.

      --
      "First they came for the slanderers and i said nothing."
    17. Re:That's not the problem. by flargleblarg · · Score: 1

      Ha! My squirrel ate your water-flea

      Ha! My dog ate your squirrel

    18. Re:That's not the problem. by OneAhead · · Score: 1

      Oh yes! It's hard to stay focused on one thing these days. I would be tempted to leave my e-mail client and my cell phone off during working hours, but people don't appreciate that, so I only do the e-mail client and only when I'm really busy.

    19. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Or, as a famous quip put it:
      The trouble with modern education is that now the illiterate can read and write.

    20. Re:That's not the problem. by phantomfive · · Score: 1

      Practice, you'll get better at it.

      --
      "First they came for the slanderers and i said nothing."
    21. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Huh? I didn't even know it's possible to get masters anywhere without thesis. If I'm ever hiring I gotta remember to check this thing out. I laugh at your "i did extra coursework" masters.

    22. Re:That's not the problem. by Anonymous Coward · · Score: 0

      They also did so in an environment where they had to do all the math by hand (or with a slide rule)

      I thought they used women to do the calculations.

    23. Re:That's not the problem. by Anonymous Coward · · Score: 0

      I'm curious, do you feel you have trouble coping with information overload?

      I'm curious, were you aware that you have trouble coping with information overload? See you missed the part where he wrote that he is a member of a class of people who "have an inherent disadvantage in coping with information overload." Of course I would have thought overload already implied not coping.

    24. Re:That's not the problem. by Eskarel · · Score: 1

      No I think it's more that standard deviation is a horrible name for what standard deviation actually is. On top of that it's often used by the wrong people for the wrong purpose(in part because it's badly named). It's certainly a useful tool for people who know what it does and what it's for, but when non statisticians read it they are going to assume that it actually represents MAD, which it doesn't.

      I don't think the author means for a second that we shouldn't use STD for any purpose, but that as a general rule it should stop being used to highlight results. This makes sense because the audiences for most papers, even highly scientific ones are not statisticians and will not really understand STD in any meaningful way.

    25. Re:That's not the problem. by Eskarel · · Score: 1

      I'd actually suggest that the "average" university student from today is probably smarter than the "average" student from 80 years ago. 80 years ago the primary determining factor for university admission wasn't how smart you were, it was whether you had enough money to attend. On top of that much of what was taught in universities 80 years ago is actually core curriculum in high school today. In terms of 50 years ago that's post GI bill territory so it wouldn't really be much different than today.

    26. Re:That's not the problem. by Anonymous Coward · · Score: 0

      You forgot surprise and a fanatical devotion to the Pope.

    27. Re:That's not the problem. by Anonymous Coward · · Score: 0

      Ha! Like women could ever do math.

    28. Re:That's not the problem. by JoeMerchant · · Score: 1

      The audiences for most papers understand very clearly about STDs, and the probability of contracting one is reduced by using a condom.

    29. Re:That's not the problem. by Anonymous Coward · · Score: 0

      He could have been talking about the online intro statistics courses that are springing up everywhere.

      The master's degree is still safe.

    30. Re:That's not the problem. by OneAhead · · Score: 1

      Yeah, but probably never as good as some of these kids who grew up with it.

    31. Re:That's not the problem. by phantomfive · · Score: 1

      But you'll have other skills that will make up for it.

      --
      "First they came for the slanderers and i said nothing."
  6. Re:Would those data scientists with PhDs by Anonymous Coward · · Score: 0

    Not even close. As someone just a field or two over from climate science, I gotta say that I've never heard of a data scientist before. They have nothing in common.

  7. because we're so perfect by Anonymous Coward · · Score: 0

    error free written right in to our scriptdead pretense.

  8. Useful in some situations but nonsense in others by Anonymous Coward · · Score: 0

    If the value being measured is a voltage or current the square is proportional to energy (or power) so standard deviation has an important physical interpretation. In other applications it could be worthless. No one measure works for all cases - apply the correct tool for the job.

  9. Standard Deviation is Important by njnnja · · Score: 5, Informative

    Standard Deviation is the square root of the second moment about the mean, an important fundamental concept to probability distributions. Looking at moments of probability distributions gives us lots of tools that have been developed over the years and in many cases we can apply closed form solutions with reasonably lenient assumptions. Then we apply the square root in order to put it in the same units as the original list of observations and get some of the heuristic advantages that he attributes to the mean absolute deviation.

    But it is a balance, and any data set should be looked at from multiple angles, with multiple summary statistics. To say MAD is better that standard deviation is a reasonable point (with which I would disagree), but to say we should stop using standard deviation (the point made in TFA) is totally incorrect.

    1. Re:Standard Deviation is Important by Anonymous Coward · · Score: 1, Informative

      This.

      Standard Deviation is the square root of the second moment about the mean, an important fundamental concept to probability distributions.

      More generally, it is the L^2-norm of deviation from the mean which will open up theory for Hilbert spaces and functional analysis in general. Try to beat that. You shouldn't discard anything because people use it wrong. You should teach students today to use it right instead. p-value has been as big, if not bigger, a problem.

    2. Re:Standard Deviation is Important by JanneM · · Score: 2

      What he is saying is not that statisticians should stop using SD in statistical theory or anything. What he's saying is that non-statisticians should stop using SD as a measure of variability when describing their data to each other. And since everybody (except statisticians) think SD is the average deviation from the mean, then people should perhaps use that instead, and reduce confusion for everyone.

      --
      Trust the Computer. The Computer is your friend.
    3. Re:Standard Deviation is Important by Anonymous Coward · · Score: 0

      I'm not a statistician and I knew what standard deviation is. Granted, I'm a mathematician.

    4. Re:Standard Deviation is Important by Anonymous Coward · · Score: 0

      We have so much theory on second moment distributions because those happen to be mathematically convenient --- you can calculate all sorts of useful properties from first principles using pencil and paper. However, most work today is done with different tools from pencil and paper; it's easy to use a computer to numerically calculate all sorts of stuff that would be prohibitatively difficult without analytical shortcuts "by hand."

      Since few people bother to understand the analytical theory anyway, perhaps there are good reasons to switch to more "brute force" computationally intensive methods (which may be easier to explain/understand, even if they take more arithmetical operations to carry out). Instead of approximating everything as Gaussian distributions (because they're analytically tractable), researchers can now numerically manipulate arbitrary probability distributions and carry them all the way through calculations. Performing statistics on arbitrary probability distributions will prevent people from making bad simplifying assumptions because they don't understand how the magical analytical tricks work --- it's computationally harder, but, in a sense, conceptually clearer.

    5. Re:Standard Deviation is Important by neonsignal · · Score: 3, Insightful

      I'm a little surprised at Nassim Taleb's position on this.

      He has rightly pointed out that not all distributions that we encounter are Gaussian, and that the outliers (the 'black swans') can be more common than we expect. But moving to a mean absolute deviation hides these effects even more than standard deviation; outliers are further discounted. This would mean that the null hypothesis in studies is more likely to be rejected (mean absolute deviation is typically smaller than standard deviation), and we will be finding 'correlations' everywhere.

      For non-Gaussian distributions, the solution is not to discard standard deviation, but to reframe the distribution. For example, for some scale invariant distributions, one could take the standard deviation of the log of the values, which would then translate to a deviation 'index' or 'factor'.

      I agree with him that standard deviation is not trustworthy if you apply it blindly. If the standard deviation of a particular distribution is not stable, I want to know about it (not hide it), and come up with a better measure of deviation for that distribution. But I think the emphasis should be on identifying the distributions being studied, rather than trying to push mean absolute deviation as a catch-all measure.

      And for Gaussian distributions (which are not uncommon), standard deviation makes a lot of sense mathematically (for the reasons outlined in the parent post).

    6. Re:Standard Deviation is Important by rsclient · · Score: 1

      So, the solution to "standard deviation is hard" to is rephrase it in terms of "square root of the second moment about the mean"? I'm on board! that's totally simpler and more intuitive!

      (note for the sarcasm-impaired: that was sarcasm)

      --
      Want a sig like mine? Join ACM's SigSig today!
    7. Re:Standard Deviation is Important by Anonymous Coward · · Score: 1

      Parent is absolutely correct. I'd like extend on this from a different perspective. From TFA:

      The confusion started then: people thought it meant mean deviation.

      But, avoiding the functional analysis terminology, it IS correct to think it means "mean" deviation. The confusion, suffered by the author of TFA too, results from the fact that "mean" refers to lots of different functions. So while standard deviation is not *the* mean deviation, it certainly is *a* mean deviation.

      Root mean square, which is what the standard deviation is, IS a mean. As are the arithmetic mean, Gauss' arithmetic-geometric mean, the identric mean, the logarithmic mean, etc.., and countless others without fancy names.

      If I travel at 10 km/hr for half the distance to my destination and then 20 km/hr for the rest of the trip, the appropriate "mean" speed is the harmonic mean.

      If I invest at an annual growth rate of 1.03 for one year and then 1.07 for another, the appropriate "mean" annual growth rate is the geometric mean.

      If I measure centrality of my data with the arithmetic mean (which in statistical terminology is then *the* mean, just to create confusion), then the appropriate "mean" deviation is the root mean square (which in statistical terminology is the standard deviation). To do otherwise is as absurd and ill-informed as to say that my average speed above was 15 km/hr or that my average annual growth rate was 1.05.

    8. Re:Standard Deviation is Important by reve_etrange · · Score: 1

      All he is saying is that if you want to tell someone, e.g. what temperature change to expect from day-to-day, then the MAD is the right measure to use.

      NNT's post doesn't at all state that MAD should be used instead of STD in any other context.

      --
      .: Semper Absurda :.
    9. Re:Standard Deviation is Important by Anonymous Coward · · Score: 0

      Choosing a different measure of deviation does not mean the null is "more likely to be rejected." Nulls are rejected based on the assumed underlying distribution, not the particular measure of deviation. Standard deviation happens to factor conveniently into the calculation of the null (and violations of the null) for normal distributions. One would not simply substitute mean for standard deviation, and plenty of hypothesis tests (think non-parametrics) can't/don't use standard deviation at all.

      On the other hand, there's nothing wrong with standard deviation, and there's plenty that's useful about it. Most importantly, as indicated in an earlier comment, the relation between variance and the second moment makes standard deviation (or at least its square) immensely valuable as part of the intuitive progression from mean to variance to skew and so on. It also turns up (naturally, not by force!) in the formulation of various useful statistics. As a scientist, my only gripe is that more of us don't grasp the difference in meaning and appropriate application of standard deviation and standard error of the mean.

    10. Re:Standard Deviation is Important by AthanasiusKircher · · Score: 1

      And since everybody (except statisticians) think SD is the average deviation from the mean,

      Who the heck thinks standard deviation is that? I mean, seriously... I understood the definition of standard deviation in middle school. More importantly, I understood a few of its uses, and I wouldn't have confused it with a mean deviation. And I'm nowhere near a statistician... heck, I was officially taught the definition in high school math class too.

      This is like someone thinking that the factorial sign (!) means a number is important or surprising or something, rather than what it's actually defined as. It's absolute idiocy. If someone is doing analysis where mean deviation is more appropriate, by all means, do it. But the idea the "most people" who aren't statisticians don't understand... well, I'd hope that anyone who is doing research and using it would. And if not, they should literally be sentenced to a statistics course where they compute everything by hand until they understand what the heck the numbers they are using mean.

    11. Re:Standard Deviation is Important by JanneM · · Score: 1

      Yes, we all know the definition. But knowing the definition doesn't mean you really understand the implications. My experience (I am a researcher) fits with his description: most researchers effectively think of SD as a measure of average deviation, and treat it that way when they (informally) reason about their data. As another post mentioned, even statistics teachers sometimes describe it that way for non-statistician students.

      And yes, one approach would be to make people learn statistics at a much deeper level. In fact, that would be a really good idea in general. But another, complementary approach would be to simply use the mean (or median) deviation when that's what you really try to use anyhow.

      In the situations this comes up, people aren't actually using the SD to derive anything else; it's simply used as a concise description of the underlying data. And the benefit of using mean or median deviation is that it's valid and sensible even when the data is not Gaussian; another common form of misuse of the SD.

      --
      Trust the Computer. The Computer is your friend.
    12. Re:Standard Deviation is Important by Anonymous Coward · · Score: 0

      Standard deviation can be justified in terms of maximum likelihood, assuming that the "deviations" are Gaussian-distributed around a mean. The sum of squares corresponds to minus the logarithm of the Gaussian probability of that particular combination of deviations occurring, so the higher this value, the more improbable (i.e., unlikely) that combination is. The mean as usually defined then happens to be the value that minimizes the sum of squared deviations from the mean, i.e., that gives the "most likely" combination of mean and deviations. But it is based on the assumption of Gaussian behavior.

      In radio communications, the channel noise is Gaussian-distributed, so the mean square corresponds to the power, and the root mean square is a measure of the amplitude e.g. in volts.

    13. Re:Standard Deviation is Important by martin-boundary · · Score: 1

      Nonsense. The standard deviation is a much better measure to use, due to the three sigma rule. This tells you much more for Gaussian, and approximately Gaussian distributions, which occur very frequently in science and engineering (unlike economics).

    14. Re:Standard Deviation is Important by martin-boundary · · Score: 1

      Choosing a different measure of deviation does not mean the null is "more likely to be rejected." Nulls are rejected based on the assumed underlying distribution, not the particular measure of deviation.

      Just a clarification. While nulls can be rejected based on many different measures of deviation, statistical theory explains which among all possible statistics are the most efficient at rejecting a hypothesis. The prime importance of standard deviation (and variance) is no accident.

  10. Education!! by RichMan · · Score: 2

    There is a great difference between a mean value and an RMS value. Scientific people can work with the appropriate version so I don't see a problem with using the correct one for the correct occasion. And certainly science should stay with the correct term as appropriate.

    What I believe the person is calling for here is the most appropriate use when communicating to the non-scientific person. This is an education issue in that the communication really should not use either term as a shorthand but should explain in full the effect of the distribution. Science uses mean and standard deviation (often also requiring a named distribution) because they are shorthands that describe the random behavior and have full meaning without any other explanation needed. So I say use neither term when communicating to the non-scientific as they do not fulfill the communication role to which they are intended.

    What I believe should actually be done is proper education of all so that they understand the differences between various random distributions and move totally away from a "it is cold today, so global climate change based on heating must be a lie".

    1. Re:Education!! by ZombieBraintrust · · Score: 1

      He isn't talking about Non Science people. He is talking about Social Science people and Science Journalist people. Both of whom have educations.

    2. Re:Education!! by bunratty · · Score: 2

      The problem with people accepting anthropogenic global warming is not a matter of understanding. It's a matter of people believing what they want to believe. If people want to believe in God and think that evolution diminishes the importance God or think that evolutions are saying that God doesn't exist, they look for any evidence that evolution doesn't happen, no matter how flimsy it is, to prevent having to feel uncomfortable emotions. If people believe that they have to give up a comfortable lifestyle to reduce carbon dioxide emissions, they will look for any evidence that AGW is incorrect, no matter how flimsy it is. You can see this behavior for what it is when people cling to a mistaken idea for dear life. In a way, they feel that their way of life does depend on that belief, because that belief is their way of comforting themselves.

      --
      What a fool believes, he sees, no wise man has the power to reason away.
    3. Re:Education!! by Obfuscant · · Score: 1

      He isn't talking about Non Science people. He is talking about Social Science people and Science Journalist people. Both of whom have educations.

      So he is talking about Non Science people. Have you never read the output of a Science Journalist when they write about something you are familiar with?

      "Hav[ing] an education" doesn't make one a scientist. Doing things the scientific way makes one a scientist.

    4. Re:Education!! by onkelonkel · · Score: 2

      It is even worse than that with religious folk. All sorts of people will go into "Cognitive Dissonance" mode when their strongly held beliefs are challenged and some will refuse to change their beliefs no matter what evidence is presented. With religious people you get this plus also they are convinced that it is their faith (how strongly they cling to their beliefs - no matter what) that determines their reward in the afterlife. There is no point in attempting logical or evidence based discussions with someone like that.

      --
      None of them can see the clouds; The polished wings don't care.
  11. WTF by Anonymous Coward · · Score: 0

    The confusion started then: people thought it meant mean deviation. The idea stuck: every time a newspaper has attempted to clarify the concept of market "volatility", it defined it verbally as mean deviation yet produced the numerical measure of the (higher) standard deviation.

    First, when the media reports on a scientific discovery, they report the researcher's stats - not they're own. So, if there's an error with the interpretation of STD, then it's the original researchers'.

    Do you take every observation: square it, average the total, then take the square root? Or do you remove the sign and calculate the average? For there are serious differences between the two methods.

    I suggest he publishes in a peer review journal instead of ... WTF is 'edge.org'???

  12. "many with PhDs" by Daniel+Dvorkin · · Score: 1

    If there are "data scientists" who don't understand what the standard deviation is, then they certainly shouldn't be calling themselves "data scientists," and quite possibly not scientists at all. What subjects are their PhDs in, I wonder? This doesn't do anything to reduce my skepticism that such a thing as "data science" really needs to exist.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    1. Re:"many with PhDs" by Anonymous Coward · · Score: 0

      Sociologists are notorious for misusing statistics. I've perceived this for a long time. I'd love to see a citation for Taleb's claim that it's been shown that the majority of sociologists misuse statistics, because there are lots of people who need it thrown in their face.

    2. Re:"many with PhDs" by Anonymous Coward · · Score: 1

      Hi. Ph.D. in data science and network analysis here. First, I consider my statistics background among my peers to be somewhat lacking, because I have chosen to specialize more in graph theory. Even so, I understand at a very deep level a concept as simple as standard deviation, and I doubt very much anybody with a Ph.D. who would call himself a data scientist does not. This article is offensive, frankly.

      Second, you may have skepticism, but data scientists operate at an interesting and challenging intersection of hardware and parallelism challenges in processing huge data sets, hardcore statistics, data mining principles and machine learning algorithms, and often network science to name a few skills. What other existing specialization in computer science, physics, etc,. do you feel is qualified to use Hadoop to process trillions of triple stores into a network and subsequently build highly multivariate link prediction models and evaluate their output statistically with respect to ground truth, to name but one trifling task?

    3. Re:"many with PhDs" by Daniel+Dvorkin · · Score: 3, Interesting

      What other existing specialization in computer science, physics, etc,. do you feel is qualified to use Hadoop to process trillions of triple stores into a network and subsequently build highly multivariate link prediction models and evaluate their output statistically with respect to ground truth, to name but one trifling task?

      As it happens, one of my colleagues runs a project which, among other things, does exactly that. His PhD is in computer science. I'm a bioinformaticist with a background primarily in biostatistics; I couldn't develop a tool like that, but I can certainly see the value in it. In general, I'm not arguing that the tasks currently getting lumped together under "data science" aren't valuable. I'm just saying that I'm not convinced they fit together into a coherent field that can meaningfully be studied in a single degree program, and attempts to make them so may well run into the problem of "jack of all trades, master of none."

      --
      The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
    4. Re:"many with PhDs" by Anonymous Coward · · Score: 0

      do you feel is qualified to use Hadoop to process trillions of triple stores into a network and subsequently build highly multivariate link prediction models and evaluate their output statistically with respect to ground truth, to name but one trifling task?

      Why would I want to do that?

    5. Re:"many with PhDs" by Anonymous Coward · · Score: 0

      Same AC.

      Fair enough. My actual program was in computer science too, but I identify as a data scientist, because it best describes my expertise.. It sounds like we agree that the intersection is challenging and interesting. I can even agree that the "jack of all trades, master of none" description may be apt for myself to some extent, though I am also a world-recognized specialist in one very specific research area.

      Where I think we differ is that I think the confluence of these trades is important, and I don't think that a group of four specialists can as easily or quickly craft as good of a solution as one specialist in data science. In fact, I know it. I have seen a physicist, a sociologist, a statistician, and a HPC computer scientist sit around a table trying to communicate with each other very unsuccessfuly about very basic principles in each field to put together some research. They ultimately went their own ways.

      In other words, the data scientists is the "master" of the confluence, and that can be very useful for achieve research goals and corporate goals that would otherwise be difficult or infeasible with traditional specialists.

  13. Re:Would those data scientists with PhDs by Daniel+Dvorkin · · Score: 0

    Given that most of the buzz about "data science" seems to be in the business world, I'd say it's more lilkely they're corporate hacks working for the propaganda machine that's so effective on suckers like you.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  14. Yes, use the interquartile range instead by G3ckoG33k · · Score: 1

    Yes, use the interquartile range instead https://en.wikipedia.org/wiki/Interquartile_range

    It is like the median a very robust method, not readily influenced by outliers. https://en.wikipedia.org/wiki/Median

    The median is wickedly robust, with a breakdown point at 50%, meaning that you can throw a huge a mount of junk data at it and it still doesn't care.

    The arithmetic mean and the standatd deviation are both junk, often worse than the too-often-assumed-normal data thrown at it.

    1. Re:Yes, use the interquartile range instead by Jane+Q.+Public · · Score: 2

      "It is like the median a very robust method, not readily influenced by outliers. The median is wickedly robust, with a breakdown point at 50%, meaning that you can throw a huge a mount of junk data at it and it still doesn't care. The arithmetic mean and the standatd deviation are both junk, often worse than the too-often-assumed-normal data thrown at it."

      That depends entirely on what you are trying to show. None of them are junk for all purposes; all of them are junk for the wrong purposes.

      For example, if you're talking about salaries of employees of a corporation, the mean might not mean much: the CEO makes 30 times as much as everyone else, and other managers 20 times more, lower managers 10 times more... so the mean is thrown way off. The median is much more meaningful.

      On the other hand, even the mode can be useful sometimes. Suppose the corporation has only 3 pay grades: employees grade A, managers grade B, owner and CEO grade C. In that case the mode might actually tell you something interesting. That's not the best example, but it is an example.

    2. Re:Yes, use the interquartile range instead by Anonymous Coward · · Score: 0

      Except that the Interquatile Range (IQR) depends on the number of data. If you took the IQR of 10 data, it would probably be lower than if you used a hundred data.

    3. Re:Yes, use the interquartile range instead by serviscope_minor · · Score: 1

      . so the mean is thrown way off

      Way off what?

      On the other hand, even the mode can be useful sometimes.

      Fun fact: the mean, meadian and mode all find the number that minimizes some norm of the absolute deviation. For the mean it's the L2 norm, the median the L1 norm[*] and the mode is the L0 "norm" (yeah yeah). There's an entire family from 0 to infinity (or -minus infinity if you're perverse).

      Anything below 1 is not convex (so they're not strictly norms but the equations are the same). Also these can be somewhat difficult to estimate from samples, especially L0.

      [*] The high school thing about averaging the two middle values to find the median if the dataset has an even number of points is unnecessary. Any value between and including those two is a valid median in that it is a minimizer of trhe L1 norm.

      --
      SJW n. One who posts facts.
    4. Re:Yes, use the interquartile range instead by Anonymous Coward · · Score: 0

      Except that the Interquatile Range (IQR) depends on the number of data. If you took the IQR of 10 data, it would probably be lower than if you used a hundred data.

      No, that's not a correct assumption.

  15. The big picture by Okian+Warrior · · Score: 1

    We should not ask statisticians to change their terms because people are too stupid to understand them.

    I've always wondered about this attitude.

    For me, any change requires an analysis of risk/reward versus value. For example, if code contains confusing names, it might be worthwhile to refactor it.

    The tradeoff is in the time spent refactoring versus the perceived value - if it's a mature product that largely works with few planned updates and few people will have to deal with the confusion, then the effort outweighs the returned value. If the code is open source, being actively developed and with many eyes looking at it, there may be a great deal of value in making it easier to understand.

    The same could be said of English versus Metric measurements. Why should the US change to use the new system when everyone understands the one we have?

    If the Federal Reserve sometimes gets it wrong, there may be great value in changing terms. The effort to fix the mistakes people make might be a good deal less effort than changing the terms used by a subset of mathematicians.

    You can look at the big picture and see changes that would return a large overall/distributed value, or you can look at small groups and see that making those changes would cost them time and effort.

    Is it too much to ask statisticians to look at the big picture?

    1. Re:The big picture by PRMan · · Score: 1

      Especially when in Visual Studio you can right-click and Rename throughout the project. I change names that don't make sense and change them to what makes sense all the time.

      --
      Peter predicted that you would "deliberately forget" creation 2000 years ago...
    2. Re:The big picture by flyingfsck · · Score: 1

      Everyone understands the US Measures? How many pottles are there in a firkin? Or how many nails in a chain?

      --
      Excuse me, but please get off my Pennisetum Clandestinum, eh!
    3. Re:The big picture by Okian+Warrior · · Score: 1

      Everyone understands the US Measures? How many pottles are there in a firkin? Or how many nails in a chain?

      Everyone else understands what I meant.

      What are you going on about?

    4. Re:The big picture by boristhespider · · Score: 4, Funny

      I often change CSensiblyNamedClassThatDescribesItsFunctionWell to bTrue throughout the code for precisely this reason and no-one ever appreciates it :(

    5. Re:The big picture by FriendlyStatistician · · Score: 4, Informative

      Hi, I'm a statistician.

      It's not so simple to just say "ok, we're going to use the Mean Absolute Deviation from now on." The use of standard deviation is not quite the historical accident that Taleb makes it out to be--there are good reasons for using it. Because it is a one-to-one function of the second central moment (variance), it inherits a bunch of nice properties that the mean absolute deviation does not. There is not a one-to-one correspondence between variance and mean absolute deviation.

      Taleb is correct that the mean absolute deviation is easier to explain to people, but this is not just a matter of changing units of measure (where there is a one-to-one correspondence) or changing function and variable names in code (where there is again a one-to-one correspondence). Standard deviation and mean absolute deviation have different theoretical properties. These differences have led most statisticians over the last hundred years to conclude that the standard deviation is a better measure of variability, even though it is harder to explain.

    6. Re:The big picture by mythosaz · · Score: 4, Interesting

      I would have said "18 half gallon pottles to the quarter-barrel firkin."
      Wolfram Alpha says 15.75 pottles to the firkin, but that's because of US/UK gallon conversions, I reckon.

      352 nails in a chain - which was interesting to me, in that Google includes those units in its calculator.

      I now know more about pottles, firkins, nails and chains that I did when I woke up. I shudder to think about what got pushed out of my old head to make way for this new minutia.

    7. Re:The big picture by camperdave · · Score: 1

      Only Americans understand the US measures. How many ounces in a pound? Well, it depends on what you're weighing. 63360 inches is about 3mm shy of a mile? Okay, if you say so. I'll stick to metric, thanks.

      --
      When our name is on the back of your car, we're behind you all the way!
    8. Re:The big picture by turbidostato · · Score: 1

      "I would have said "18 half gallon pottles to the quarter-barrel firkin."

      Are you talking about English or American gallons?

      And I expect you talk about English London Beer barrels, not those stupid Dry Colonial barrels, you know...

    9. Re:The big picture by reve_etrange · · Score: 3, Insightful

      I think NNT is saying that the MAD ought to be used when you are conveying a numerical representation of the "deviations" with the intent that readers use this number to imagine or intuit the size of the "deviations." His example is that of how much the temperature might change on a day-to-day basis. According to him, it's not just that the concept is easier to explain, but that it is the more accurate measure to use for this purpose.

      Based on his other work I'm sure he understands that the STD is generally superior for optimization purposes, fit comparison, etc.

      --
      .: Semper Absurda :.
    10. Re:The big picture by Anonymous Coward · · Score: 0

      You use an IDE *and* Hungarian notation? The horror!

    11. Re:The big picture by Anonymous Coward · · Score: 1

      No. We don't understand the US measures either. How many ounces in a pound? I have no idea. I ACHE for the metric system. Please!!!

    12. Re:The big picture by The_Wilschon · · Score: 1

      I hope he has more examples than just the temperature (no, I didn't RTFA). For the temperature in a day, most people are satisfied with the minimum and maximum, and don't need any more complicated measure. The MAD would actually be LESS informative for temperatures within a day...

      --
      SIGSEGV caught, terminating

      wait... not that kind of sig.
    13. Re:The big picture by Anonymous Coward · · Score: 0

      I agree it's hardly as problematic as the author seems to think it is. Statistics are valuable since they tell you things that can help you make decisions. And money. As a science teacher, stats are as valuable to a scientist as a ruler and square to a builder. Re: Std Dev - actually, leading students down the lane of "measuring scatter" seems to work fine - they can tell two data sets are different visually, now you have to make them produce a number that makes sense to reflect that difference. The "hard part" is only getting them to see that deviation has to be a non-squared number, as it makes no sense to say that two groups of data produce similar means of 25.4 people per bus (for example) but that one of those bus companies has it bad because the passenger list varies more widely, say by a variance of 30 "square people" vs. 12 "square people". Square root, problems solved. Standard units makes sense.

    14. Re:The big picture by Anonymous Coward · · Score: 0

      In other words we're smarter than you. It's always amusing that people from parts of the world that weren't smart enough to codify and enforce their units of measure are always mocking us for our system that's worked for a longer period of time.

      I realize that people in the rest of the world are somewhat brain addled when it comes to measurements, but this is ridiculous. Nobody ever worries about 3mm when talking about a mile, because quite frankly the measurements are rarely, if ever that exact. Even modern surveying equipment is going to be off by more than that over the course of a mile. Not to mention the fact that 3mm is about 1/8th of an inch, but not precisely.

    15. Re:The big picture by reve_etrange · · Score: 2

      Not within one day, but across several days.

      Say someone just asked you to measure the "average daily variations" for the temperature of your town (or for the stock price of a company, or the blood pressure of your uncle) over the past five days. The five changes are: (-23, 7, -3, 20, -1). How do you do it?

      The STD of the series is 15.7 and the MAD is 10.8. NNT argues that MAD is more useful in this context.

      In fact, whenever people make decisions after being supplied with the standard deviation number, they act as if it were the expected mean deviation...every time a newspaper has attempted to clarify the concept of market "volatility", it defined it verbally as mean deviation yet produced the numerical measure of the (higher) standard deviation.

      He says he's seen this mistake made not just in popular articles, but also in financial publications and regulatory documents. The daily temperature is just a contrived example; he's mainly talking about financial analysis (which is his field).

      --
      .: Semper Absurda :.
    16. Re:The big picture by Compuser · · Score: 1

      One could make a more practical argument too. For most cases MAD will be smaller than standard deviation. So reporting so many standard deviations of difference is often a way to overestimate your error bars. Or in other words to report a more conservative estimate. If MAD or SEM is the appropriate way to estimate errors - oh well, that just means your reported result (significant difference) is even stronger. Of course you will usually use a test and not rely on error bars to estimate significance but on a visual graph, when you plot huge error bars and your results are still worlds apart - it is a good way to be extra conservative.

    17. Re:The big picture by hawk · · Score: 1

      About two thirds of things are within one standard deviation of the mean, 95% within two, and 99% in three.

      This applies to all bell-shaped data, which is nearly (but not quite) all of it. Slightly broader rules (Chebyshev) apply to all data, regardless of distribution)

      There is no similar statement for average deviation.

      hawk

    18. Re:The big picture by EngnrFrmrlyKnownAsAC · · Score: 1

      How many ounces in a pound? Well, it depends on what you're weighing.

      No, there is always 16 ounces in a pound regardless of what you are "weighing" (measuring the mass of). Perhaps you've conflated ounce with fluid ounce -- a distinct, though confusingly named, unit.

      And yes, a mile is 5280 ft = 63360 inches. I don't know where you pulled "3mm shy of" from but if you're measuring in miles and worrying about being 3mm shy, you're doing it wrong.

      --
      Howdy howdy howdy
    19. Re:The big picture by Jamu · · Score: 3, Funny

      pnWhat vIs nWrong cWith aHungarian nNotation?

      --
      Who ordered that?
    20. Re:The big picture by camperdave · · Score: 2

      No, there is always 16 ounces in a pound regardless of what you are "weighing" (measuring the mass of). Perhaps you've conflated ounce with fluid ounce -- a distinct, though confusingly named, unit.

      And yes, a mile is 5280 ft = 63360 inches. I don't know where you pulled "3mm shy of" from but if you're measuring in miles and worrying about being 3mm shy, you're doing it wrong.

      BZZZT! Wrong. Gold, silver, and other precious metals are measured in Troy ounces, which are slightly heavier than regular ounces. Oddly, Troy pounds only have 12 Troy ounces in them. Thus, an ounce of gold is heavier than an ounce of lead, but a pound of gold is lighter than a pound of lead.

      Similarly, long distances are measured in statute (or survey) miles, which are based on a longer standard than customary measures. The volume of a hogshead is different between ale and beer.

      But thanks for correcting me. Not even Americans know their own system.

      --
      When our name is on the back of your car, we're behind you all the way!
    21. Re:The big picture by Anonymous Coward · · Score: 0

      You forgot baz, you insensitive clod!

    22. Re:The big picture by Anonymous Coward · · Score: 0

      Takes the fun right out of software development. And it's for morons who don't know what they're doing (i.e. what type their fucking variables are).

    23. Re:The big picture by boristhespider · · Score: 1

      Company policy. You get used to it quickly, even if any IDE worth its salt can tell you a type in a rather less silly way.

      At first it felt like someone was forcing me to program in Fortran 77 again. I don't like Fortran 77.

    24. Re:The big picture by Anonymous Coward · · Score: 0

      Aaaand since all the units you brought up are defunct, you've wasted your time and ours, as well as being wrong. Thanks for playing, though.

  16. How about "somewhere in the middle" by sandbagger · · Score: 1

    That's a good enough replacement term.

    --
    ---- The above post was generated by the Turing Institute. Maybe.
  17. Why eliminate it? by Anonymous Coward · · Score: 0

    Properly educating the world on this problem would likely take no more effort than convincing everyone to stop using standard deviation. To that end, why eliminate something that (apparently) has widespread use?

  18. Same trick, different pony by Blackajack · · Score: 1

    My college math teacher made a special point of warning us that journalists almost always mix up pct and pp. Sometimes they even do that on purpose!

    1. Re:Same trick, different pony by Anonymous Coward · · Score: 0

      Ooh, you took college "math"? Did you take college "history" and "gym," too?

  19. Use "Margin of error" by sinequonon · · Score: 1

    If you don't like the term "standard devation", use "margin of error" instead.

    --
    -Bob-
  20. Re:response by clj · · Score: 2

    I don't know which is more foolish, thinking that saying nothing, but saying it first, is a worthwhile goal, or claiming to be first when you're not. No need for you to choose, however: you did both.

  21. Re:Would those data scientists with PhDs by Anonymous Coward · · Score: 0

    No, it's more likely to be a public relations firm in the guise of a "grass roots" organization in the pockets of big oil. But we digress.

  22. Roadway Intersections by Okian+Warrior · · Score: 1

    If there are "data scientists" who don't understand what the standard deviation is, then they certainly shouldn't be calling themselves "data scientists," and quite possibly not scientists at all. What subjects are their PhDs in, I wonder?

    The problem isn't with highly-educated people, it people who are not highly educated, or who are highly educated but in a different field.

    If a particular intersection attracts a lot of accidents, we consider the accidents to be the fault of the drivers involved. But at the same time, we recognize that aspects of the intersection might be a contributing factor as well.

    Expert drivers would never have such accidents, but if we spend some effort reblocking the intersection we could get improved safety, and sometimes there is value in doing this.

    Like the roadway intersection, if a term is so confusing that average people make mistakes because of it, there may well be value in changing to easier-to-understand terms.

    1. Re:Roadway Intersections by Anonymous Coward · · Score: 0

      "...Expert drivers would never have such accidents..."

      A completely ridiculous assumption.

  23. Re:response by flibbajobber · · Score: 5, Funny

    First!

    ... to within 0.5 standard deviations.

    Actually, the more posts this story attracts, the more accurate your statement is, and the fewer standard deviations you are away from true first. Response times not being distributed in a Gaussian curve perhaps complicates things.

  24. Standard deviation BAD, but mean GOOD? by PacoSuarez · · Score: 4, Interesting

    Perhaps non-mathematicians don't have a problem with this, but it rubs me the wrong way.

    What makes the mean an interesting quantity is that it is the constant that best approximates the data, where the measure of goodness of the approximation is precisely the way I like it: As the sum of the squares of the differences.

    I understand that not everybody is an "L2" kind of guy, like I am. "L1" people prefer to measure the distance between things as the sum of the absolute values of the differences. But in that case, what makes the mean important? The constant that minimizes the sum of absolute values of the differences is the median, not the mean.

    So you either use mean and standard deviation, or you use median and mean absolute deviation. But this notion of measuring mean absolute deviation from the mean is strange.

    Anyway, his proposal is preposterous: I use the standard deviation daily and I don't care if others lack the sophistication to understand what it means.

    1. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      Ah, thank you! The most intelligent comment I've read here. I just hope everyone reads it. Most importantly:

      The constant that minimizes the sum of absolute values of the differences is the median, not the mean. So you either use mean and standard deviation, or you use median and mean absolute deviation. But this notion of measuring mean absolute deviation from the mean is strange.

    2. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      I understand that not everybody is an "L2" kind of guy, like I am. "L1" people prefer to measure the distance between things as the sum of the absolute values of the differences.

      This is ridiculous. We all know that pi is the one true constant. Why aren't we all using Lpi?

    3. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      The constant that minimizes the sum of absolute values of the differences is the median, not the mean.

      Out of curiosity: how do you prove this? For the sum of the squares of the differences it is straightforward to prove that it is minimized by the mean. But for the sum of the absolute values, you cannot use differential calculus.

      I would really be interested to see the proof. Do you have a reference?

    4. Re:Standard deviation BAD, but mean GOOD? by locofungus · · Score: 1

      For an odd number of samples consider the median value first.

      Obviously for one value, the distance is minimized by picking that value.

      Now add the next two points (one on either side) - the extra sum of the absolute value of the distances is the distance between these two points regardless of where we put the value between them- so it's still minimized if it stays at the median. repeat.

      For an even number of points the initial point can be anywhere between the two central values.

      For completeness you need to consider (and reject) the case that the median lies outside the list completely.

      --
      God said, "div D = rho, div B = 0, curl E = -@B/@t, curl H = J + @D/@t," and there was light.
    5. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      Fuckin' a, right, man! It's all about what you want!

    6. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      I agree with your consistency argument (if you're using measures for dispersion following a L1 metric, you should also use them for location). Though...

      So you either use mean and standard deviation, or you use median and mean absolute deviation.

      If you're going to go with the median as measure of location (a robust estimator that is insensitive to outliers), why not use median absolute deviation (rather than mean absolute deviation) as a measure of dispersion (since, again, it is a more robust estimator of dispersion in the presence of outliers)?

      (just curious and genuinely interested in hearing your opinion)

    7. Re:Standard deviation BAD, but mean GOOD? by Anonymous Coward · · Score: 0

      I don't want to sound pedantic (LOL, who am I kidding? Of course I do...), but, obviously, the fundamental metric is Ltau, not Lpi.

    8. Re:Standard deviation BAD, but mean GOOD? by PacoSuarez · · Score: 1

      I am not sure how I feel about that measure. If we were to use the median absolute error and try to be consistent, we would have to use as the central measure whatever minimizes the median absolute error. That would be a point somewhere between the 25th and 75th percentile, in the "flatter" part of the distribution, in some sense. I don't know if that central measure has a name, but I suspect it's not very relevant in practice.

  25. Re:response by Anonymous Coward · · Score: 0

    You forgot making a blatantly late fp just to elicit a reaction from someone, otherwise known as trolling. Someone with only six digits should know better FFS.

  26. I hate averages by tthomas48 · · Score: 5, Interesting

    I also think averages should go away. Most people think they are being reported the median (the number in the middle) when people tell them the average. It's great for real estate agents, and people trying to advocate for tax reform, but the numbers are not what people think they are.

  27. Re:Would those data scientists with PhDs by ISoldat53 · · Score: 1

    They are vital to getting meaningful information out of a sea of data. Cancer research and particle physics use data scientists. Unfortunately so does amazon.com.

  28. Re:Would those data scientists with PhDs by segedunum · · Score: 2

    Well, given that they think it's a great idea to take two different data sets measured in the same units, but measured in completely different ways, and put them together as a comparison over time then I'd say the definition of deviation is the least of their worries.

  29. Revisiting a 90-year-old debate: Advantages of MAD by Anonymous Coward · · Score: 1

    Food for thought: "Revisiting a 90-year-old debate: the advantages of the mean deviation"

    http://www.leeds.ac.uk/educol/documents/00003759.htm

  30. SD in life sciences by Anonymous Coward · · Score: 0

    When taking measurements (such as protein concentration in blood) we are forced by the magazine editors to inform SD as an error estimate. That is in my view plainly wrong, as the SD is an estimate of the population variance. I try to use what is known as standard error of the mean (SEM) (mean deviation in TFA).

  31. Confused Taleb by Anonymous Coward · · Score: 1

    Didn't Taleb warn us about the perils of modeling things with normal distributions that fail to capture outliers ("Black Swans") and yet now he advocates the use of a stastical measure that conceals^H^H^H^H^H is robust with respect to outliers?

    Oh well, next year he'll probably come up with something along the lines of "Monte Carlo methods major cause of global warming, return to analytic methods and moments unavoidable truth"...

  32. Incorrect identification of the problem. by Anonymous Coward · · Score: 0

    The problem is not that standard deviation is confusing. The problem is that sociologists need to learn how to apply statistics. Either the majority of sociology PhDs are ignorant of statistics, or they've mastered the art of selecting a politically desirable conclusion and misapplying statistics to support it.

  33. Bell Curve by samwhite_y · · Score: 1

    I find this article quite confusing. Is the actual suggestion that we should be going around using the mean deviation as a way of capturing the general variance of our data sets? Or to put it another way, does he want "deviation" measures not to give us a real sense of the larger deviations that might occur with some real probability. For example, with temperatures, standard deviation is more likely to suggest that we can have periods of significantly higher and lower temperatures than a simple "mean deviation".

    Adding to my confusion is that there is no reference to articles, books, or other subject material that supports the general thesis. If the "mean deviation" is better than the "std deviation", give some real concrete examples and supporting mathematics.

    Also, there seems to be no reference to "bell curve" distributions and "non bell curve" distributions. Standard deviation computations are built around bell curve distributions for their mathematical soundness. For example, if I were to take every number and raise it the fourth power, standard deviation would not work so well on this new set of numbers. Is the author suggesting that typical sampling distributions of sampled events tend not to be "bell curve" like?

    Standard deviation is taught in 7th grade in my local school. It shows up constantly in any standard K-12 curriculum. To challenge this, you really should bring a lot more substance to any argument that we should do things differently.

    For example, I could argue that we should use 1:2 to represent 1/2 because the slash (/) should be used for logical dependency arguments instead. I could create lots of examples and go into a diatribe about how people constantly misuse fractions and ratios because they use a slash in their construction. But I would still be spouting nonsense.

  34. Mean Deviation is Always Zero by dcollins · · Score: 3, Interesting

    Well... first of all, summary has it wrong. It's not "mean deviation", it's "mean absolute deviation", or just "absolute deviation" from the literature I've seen. (Mean deviation is actually always zero, the most useless thing you could possibly consider.)

    Keep in mind that standard deviation is the provably best basis if your goal is to estimate a population *mean*, the most commonly used measure of center. Absolute deviation, on the other hand, is the best basis to use for an estimate of a population *median*, which is maybe fine for finances, which is what the linked paper seems mostly focused on. (Bayesian best estimators, if I recall correctly.)

    If the main critique is that economists and social scientists don't know what the F they're doing, then I won't disagree with that. But no need to metastasize the infection to math and statistics in general.

    --
    We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
  35. The example is flawed by Glires · · Score: 1

    The example in the article isn't even an example of a standard deviation. He may have plugged his five values into the RMS formula, but what it produced isn't an actual standard deviation because five values is too small of a sample size.
    This article is really a demonstration of why people should stop misusing the term "standard deviation" than it is an argument of why people should stop using standard deviation.

    --
    -Glires
    1. Re:The example is flawed by vakuona · · Score: 1

      Actually, if that is the whole population of interest, then that is the standard deviation. If that is only a sample, then as an estimator, and it is biased. That's when you need to use the unbiased estimator to estimate the standard deviation for the population.

      As your sample becomes larger, it begins to matter less anyway (the difference between dividing my n and dividing by n-1 becomes tiny the larger n gets).

      I tend to have less hangups about using either to be honest if all I am looking for is a measure of the dispersion in the population. Of course, if I want to score full marks in a test, I use the right one.

    2. Re:The example is flawed by Glires · · Score: 1

      A problem with that line of thinking is that if those five values are the whole population of interest then you cannot establish that it is a normal distribution, which is a fundamental prerequisite for even considering the existence of a "standard deviation". Even for samples, normal-curve statistics are designed for large sample sizes (n>30). If your sample size isn't large enough to derive a normal distribution curve, then your RMS values are not measuring from the normal peak, but rather from a meaningless arbitrary value. Robotically plugging a small sample size into a large-sample statistical formula doesn't produce a valid statistical result any more than plugging your body weight into "C x 9/5 + 32 = F" would tell you your body temperature.

      --
      -Glires
  36. Know it, use it. by ExXter · · Score: 2

    I studied geodesy in germany as diploma on a technical university. Standard deviation has its right to exist and to be there and to be used. If this man really means what he says he should not say to abandon standard deviation but to write BOOKS that teach people correctly what it is and how it is calculated on the data which you have. Yes I also meet people (talking of themselves as scientists and researchers) who have no fucking clue how to work with data and standard deviation, but on the other hand I also meet alot who know and also derive the right conclusions, formulars or algorithms out of these. For me this guy sounds like a mad panda who just didn't get it right...

  37. Using MAD moves in the wrong direction by Anonymous Coward · · Score: 0

    He says that standard deviation gives too much weight to tail events.

    But I think the bigger problem in finance is under weighting tail events.

    1. Re:Using MAD moves in the wrong direction by vakuona · · Score: 2

      Um, yes it does.

      if you imagine a random draw in which you had 99 realisations of the number "1" and 1 realisation of the number "101". The mean would equal 2 while the (population) standard deviation would equal sqrt(1/100*(99+99^2)) (which is a fairly large number).

      If you removed the one observation of "101" from the sample, you would have zero standard deviation. So one additional outlier will cause the standard deviation to go from zero to 10 without changing the mean by nearly as much (or the median at all).

      The issue of underweighting tail events is really a separate issue, and that is usually because a distribution such as the normal distribution has very thin tails, so basically anything outside about 4 standard deviations would be predicted to be extremely unlikely by the normal distribution.

      Other distributions do better, but they tend to be rather less friendly from an analytical perspective. However, with the kind of computing power available nowadays, this is less of an issue now as it would have been when people first started using the normal distribution for the purposes for which it has been always known (and some not so clever bankers have recently found) to be inadequate.

      I should add that it is still a bit of an issue where speed is concerned though, because you can do your calculations involving the normal distribution much faster than you can many of the other more esoteric and likely more realistic distributions.

  38. Taleb doesn't live in a normal world by Yoik · · Score: 1

    When I was in school, they still taught the central limit theorem which explains why so many error distributions are "normal". Our world provides us with millions of examples in everyday life where the standard deviation of our experiences is the best statistic to estimate the probability of future events.

    What you do with a statistic is what counts. It's easy to look at the standard deviation and estimate the probability that the conclusion was reached by chances of the draw, though it takes some practice to develop your intuition. It is imbedded in our language when we talk of "6 sigma" reliability or " 4 sigma" thinkers. Anyone who thinks he is a scientist should understand this!

    Mr. Taleb may be working in a field where normal distributions are rare, but the probability is he is either lying or poorly educated.

    1. Re:Taleb doesn't live in a normal world by FriendlyStatistician · · Score: 1

      I disagree with Mr. Taleb on this point, but I feel I should note that most of his work over the last 10-20 years has been about things which are not Normal. He is quite well known for it.

  39. he does have a point, but maybe goes too far by dsoodak · · Score: 1

    I agree that mathematicians may become imprinted on standard deviation and forget that it is only used because it is easier to work with than average absolute deviation (ex: the derivative of x^2 is continuous, unlike abs(x)), and that less technically inclined readers might not realize there is a difference. However, they ARE usually pretty close (I don't have a reference, but I once ran a simulation comparing the 2 using random data with a Gaussian distribution and the curves matched exactly), and its harder to find exact solutions with average absolute deviation. On the other hand, it wouldn't hurt to use "MAD" occasionally on a data set to make sure that the standard deviation gives results that are meaningful as a measure of "deviation".

  40. Circular error probable. by Anonymous Coward · · Score: 0

    Cep of 50% and 10 meters means half the missiles land on your house, and half on your neighbors. A fine measure

  41. . . . in Social and Biological Sciences by dmatos · · Score: 1

    That's what he concludes at the bottom of the article. He starts the article by saying that standard deviation should only be used by physicists, mathematicians, and mathematical statisticians. If I'm not mistaken, "physics" and "math" covers a whole lot of different fields, including most of the STEM fields that (largely) define the users of this site.

    I know in my particular field (physics based), standard deviation is a hell of a lot more useful than mean average deviation. And easier to use.

    Bah. I call poor summary.

    --

    It may look like I'm doing nothing, but I'm actively waiting for my problems to go away.
    --Scott Adams
  42. Re:Would those data scientists with PhDs by Daniel+Dvorkin · · Score: 2, Interesting

    Cancer research and particle physics use data scientists. Unfortunately so does amazon.com.

    Okay, since cancer research is a very large field, I can't say for sure one way or the other ... but I do know that working in bioinformatics at a major academic research center, I've never known a single person in medical research of any kind who called themselves a "data scientist." We have lots of computer scientists and statisticians, most of whom, fortunately, get along well enough to make use of each other's strengths. Regarding particle physics I have no idea, but yeah, I'm willing to bet Amazon or any other large corporation hires more "data scientists" than all the scientific institutions in the world put together--and gets exactly the kind of buzzword bingo they're paying for in return.

    --
    The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
  43. Inside the microwave? by Anonymous Coward · · Score: 0

    Have you checked to see if you're oscillating while it is in use? That might be why you're not inside it, I think you're supposed to be both the particle and the wave unless you check.

  44. Re:Would those data scientists with PhDs by Anonymous Coward · · Score: 0

    Unfortunately so does amazon.com.

    "Amazon.com: Turning your Main Streets into Skid Rows since Walmart made it fashionable — with a computer!"

    "Choke down your Super-size while your city chokes on down-size — shop Amazon!"

    "Amazon — named after the lumber used to package your shit, bro! High-five!"

  45. His description of how to calculate SD is pretty p by Anonymous Coward · · Score: 0

    If you read the post his description presumes you have 1) already calculated the mean and found it is zero thereby simplifying the calculation 2) "average the total" whatever that is, when he really means total the squares of the values and divide by the number of values less one. This is pretty poor language from a professor writing a post about confusion. Nevertheless I think his argument is based on a lay person's intuitive misunderstanding of SD, and maybe if that is true, and I suspect it is, MAD might be a better measure to report in newspaper articles?

  46. Statistician and author? by Anonymous Coward · · Score: 0

    This guy is not a statistician.

    http://www.edge.org/memberbio/nassim_nicholas_taleb

    He is someone who *uses* statistics, just like the scientists he criticizes, albeit with probably a fair bit more understanding of the underlying mathematics. But nonetheless, it is wrong to label him a "statistician" just to add unwarranted authority to his words.

  47. Journalists by wiredlogic · · Score: 1

    If I read a non-scientific article that spewed out standard deviations I would automatically disregard the numbers anyway. It is a safe assumption that a journalism major doesn't understand what they're writing about and just adding filler to boost word count.

    --
    I am becoming gerund, destroyer of verbs.
  48. normal densities by stenvar · · Score: 3, Informative

    For normal densities, standard deviations and MAD are just proportional, with a factor of about 1.25, so it doesn't matter which you use.

    For non-normal densities, neither of them really is universally "right" for characterizing the deviation, but it's mathematically a whole lot easier to understand how standard deviation behaves in those cases than MAD. So even there, standard deviations are usually the better choice.

  49. This is completely wrong by Anonymous Coward · · Score: 0

    Standard deviation is a much better measurement of the spread of a distribution than the mean error. There are many great mathematical properties (mostly originating for Pythagoras theorem) that emerge in using the squared errors. Further, using the regular mean error can more often result in non unique values. For example, the MAD for the distribution -4, -4, 4, 4 will result in the same value as for the distribution of -8, 0, 0, 8. In most cases, different decision will be made between these two distributions. I.e. If it's stock prices, the latter is normally considered more volatile than the former.

  50. Heisenberg uncertainty by xPsi · · Score: 1

    The celebrated Heisenberg uncertainty principle in quantum theory is based on statistical statements about the coupled standard deviations of position and momentum measurements (for example), not the mean deviation. The mean deviations are assumed to be zero since the means of the position and momentum distributions are exactly known for theoretical work. What matters are the fluctuations about the mean. In fairness, Taleb does allow physicists to keep using STD. But, quantum mechanics aside, it seems characterizing fluctuations about the mean, rather than fluctuations of the mean, is often an important measure depending on the nature of the investigation. Retiring the standard deviation seems a bit hasty.

    --
    i\hbar\dot{\psi}=\hat{H}\psi
  51. standard deviation doesn't weight by observation by YesIAmAScript · · Score: 1

    It weights by the difference between the observation and the mean, by the variation. So large observations are not weighed any more than small ones. Two observations equally far from the mean get equal weight.

    Widely varying observations do get higher weight and that is intentional. Standard deviation is that way because it is so useful in analysis of variance and measuring likelihood of statistical significance.

    --
    http://lkml.org/lkml/2005/8/20/95
  52. Re:So you want to decertify a management degree... by rwa2 · · Score: 4, Funny

    ...and besides... JUST THINK of all the rigorous Lean Management courses that will have to re-certify all of their "Six-Sigma Black Belts" to some kind of "Half-Dozen of the Other" degrees!

    PANDEMONIUM!!!

  53. Re:response by Anonymous Coward · · Score: 0

    First! to Help you out with that. ;)

  54. Re:Would those data scientists with PhDs by Brett+Buck · · Score: 1

    Where does it say that?

  55. agree: as bad as "Anti-Fragile" by KWTm · · Score: 2

    So you want to retire a statistical term......because people use it incorrectly in economics? Get bent. The standard deviation is a useful tool for statistical analysis of large populations.

    Agreed that this is a ridiculous proposal. He probably just wants more publicity.

    This was the guy who wrote the book "Anti-Fragile", which I had hoped would educate and broaden my way of thinking, in the same way that the Malcolm Gladwell books ("Tipping Point", "Blink", "Outliers") did. He ended up droning on and on without really making a worthwhile point, and I gave up after a while.

    --
    404555974007725459910684486621289147856453481154 in hex is "You sank my Battleship?"
    [GPG key in journal]
  56. Re:Would those data scientists with PhDs by The_Wilschon · · Score: 3, Insightful

    I know several people who have left high energy physics to become data scientists. Nobody in HEP calls themselves a "data scientist", but that's (some of) what we do anyway. It's just analysis of very large data sets. Unlike in the life sciences, both HEP and many commercial / industrial environments have sufficiently large data sets that very complex questions can be asked and answered. You can never have "enough data" -- if you think you have "enough data", then you aren't asking hard enough questions.

    --
    SIGSEGV caught, terminating

    wait... not that kind of sig.
  57. more blogs masquerading as news by Anonymous Coward · · Score: 0

    And here's /. advertising yet another blog article.

    Step 1 Start bullshit blog with paid advertising.

    Step 2 Get blog article on front page of /.

    Step 3 Profit!

  58. On a related note by Lamps · · Score: 2

    "If people believe that they have to give up a comfortable lifestyle to reduce carbon dioxide emissions, they will look for any evidence that AGW is incorrect, no matter how flimsy it is. You can see this behavior for what it is when people cling to a mistaken idea for dear life."

    The above reminded me of something from Nassim Taleb's writings. Those who have read his books may be familiar with the following Upton Sinclair quote: "It is difficult to get a man to understand something, when his salary depends upon his not understanding it." NNT applies this principle to financial 'experts' (quants, stockbrokers, advisors, etc.) who do things that are demonstrably counterproductive (applying stat methods that assume Gaussianity to non-normal distributions; disregarding the randomness inherent in stock movements) not necessarily out of ignorance, but largely because such actions serve their economic benefit. In all areas, people often disregard evidence when doing so serves what they perceive as their immediate interests.

  59. Data Science by Lamps · · Score: 4, Informative

    Data science is a field that combines machine learning and statistics to derive meaning from data. Data scientists should be reasonably well-versed in classical stats, but the data sets they deal with are often huge, ill-defined, and not amenable to analysis using classical methods. To deal with such challenges, data science recruits a healthy combination of certain areas of comp-sci (databases, machine learning, NLP, AI), statistical methods, and, quite often, improvisation.

    Strange that there are so many people on here that are unfamiliar with data science.

    1. Re:Data Science by BorisSkratchunkov · · Score: 0

      not amenable to analysis using classical methods.

      Care to explain how this is true? I think I have an idea, but using "a healthy combination of certain areas of comp-sci (databases, machine learning, NLP, AI), statistical methods, and, quite often, improvisation" seems to be an even more obtuse approach than going about it the old-fashioned way. I'd much rather hear that people are using what we already know or (still better, but probably not as plausible) the latest mathematical advances regarding nonlinear systems rather than just ad-hoc'ing methods because... computers! I believe that this is at least partiallly Nassim Taleb's objection to the entire field of data science as well. How many 'results' coming from data science are the product of sound and rigorous methodologies, and how many are just due to chance/data dredging?

  60. "Mr. Taleb may be working in a field where normal by Lamps · · Score: 1

    Concerning his education credentials: he's got a U. Penn. MBA and a U. of Paris doctorate, and currently teaches at NYU Polytech. If you want to know his thoughts on normal distributions, stats, epistemology, econ, and the social sciences, his books are excellent, and are well worth a read (although much of the best material is quite derivative of Mandelbrot). NNT may be called an anti-academic, anti-econ establishment crank, but it would generally be inaccurate, in accordance with your inference, to accuse him of lying.

  61. Re: Would those data scientists with PhDs by Anonymous Coward · · Score: 0

    odd. i would suggest the exact opposite

  62. Standard Deviation is fn of 2nd moment of the data by MickLinux · · Score: 3, Informative

    I can really go for renaming standard deviation, but it should not be abolished.

    Standard deviation is a function of the second moment of the data, and if you remember your laws for combining moments of inertia (the parallel axis theorem), then you'll understand better what you're dealing with.

    2nd moments detail resistance to spin, and thus the resiliance of your findings to changes and errors.

    --
    Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
  63. How do you calculate the mean deviation in 1 pass? by mark-t · · Score: 1

    Calculating the standard deviation of a data set with only one pass over the data as you initially collect it is fairly straightforward. This is ideal in situations where the data you are working with is ephemeral, or of unbounded size and impractical to store every individual sample. How do you calculate the mean deviation without having to go back and revisit all of your samples?

  64. Re:response by Decker-Mage · · Score: 2

    Naw, you'd probably need a Poisson distribution ;-).

    --
    "[I]t is a wise man who admits the limits of his knowledge or skill, and that pretending either causes harm." --Terry Go
  65. The title is misleading, sensasionalist by guacamole · · Score: 1

    Perhaps we should be looking at MAD statistics more often when summarizing or describing data. However, the standard deviations are very useful in Statistical Inference . Standard deviations are always reported with the parameter estimates. Now this is really useful because the parameter estimates are assumed to be approximately normally distributed either due to the Central Limit Theorem or by assumption of iid normal disturbances. Under the standard normal distribution, two standard deviations account for 95% of the coverage probability, so just by glancing at the standard deviations you know roughly the confidence terminals and also the outcome of a simple z or t test of a hypothesis about the given estimate.

  66. Article is bollocks by Chrisq · · Score: 1

    Do you take every observation: square it, average the total, then take the square root? Or do you remove the sign and calculate the average?

    WTF! - he's managed to get both the definition of standard deviation and mean absolute deviation wrong.

  67. Re by Anonymous Coward · · Score: 0

    Beats me, but everyone knows there are 20 fluid ounces in a pint.

  68. Journalists don't care for uncertainty anyway by GauteL · · Score: 2

    And neither does the media-consuming public. Most would totally ignore your measure of precision regardless of whether you call it standard deviation or mean absolute deviation. For them your average is absolute and if any values aren't at all near it something is terribly wrong. They will also not rest until every school performs above average and nothing in your work will convince them otherwise. The public doesn't like uncertainty and will assume every outcome is for a special reason, and this even goes for the non-religious ones. The idea that some things aren't absolute and are actually uncertain and variable terrifies them.

    Nowhere is this more apparent than in sports. Everything there is always "written in the stars" or "destiny" and if you win it always proves beyond doubt your are better than your opposition (or you were 100% cheated by the refs). Hell, journalists may have had a full article written up 2 minutes before the end of a game and then completely change everything to be about one team's dogged determination because chance would have it they scored in the last minute. I love football (soccer), but discussing it can be frustrating.

    If you still believe you can convince them, use mean absolute deviation in your "executive summary" or press release and leave the standard deviation as is in your actual paper. The only ones that actually read the paper are scientists anyway. The typical journalist reading your actual paper is likely to misunderstand something in every paragraph anyway. Changing real science to pander to the masses is a fucking huge mistake.

  69. Even Taleb himself can't get it right! by Anonymous Coward · · Score: 0

    In TFA, Taleb claims that, for the set of differences from the mean [-23, 7, -3, 20, -1], the standard deviation is 15.7, but it's actually 14.1!
    I'm highly surprised that someone with such a reputation would makes such a mistake in such an article...

  70. Wait, what? by Anonymous Coward · · Score: 0

    Why on earth would he suggest that the standard deviation is an *average*? It's an average deviation *from* an average.

    I get that the concept of an average deviation from an average is highly confusing, to the point of apparently confusing the guy himself. Or his apparent message, I should say.

    Incidentally, for data points much as the ones he suggests (-23, 7, -3, 20, -1), I consider the most sensible average to be zero. I'm sure there's a nifty name for it. It happens to be the mean distance from the lowest sample, i.e. (0 + 30 + 20 + 43 + 22) / 5.

  71. Screwdrivers should be outlawed! by Anonymous Coward · · Score: 0

    I know some people who try to use a screwdriver as a hammer, and to open buckets of paint. If we follow the logic of the article, since that's not what screwdrivers are for, everyone else should stop using them.

  72. Bayesian view by Anonymous Coward · · Score: 0

    From a Bayesian view -- we would prefer not to quote any "statistics" anyway, either SD or this MD. Rather, we should infer the posterior distribution over the variable of interest, and use it directly to inform our action based on utility. For just reporting, we can show the full posterior as a picture, or if it's a Gaussian, quote it generative mean and variance parameters, which enable a reader to do anything they like with them. (The square root of the variance, sigma, is generally understood to describe the region around the mean where 68.2% -- a one-sigma portion -- of the generated data would fall. And it appears directly in the posterior Gaussian generative equation in a way that MD does not.)

  73. I'm shocked by aled · · Score: 1

    You mean that there are scientists publishing papers out there that don't understand basic concepts of statistics? Like, the rest of the world?
    I'm truly 99.9999% e+/-0.5 shocked!

    --

    "I think this line is mostly filler"
  74. Standard deviation... by Muad'Dave · · Score: 1

    A standard deviation is something kinky everyone should try at least once.

    --
    Tiller's Rule: Never use a word in written form that you've only heard and never read. You will end up looking foolish.
  75. /. article leads to spread of misinformation by McFly777 · · Score: 1

    (what's new right?)

    I never even thought to conflate Std Deviation and Mean Deviation prior to reading this article/summary. I just thought of Std Deviation as that bit of the normal distribution which captures ~68.2% of the values (for +/- 1 sigma). And Yes, I knew how it is calculated, my mind just didn't go that direction.

    --

    McFly777
    - - -
    "What do people mean when they say the computer went down on them?" -Marilyn Pittman
  76. Re:So you want to decertify a management degree... by Max+Threshold · · Score: 1

    I worked for one of those guys once. Not long after I was hired, he eagerly explained to me that sigma is how many nines there are after the decimal point.

  77. WRONG by Anonymous Coward · · Score: 0

    If he doesn't like standard deviation, then what he wants is the standard error of the mean. The mean deviation is meaningless, unless you're Al Gore perhaps

  78. Standard vs. Mean by ChiRaven · · Score: 1

    Using the mean deviation is kind of like kissing your sister. Nice, but it doesn't go anywhere. Significance testing is right out, for starts.

    1. Re:Standard vs. Mean by ChiRaven · · Score: 1

      In economics, try using the mean absolute deviation for a Black-Scholes computation. Oopsie. You just way underestimated the volatility and lost your shirt.

  79. Mean Deviation Day by Anonymous Coward · · Score: 0

    So what day of the year should be Mean Deviation Day?

  80. Measures of central tendency by dsdtzero · · Score: 1

    Physicist here...

    I should think the travesty in this article is an economist not making a huge deal about the real issue here and that is measures of central tendency (any measure) only really makes sense when you're looking at gaussian type data (don't economists have fat-tail debacles etched into them at school???). Using a mean and a standard deviation, rmse or whatever to encapsulate a power law distributed thing is dangerous when you start USING it for something (like derivatives pricing). Power law distributions are more prevalent than popularly imagined... Use care when using measures of central tendency on them.

  81. Re: response by Anonymous Coward · · Score: 0

    There is nothing standard about my deviation. But I can root a mean square.

  82. Re:Statistics add plausibility - maybe not meaning by jkauzlar · · Score: 2

    I like that little poke at journalists:

    t it is not just journalists who fall for the mistake: I recall seeing official documents from the department of commerce and the Federal Reserve partaking of the conflation, even regulators in statements on market volatility.

    In other words, it's not just journalists who fall for the mistake, so do educated people.

  83. The assumption of Gaussian distributions is worse by gantry · · Score: 1

    The real danger comes not from a 50% confusion between standard deviation and mean absolute deviation; but from the assumption that the statistical distribution is Gaussian.

    Before the credit crunch, financiers who considered themselves "masters of the universe" believed on the basis of the Black–Scholes equation that they could hedge their risks with a mean time to failure of billions of years. The probability distributions were assumed to be Gaussian, but this bore no relation to the past performance of the stock market.

  84. Statistics is easy by mtthwbrnd · · Score: 0

    You just have to choose the statistic which gives you the answer that you want and always insist that your opponents are using the wrong numbers or do not possess the understanding of statistics to hold an opinion. In the end the guy who shouts the loudest will win the day. We see it every day in the Climate Debate, which of course does not exist because "the debate is over!". [/semi-sarc]

  85. Even he got STD wrong. by Anonymous Coward · · Score: 0

    He described a simple RMS calculation... but he calculated standard deviation of a sample. (It's the difference between STDEVPA and STDEVA in Excel, or a difference of n or n-1 in the denominator of the equation before the square root.)

    The true standard deviation over the five values given is actually 14.05 not 15.7...

    It made it hard for me to keep reading when his first description and number are wrong...

    poleguy

  86. Re:Statistics add plausibility - maybe not meaning by cwsumner · · Score: 1

    I like that little poke at journalists: ...

    I assume you have met a journalist, before? 8-}