Why Standard Deviation Should Be Retired From Scientific Use

← Back to Stories (view on slashdot.org)

Why Standard Deviation Should Be Retired From Scientific Use

Posted by Soulskill on Wednesday January 15, 2014 @09:48AM from the hope-it-gets-a-good-pension dept.

An anonymous reader writes "Statistician and author Nassim Taleb has a suggestion for scientific researchers: stop trying to use standard deviations in your work. He says it's misunderstood more often than not, and also not the best tool for its purpose. Taleb thinks researchers should use mean deviation instead. 'It is all due to a historical accident: in 1893, the great Karl Pearson introduced the term "standard deviation" for what had been known as "root mean square error." The confusion started then: people thought it meant mean deviation. The idea stuck: every time a newspaper has attempted to clarify the concept of market "volatility", it defined it verbally as mean deviation yet produced the numerical measure of the (higher) standard deviation. But it is not just journalists who fall for the mistake: I recall seeing official documents from the department of commerce and the Federal Reserve partaking of the conflation, even regulators in statements on market volatility. What is worse, Goldstein and I found that a high number of data scientists (many with PhDs) also get confused in real life.'"

37 of 312 comments (clear)

Min score:

Reason:

Sort:

So you want to retire a statistical term... by Anonymous Coward · 2014-01-15 09:53 · Score: 5, Insightful

...because people use it incorrectly in economics? Get bent. The standard deviation is a useful tool for statistical analysis of large populations.
1. Re:So you want to retire a statistical term... by Fouquet · 2014-01-15 10:50 · Score: 5, Insightful
  
  +1 this. The problem here is the author's impression that "social scientists" and economists are scientists. The groups that he excludes in the first paragraph (physicists) are scientists. Anyone attempting to implement a statistical model designed for a large (and Gaussian) data set on a small number of data points (as the article's example does) should expect to get an answer that is at best marginal. Any scientists who ever received even the most basic of statistics and/or data analysis training knows this. Understand the problem first, then take enough data points, then carry out your statistical analysis & formulate conclusions.
2. Re:So you want to retire a statistical term... by mysidia · 2014-01-15 15:04 · Score: 5, Informative
  
  the mean *absolute* deviation, rather than the square root of the mean *squared* deviation (the standard deviation).
  The mean absolute deviation is a simpler measure of variability. However....
  The algebraic manipulation of the standard deviation is simpler; the absolute deviation is more difficult to deal with.
  Further, when drawing a number of samples from a large population --- the standard deviation of their mean deviations is substantially higher than the standard deviations of their individual standard deviations; that is to say, the standard deviation of a sample provides an estimate that is more in-line with the whole.
  That is to say.... there are cases where the Standard Deviation may be better, AND, much of statistics is using standard deviation as its basis.
  Fisher, R. 1920 Monthly Notes of the Royal Astronomical Society, 80, 758-770:
  the quality of any statistic could be judged in terms of three characteristics. The statistic, and the population parameter that it represents, should be consistent , The statistic should be sufficient, and the statistic should be efficient -- e.g. the smallest probable error as an estimate of the population. Both the standard deviation and mean deviation met the first two criteria (to the same extent); however, in meeting the third criterion -- the standard deviation proves superior.
3. Re:So you want to retire a statistical term... by Fouquet · 2014-01-15 17:04 · Score: 5, Informative
  
  That actually was part of my point. In my day job (and night job and weekend job, and, oh god I need a vacation) I'm an astrophysicist. I have more data sets that I can recall, and the number of problems for which I'm confident that the errors are Gaussian is at most 2 or 3. We're finally in an era where computational power facilitates forward modeling & Bayesian techniques that can provide good estimates of true uncertainties. But I (and many of my colleagues) barely understand how they work. Any expectation that most researchers are willing to invest the time to understand anything beyond Gaussian statistics is unrealistic.
Basic Statistics by TechyImmigrant · 2014-01-15 09:55 · Score: 4, Insightful

The meaning of standard deviation is something you learn on a basic statistics course.
We don't ask biochemists to change their terms because the electron transport chain is complicated.
We don't ask cryptographers to change their terms because the difference between extra entropy and multiplicative prediction resistance is not obvious.
We should not ask statisticians to change their terms because people are too stupid to understand them.

--
I should use this sig to advertise my book ISBN-13 : 978-1501515132.
1. Re:Basic Statistics by Mr+D+from+63 · 2014-01-15 10:04 · Score: 5, Funny
  
  We should not ask statisticians to change their terms because people are too stupid to understand them.
  But doesn't that give an unfair advantage to statisticians? You have to give everyone a chance!
2. Re:Basic Statistics by ShanghaiBill · 2014-01-15 10:20 · Score: 3, Informative
  
  The meaning of standard deviation is something you learn on a basic statistics course.
  I took a statistics course in college. The statistics professor taught us to think of the standard deviation as the "average distance from the average". So if you know the average (mean) then any random data sample will be (on average) one SD away. That is simple, neat, and easy to remember.
  It is also wrong.
3. Re:Basic Statistics by Mashdar · 2014-01-15 10:22 · Score: 3, Interesting
  
  Didn't you hear? Guassians are so 1893. And so are all of the other distributions with convenient sigma terms...
  And TFS calls standard deviation "root mean square error", which is only true if you assume the mean is a constant estimator for the distribution :(
  In any case, no one picked Gaussians because they are so easy to integrate... While we're at it, TFA should suggest we round the number e to 3, because irrational numbers are hard, and who cares what natural law dictates.
4. Re:Basic Statistics by gninnor · 2014-01-15 10:36 · Score: 5, Funny
  
  Then it would be the same as pi, and that would just be silly.
5. Re:Basic Statistics by Fly+Swatter · 2014-01-15 10:41 · Score: 3, Insightful
  
  Someone should tell that to the lawyers!
6. Re:Basic Statistics by stoborrobots · 2014-01-15 11:59 · Score: 4, Informative
  
  ... think of the standard deviation as the "average distance from the average" ... That is simple, neat, and easy to remember... It is also wrong.
  
  In fact, it is wrong in exactly the way that TFA suggests: you're describing the mean deviation...
  
  --
  "Go to CNN [for a] spell-checked, fact-checked summary" -- CmdrTaco
7. Re:Basic Statistics by camperdave · 2014-01-15 12:24 · Score: 5, Funny
  
  The phrase "orbital process" means entirely different things to brain surgeons and rocket scientists.
  
  --
  When our name is on the back of your car, we're behind you all the way!
8. Re:Basic Statistics by Daniel+Dvorkin · 2014-01-15 12:25 · Score: 3, Informative
  
  I should note that, contrary to the summary, Taleb is not properly a statistician--he's an economist
  To be fair, economics has contributed a lot to the growth of statistics as a field of study. Due to various historical quirks, econometrics developed as almost a separate field from statistics for decades, and economists have often looked at statistical problems with a fresh eye, and had insights that people working in the mainstream of statistics and biostatistics might have missed. In my own work, biostatistics-flavored bioinformatics, I've often found myself referring to the econometric literature.
  I have no idea if any of this applies to Taleb, though. Certainly TFA doesn't strike me as a particularly profound example of statistical reasoning ...
  
  --
  The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
9. Re:Basic Statistics by Ford+Prefect · 2014-01-15 14:37 · Score: 4, Funny
  
  Nuclear Resonance Imaging (NMR) was changed because people were afraid of word Nuclear despite it describing the process, unlike its replacement term.
  Also, if you arrived at a hospital saying you were there for an NMR, you might have received something other than what you were expecting.
  
  --
  Tedious Bloggy Stuff - hooray?
Issues by Edward+Kmett · 2014-01-15 09:56 · Score: 5, Informative

On the other hand, you also need to use 2-pass algorithms to compute Mean Absolute Deviation, whereas STD can be easily calculated in one pass. And you still need standard deviation as it relates directly to the second moment about the mean.
Also, annoyingly, Median Absolute Deviation competes for the MAD name and is more robust against outliers.

--
Sanity is a sandbox. I prefer the swings.
That's not the problem. by khasim · 2014-01-15 09:56 · Score: 4, Insightful

The problem is that people think they understand statistics when all they know is how to enter numbers into a program to generate "statistics".
They mistake the tools-used-to-make-the-model for reality. Whether intentionally or not.
1. Re:That's not the problem. by Deadstick · 2014-01-15 10:01 · Score: 4, Interesting
  
  Three characterizations of statistics, in ascending order of accuracy:
  1. There are lies, damned lies, and statistics.
  2. Figures don't lie, but liars figure.
  3. Statistics is like dynamite. Use it properly, and you can move mountains. Use it improperly, and the mountain comes down on you.
2. Re:That's not the problem. by JoeMerchant · 2014-01-15 10:13 · Score: 5, Insightful
  
  The problem is that peoples' attention spans are rapidly approaching that of a water-flea.
  Up until the past 50 or so years, people who learned about Standard Deviation would do so in environments with far less stimulation and distraction. Their lives weren't so filled with extra-curricular activities and entertainments that they never sat for a moment from waking until sleep without some form of stimulus based pastime. When they "understood" the concept, there was time for it to ruminate and gel into a meaningful set of connections with how it is calculated and commonly applied. Today, if you can guess the right answer from a set of 4 choices often enough, you are certified expert and given a high level degree in the subject.
  Not bashing modern life, it's great, but it isn't making many "great thinkers" in the mold of the 19th century mathematicians. We do more, with less understanding of how, or why.
3. Re:That's not the problem. by TsuruchiBrian · 2014-01-15 11:20 · Score: 3, Informative
  
  Not bashing modern life, it's great, but it isn't making many "great thinkers" in the mold of the 19th century mathematicians. We do more, with less understanding of how, or why.
  The easier math problems are lower hanging fruit. As time goes on, the problems that are left become increasingly hard. Even when they get solved, average people can't understand what it means, and that makes it hard to care about, and hward for newspapers to make money covering that story.
  Also when you read about the history of mathematics, it's easy to feel like these breakthroughs were happening all the time, compared with now, when in fact they were very slowly, and the pace of discovery is probably higher now than at any point in the past.
  It's easy to say music was better in the 70's than now when you condense the 70's down to 100 truly great songs, forgetting all the crap, and compare it to whats playing on the radio today.
4. Re:That's not the problem. by TsuruchiBrian · 2014-01-15 11:45 · Score: 4, Insightful
  
  I think it's also true that a larger percentage of people are going to university, so the average "intelligence" of people in university in terms of natural ability is probably lower now than when it was just the very best students attending.
  Most of the mediocre students today would have simply not gone to university in the past. I think the same principle holds when it comes to things like blogs. The fact that public discourse can sometimes make it seem as if people are getting dumber, when it is really just that more and more people know how to read and write and can now even be published, whereas in the past, there was a higher cost to publishing, and you were more likely to have something important to say before being willing to incur that cost.
Standard Deviation is Important by njnnja · 2014-01-15 10:02 · Score: 5, Informative

Standard Deviation is the square root of the second moment about the mean, an important fundamental concept to probability distributions. Looking at moments of probability distributions gives us lots of tools that have been developed over the years and in many cases we can apply closed form solutions with reasonably lenient assumptions. Then we apply the square root in order to put it in the same units as the original list of observations and get some of the heuristic advantages that he attributes to the mean absolute deviation.

But it is a balance, and any data set should be looked at from multiple angles, with multiple summary statistics. To say MAD is better that standard deviation is a reasonable point (with which I would disagree), but to say we should stop using standard deviation (the point made in TFA) is totally incorrect.
1. Re:Standard Deviation is Important by neonsignal · 2014-01-15 11:13 · Score: 3, Insightful
  
  I'm a little surprised at Nassim Taleb's position on this.
  He has rightly pointed out that not all distributions that we encounter are Gaussian, and that the outliers (the 'black swans') can be more common than we expect. But moving to a mean absolute deviation hides these effects even more than standard deviation; outliers are further discounted. This would mean that the null hypothesis in studies is more likely to be rejected (mean absolute deviation is typically smaller than standard deviation), and we will be finding 'correlations' everywhere.
  For non-Gaussian distributions, the solution is not to discard standard deviation, but to reframe the distribution. For example, for some scale invariant distributions, one could take the standard deviation of the log of the values, which would then translate to a deviation 'index' or 'factor'.
  I agree with him that standard deviation is not trustworthy if you apply it blindly. If the standard deviation of a particular distribution is not stable, I want to know about it (not hide it), and come up with a better measure of deviation for that distribution. But I think the emphasis should be on identifying the distributions being studied, rather than trying to push mean absolute deviation as a catch-all measure.
  And for Gaussian distributions (which are not uncommon), standard deviation makes a lot of sense mathematically (for the reasons outlined in the parent post).
Re:response by flibbajobber · 2014-01-15 10:30 · Score: 5, Funny

First!
... to within 0.5 standard deviations.
Actually, the more posts this story attracts, the more accurate your statement is, and the fewer standard deviations you are away from true first. Response times not being distributed in a Gaussian curve perhaps complicates things.
Standard deviation BAD, but mean GOOD? by PacoSuarez · 2014-01-15 10:33 · Score: 4, Interesting

Perhaps non-mathematicians don't have a problem with this, but it rubs me the wrong way.
What makes the mean an interesting quantity is that it is the constant that best approximates the data, where the measure of goodness of the approximation is precisely the way I like it: As the sum of the squares of the differences.
I understand that not everybody is an "L2" kind of guy, like I am. "L1" people prefer to measure the distance between things as the sum of the absolute values of the differences. But in that case, what makes the mean important? The constant that minimizes the sum of absolute values of the differences is the median, not the mean.
So you either use mean and standard deviation, or you use median and mean absolute deviation. But this notion of measuring mean absolute deviation from the mean is strange.
Anyway, his proposal is preposterous: I use the standard deviation daily and I don't care if others lack the sophistication to understand what it means.
I hate averages by tthomas48 · 2014-01-15 10:38 · Score: 5, Interesting

I also think averages should go away. Most people think they are being reported the median (the number in the middle) when people tell them the average. It's great for real estate agents, and people trying to advocate for tax reform, but the numbers are not what people think they are.
Re:The big picture by boristhespider · 2014-01-15 10:46 · Score: 4, Funny

I often change CSensiblyNamedClassThatDescribesItsFunctionWell to bTrue throughout the code for precisely this reason and no-one ever appreciates it :(
Mean Deviation is Always Zero by dcollins · 2014-01-15 10:47 · Score: 3, Interesting

Well... first of all, summary has it wrong. It's not "mean deviation", it's "mean absolute deviation", or just "absolute deviation" from the literature I've seen. (Mean deviation is actually always zero, the most useless thing you could possibly consider.)
Keep in mind that standard deviation is the provably best basis if your goal is to estimate a population *mean*, the most commonly used measure of center. Absolute deviation, on the other hand, is the best basis to use for an estimate of a population *median*, which is maybe fine for finances, which is what the linked paper seems mostly focused on. (Bayesian best estimators, if I recall correctly.)
If the main critique is that economists and social scientists don't know what the F they're doing, then I won't disagree with that. But no need to metastasize the infection to math and statistics in general.

--
We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
Re:"many with PhDs" by Daniel+Dvorkin · 2014-01-15 11:07 · Score: 3, Interesting

What other existing specialization in computer science, physics, etc,. do you feel is qualified to use Hadoop to process trillions of triple stores into a network and subsequently build highly multivariate link prediction models and evaluate their output statistically with respect to ground truth, to name but one trifling task?
As it happens, one of my colleagues runs a project which, among other things, does exactly that. His PhD is in computer science. I'm a bioinformaticist with a background primarily in biostatistics; I couldn't develop a tool like that, but I can certainly see the value in it. In general, I'm not arguing that the tasks currently getting lumped together under "data science" aren't valuable. I'm just saying that I'm not convinced they fit together into a coherent field that can meaningfully be studied in a single degree program, and attempts to make them so may well run into the problem of "jack of all trades, master of none."

--
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Re:The big picture by FriendlyStatistician · 2014-01-15 12:01 · Score: 4, Informative

Hi, I'm a statistician.
It's not so simple to just say "ok, we're going to use the Mean Absolute Deviation from now on." The use of standard deviation is not quite the historical accident that Taleb makes it out to be--there are good reasons for using it. Because it is a one-to-one function of the second central moment (variance), it inherits a bunch of nice properties that the mean absolute deviation does not. There is not a one-to-one correspondence between variance and mean absolute deviation.
Taleb is correct that the mean absolute deviation is easier to explain to people, but this is not just a matter of changing units of measure (where there is a one-to-one correspondence) or changing function and variable names in code (where there is again a one-to-one correspondence). Standard deviation and mean absolute deviation have different theoretical properties. These differences have led most statisticians over the last hundred years to conclude that the standard deviation is a better measure of variability, even though it is harder to explain.
Re:The big picture by mythosaz · 2014-01-15 12:09 · Score: 4, Interesting

I would have said "18 half gallon pottles to the quarter-barrel firkin."
Wolfram Alpha says 15.75 pottles to the firkin, but that's because of US/UK gallon conversions, I reckon.
352 nails in a chain - which was interesting to me, in that Google includes those units in its calculator.
I now know more about pottles, firkins, nails and chains that I did when I woke up. I shudder to think about what got pushed out of my old head to make way for this new minutia.
Re:The big picture by reve_etrange · 2014-01-15 12:21 · Score: 3, Insightful

I think NNT is saying that the MAD ought to be used when you are conveying a numerical representation of the "deviations" with the intent that readers use this number to imagine or intuit the size of the "deviations." His example is that of how much the temperature might change on a day-to-day basis. According to him, it's not just that the concept is easier to explain, but that it is the more accurate measure to use for this purpose.
Based on his other work I'm sure he understands that the STD is generally superior for optimization purposes, fit comparison, etc.

--
.: Semper Absurda :.
normal densities by stenvar · 2014-01-15 12:22 · Score: 3, Informative

For normal densities, standard deviations and MAD are just proportional, with a factor of about 1.25, so it doesn't matter which you use.
For non-normal densities, neither of them really is universally "right" for characterizing the deviation, but it's mathematically a whole lot easier to understand how standard deviation behaves in those cases than MAD. So even there, standard deviations are usually the better choice.
Re:So you want to decertify a management degree... by rwa2 · 2014-01-15 13:08 · Score: 4, Funny

...and besides... JUST THINK of all the rigorous Lean Management courses that will have to re-certify all of their "Six-Sigma Black Belts" to some kind of "Half-Dozen of the Other" degrees!
PANDEMONIUM!!!
Re:Would those data scientists with PhDs by The_Wilschon · 2014-01-15 13:44 · Score: 3, Insightful

I know several people who have left high energy physics to become data scientists. Nobody in HEP calls themselves a "data scientist", but that's (some of) what we do anyway. It's just analysis of very large data sets. Unlike in the life sciences, both HEP and many commercial / industrial environments have sufficiently large data sets that very complex questions can be asked and answered. You can never have "enough data" -- if you think you have "enough data", then you aren't asking hard enough questions.

--
SIGSEGV caught, terminating

wait... not that kind of sig.
Data Science by Lamps · 2014-01-15 14:51 · Score: 4, Informative

Data science is a field that combines machine learning and statistics to derive meaning from data. Data scientists should be reasonably well-versed in classical stats, but the data sets they deal with are often huge, ill-defined, and not amenable to analysis using classical methods. To deal with such challenges, data science recruits a healthy combination of certain areas of comp-sci (databases, machine learning, NLP, AI), statistical methods, and, quite often, improvisation.
Strange that there are so many people on here that are unfamiliar with data science.
Standard Deviation is fn of 2nd moment of the data by MickLinux · 2014-01-15 16:06 · Score: 3, Informative

I can really go for renaming standard deviation, but it should not be abolished.
Standard deviation is a function of the second moment of the data, and if you remember your laws for combining moments of inertia (the parallel axis theorem), then you'll understand better what you're dealing with.
2nd moments detail resistance to spin, and thus the resiliance of your findings to changes and errors.

--
Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
Re:The big picture by Jamu · 2014-01-15 23:29 · Score: 3, Funny

pnWhat vIs nWrong cWith aHungarian nNotation?

--
Who ordered that?