Slashdot Mirror


Is Statistical Significance Significant? (npr.org)

More than 850 scientists and statisticians told the authors of a Nature commentary that they are endorsing an idea to ban "statistical significance." Critics say that declaring a result to be statistically significant or not essentially forces complicated questions to be answered as true or false. "The world is much more uncertain than that," says Nicoole Lazar, a professor of statistics at the University of Georgia. An entire issue of the journal The American Statistician is devoted to this question, with 43 articles and a 17,500-word editorial that Lazar co-authored.

"In the early 20th century, the father of statistics, R.A. Fisher, developed a test of significance," reports NPR. "It involves a variable called the p-value, that he intended to be a guide for judging results. Over the years, scientists have warped that idea beyond all recognition, creating an arbitrary threshold for the p-value, typically 0.05, and they use that to declare whether a scientific result is significant or not. Slashdot reader apoc.famine writes: In a nutshell, what the statisticians are recommending is that we embrace uncertainty, quantify it, and discuss it, rather than set arbitrary measures for when studies are worth publishing. This way research which appears interesting but which doesn't hit that magical p == 0.05 can be published and discussed, and scientists won't feel pressured to p-hack.

184 comments

  1. Statistically, yes. by Narcocide · · Score: 1

    But not always.

  2. Lazy by Anonymous Coward · · Score: 0

    They have bad data and are tired of removing outliers and getting called on it. Laziness.

    1. Re: Lazy by Anonymous Coward · · Score: 0

      I'm pretty sure that is why they require repeatable experiments. The p-value is just to make sure you don't put too much stock on results that are not representative or typical. External constraints matter a lot. If you wanted to measure how much time a person took between meals or bathroom breaks there are some obviously unreasonable numbers that a p-value would eliminate quickly.

    2. Re: Lazy by Anonymous Coward · · Score: 0

      P-values are just a way of getting an idea about how much confidence you can have in a given result compared with other ones. The confidence level itelf is just an indication of how much confidence you can have in it, not what the likelihood of it being right is. You can't do that with this statistical tool. Statistics works better with larger sample sizes as you have no way of knowing what the precise likelihood of an outlier being the one you're interested in is.

      I don't doubt that it would be great to have something better, but at the end of the day, either the findings are reliable or they're not. At some point it's going to be a binary decision. I suppose, you could add a 3rd possibility for possibly reliable, but that's going to be most of the research in some fields and not terribly informative.

  3. Uh-huh by Anonymous Coward · · Score: 0

    Mark Twain knew this a century ago and actually said it. This is not news. I can only conclude that by, 'scientists', they mean, 'millennials'. It's called disconnecting from the hive and thinking for yourself, it's hard, I know (it isn't really that hard).

    1. Re:Uh-huh by Anonymous Coward · · Score: 0

      Nice solution to the question "what's a smug way to say get off my lawn, I'm better than you?" you have there.

    2. Re:Uh-huh by Anonymous Coward · · Score: 0

      Even illiterate people have understood the uncertainty of statistics if applied to specific situations. This of course goes both ways as is evidenced by the many people who piss away their hard earned money by playing lotteries or other forms of gambling where "player skill" is not a factor.

      The root cause of the problem however is probably that someone at some point had the smart idea that astrology was making them a lot more money than astronomy possibly could. Science when adhered to its own principles is rarely profitable.
      So they started this trend that turned science into just another business. And just like any business it goes where the money is.
      And you can bet your arse that this isn't the fault of Millennials, since this has been going on for a too long time. You can't even blame Boomers for this.

      Of course this does not mean that science is dead. It's still alive and kicking.
      Unfortunately it becomes increasingly difficult to distinguish between bona fide science and garbage as this requires thinking for yourself, which is not only course is very inconvenient for most people. Most people never learned to do it in the first place.

  4. Hail incoherentism! by Anonymous Coward · · Score: 0

    If there isn't some kind of cutoff-point to determine if an effect is real, how will we determine if it's real? The answer is that we'll be forced to fall back on rhetoric. If you want to see how far that kind of reasoning can take humanity, look at the thousands and thousands of years of almost-stagnation during the enlightenment.

    The contention that this group's argument starts with is that if some effect exists, then it can be measured, but failure to figure out how to measure it doesn't mean it isn't real - sure, we should accept that, but it's trivial! The effect might be real, but if we can't figure out how to measure it, it may as well not exist for how much it can inform the progress of human knowledge. Any scientist unable to recognise this is not a scientist, no matter their degree.

    1. Re:Hail incoherentism! by MightyMartian · · Score: 2

      When I took statistics, the text made it clear that a P-value of 0.05 is *somewhat* arbitrary, in that for any individual analysis, it is a useful threshold, but by itself not an absolute indicator of significance. I think the people in this group are guilty of overstating their argument. Determining P-value, or any other statistical measure of significance, is the *start* of a study, and then comes all the hard work of determining if that value is pointing to something truly significant. But a p value of 0.05 is certainly going to suggest that the finding is significant, but it is not THE definitive test.

      --
      The world's burning. Moped Jesus spotted on I50. Details at 11.
    2. Re:Hail incoherentism! by willaien · · Score: 1

      Having P = 0.05 has led to "P-Hacking". One example was the "chocolate" trial that had a small group eating different diets and tracked a whole slew of things, looking for a correlation in any of them. It happened to be that in the small group eating 1.5oz of chocolate and otherwise dieting lost slightly more weight than the group just dieting. Due to not having a specific goal in mind, and small sample size, they were bound to determine some sort of "positive correlation" somewhere, and there you go. If it hadn't been weight loss, it would have been heart rate or something else.

      https://io9.gizmodo.com/i-fool...

    3. Re: Hail incoherentism! by phantomfive · · Score: 2

      The real problem is when scientists aren't interested in finding something significant, they are interested in getting published. In that situation, even setting the threshold at .0005 will end up with p value hacking.

      --
      "First they came for the slanderers and i said nothing."
    4. Re:Hail incoherentism! by ceoyoyo · · Score: 1

      p 0.05 is supposed to be kind of a minimum threshold. Higher than that and you really can't draw any conclusions. Less than that, and you might have something. Maybe. It's basically a first level filter.

      Does that mean you get some false negatives? Absolutely. And you also get lots of false positives.

      There seem to be a bunch of people who want to look at confidence intervals and say "well, a good part of my confidence interval is over here, which is interesting, so this is important!" There are also a bunch of people who think that too many false positives get published and want to clean things up. These two things are at odds with each other.

    5. Re:Hail incoherentism! by ceoyoyo · · Score: 1

      I much prefer Bayes factor hacking. Sounds way fancier.

    6. Re:Hail incoherentism! by parkinglot777 · · Score: 1

      Then don't use the word statistical significance to express the word "real". This "p value exaggerate" has been discussed over 10 years ago, but the p-value is still being used because many people, like you, are, as you are saying, have to accept some thing or method in order to be "real". As MightyMartian said, p-value could be used in initial test, but it should never be used as "statistical significance" at all.

  5. I used to think so by goombah99 · · Score: 4, Funny

    Then I took a course on statistics, and the stats professor told me that 47.37% of all statisticians make up their own statistics.

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re: I used to think so by Anonymous Coward · · Score: 0

      That's what i remember from my stats course as well...

    2. Re: I used to think so by Anonymous Coward · · Score: 0

      Odd, mine said 47.38%

    3. Re:I used to think so by dargaud · · Score: 1

      On average humans have one tit and one testicle...

      --
      Non-Linux Penguins ?
    4. Re: I used to think so by narcc · · Score: 1

      I head 47.379

    5. Re:I used to think so by apoc.famine · · Score: 1

      Only if you round to the nearest integer of each.

      --
      Velociraptor = Distiraptor / Timeraptor
    6. Re:I used to think so by Aighearach · · Score: 2

      On average humans have one tit

      You understanding of mammal bodies is substantially lacking.

  6. P-hacking by goombah99 · · Score: 3, Funny

    100% of all published incorrect results have a P value above 0.05

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re:P-hacking by Impy+the+Impiuos+Imp · · Score: 1

      It would be nice to see how accurate .005 is with longitudinal studies of papers and their ultimate truthiness down the road.

      Like weather predictions of 30% chance of rain at 2 pm, did it actually rain 30% of the time?

      --
      (-1: Post disagrees with my already-settled worldview) is not a valid mod option.
    2. Re: P-hacking by Anonymous Coward · · Score: 0

      Good question easily answered in the affirmative with a test.

    3. Re: P-hacking by c6gunner · · Score: 5, Insightful

      100% of all published incorrect results have a P value above 0.05

      0.05 has always intended to be the bare minimum, not a guarantee of absolute truth. If you hit 0.05, and you haven't engaged in P hacking, it indicates that there may be an effect there and that more study is warranted.

    4. Re:P-hacking by Anonymous Coward · · Score: 1

      30% chance of rain means that 30% of the given area will experience rain at 2pm.

      Not that is rains 30% of a given amount of time.

      Also, this is regardless of the rain volume.

    5. Re:P-hacking by Anonymous Coward · · Score: 0

      It would be nice to see how accurate .005 is with longitudinal studies of papers and their ultimate truthiness down the road.

      Like weather predictions of 30% chance of rain at 2 pm, did it actually rain 30% of the time?

      The way that 30% prediction is made is that past measurements of similar weather conditions resulted in rain 30% of the time. So the answer to your question is yes.

    6. Re: P-hacking by Sique · · Score: 1

      But 0.05 is as arbitrary as any value. A p-value of 0.05 means, that out of 20 studies, that consider themselves significant because of their p-value, one is a pure statistical fluke. So why not 0.1? Or 0.01? Or even 0.000,001?

      --
      .sig: Sique *sigh*
    7. Re: P-hacking by Anonymous Coward · · Score: 0

      Correct. If you need a much lower p value (like 0.01) then your sample size needs to be bigger

    8. Re:P-hacking by apoc.famine · · Score: 2

      Like weather predictions of 30% chance of rain at 2 pm, did it actually rain 30% of the time?

      That sort of research is done all the time. Usually it's on far more specific parts of weather models than the overall model. Weather models are ridiculously complicated, and scientists spend a lot of time on minor components of them like modeling aerosols better since they form the nuclei of clouds and thus rain, or the vertical humidity profile, or boundary layer dynamics. There are so many minor processes that make up weather that most of the research effort goes into things that 99.9% of the population never will even know even exist. In conjunction, all of these things will be what predict rain or temperature at a certain time.

      However, once in awhile someone revisits the models as a whole, and you get something like this: http://www.inscc.utah.edu/~pu/...

      For hurricanes in particular: http://science.sciencemag.org/...
      (If you want the pop journalism coverage of that article: https://www.theatlantic.com/sc...)

      --
      Velociraptor = Distiraptor / Timeraptor
    9. Re: P-hacking by WhiplashII · · Score: 4, Insightful

      Worse than that, if you only publish one out of 20 studies, you are reporting noise.

      --
      while (sig==sig) sig=!sig;
    10. Re: P-hacking by Anonymous Coward · · Score: 0

      That's why studies are supposed to be replicated. Also a p-value of 0.05 does not mean that there's a 1/20 chance that it was a fluke, it's that there's you have 95% confidence in it. Which is not the same thing as there being a 95% chance that it's right.

      And that's part of the problem here confidence isn't what you're implying. I do think that the status quo is not anywhere near good enough, especially with the p-hacking and lack of replication that goes on, but it shouldn't be a matter of throwing it out, it should be a matter of finding something that is more effective at indicating the reliability of the data.

    11. Re:P-hacking by Anonymous Coward · · Score: 0

      30% chance of rain means that 30% of the given area will experience rain at 2pm.

      Not that is rains 30% of a given amount of time.

      Also, this is regardless of the rain volume.

      That's not how it worked in western Washington when I grew up. 30% chance of rain meant that it would rain 30 % (or more) of the day.

    12. Re: P-hacking by Anonymous Coward · · Score: 0

      The P value is the probability that what you are observing is by random chance. 5% chance that what is being reported is random chance seems pretty damn high for a lot of work.

    13. Re: P-hacking by Anonymous Coward · · Score: 1

      No. The p-value is the probability of observing the value, or a more extreme value, given that the null hypothesis is true.

      That does not mean it is more random. It could be centered around a different mean, in which case you would say:

      Given the data, I am going to reject the null hypothesis in favor of the alternati e hypothesis because the p-value is less than .05

    14. Re: P-hacking by fropenn · · Score: 3, Insightful

      Of course 0.05 is arbitrary. But researchers have to run studies using budgets that limit the amount of subjects in the study and they also are up against the level of accuracy of the test / instrument / survey. Obtaining extremely low p-values requires one or more of these:

      1. Very large sample sizes.

      2. Extremely effective intervention that produces huge differences between your groups.

      3. Extremely accurate instruments / measures.

      4. Lying.

      These things all come at a cost, which has to be balanced between doing fewer studies at higher cost or more studies at less cost.

    15. Re: P-hacking by Anonymous Coward · · Score: 0

      Yes, some square inch of territory in the forecast area received a single drop of rain.

    16. Re:P-hacking by Anonymous Coward · · Score: 0

      In most cases, the two interpretations are equivalent. See ergodic theorem. https://en.wikipedia.org/wiki/Ergodic_theory#Ergodic_theorems

    17. Re: P-hacking by ShanghaiBill · · Score: 4, Insightful

      Worse than that, if you only publish one out of 20 studies, you are reporting noise.

      All publicly funded research should be published.

      Often the failed experiments are more important than the successes.

      Where would we be today if Michelson and Morley hadn't published their failure to measure the ether?

    18. Re: P-hacking by Anonymous Coward · · Score: 0

      > All publicly funded research should be published.

      Or, since much (most?) publicly funded research is already not published, maybe this is an indication that research shouldn't be publicly funded.

    19. Re: P-hacking by Anonymous Coward · · Score: 0

      Have you actually TAKEN a statistics class? Even like 101? The entire point is to balance the risk of committing a Type I vs Type II error. You need to draw the line somewhere, and the general consensus for many (DEFINITELY NOT ALL) studies is that you set alpha 0.05, because 95% confidence is pretty darn good for many (ONCE AGAIN DEFINITELY NOT ALL) cases. Any time the consequences of committing a Type I error are severe, you decrease alpha and any time you want to err on the side of committing a Type II error you increase alpha.

    20. Re:P-hacking by mysticgoat · · Score: 1

      I'm across the river from Washington, in Oregon.

      Here, a 30% chance of rain means that if you be on it raining, you will win 3 out of 10 bets.

    21. Re: P-hacking by denzacar · · Score: 1

      Because 95% accuracy is GOOD ENOUGH for everyday life. And should the results of at least 19 studies agree... that would be more than enough.

      Also, p-value of 0.05 doesn't mean one study out of 20 is a pure statistical fluke.
      It means that in 1 case out of 20 WITHIN the study - we don't know if it is a statistical fluke.
      Which is why studies needs large samples - so that way 2 guys out of 20 who just happen to be allergic to something in the room don't mess up the entire study.
      56 guys out of a 1000 on the other hand... Now that's significant.

      --
      Mit der Dummheit kämpfen Götter selbst vergebens
  7. Objection by Anonymous Coward · · Score: 1

    > 850 scientists and statisticians

    Not a statistically significant representation of the scientific community.

    1. Re:Objection by 110010001000 · · Score: 0

      I love it when journalists use the term "more than X". What does that even mean? It could be 851 scientists or 10,000,000,000 scientists. It is used to indicate what the journalist thinks is "a lot" (e.g. consensus) when the reality is that hardly anyone was asked.

    2. Re:Objection by Anonymous Coward · · Score: 0

      I love it when journalists use the term "more than X". What does that even mean?

      It means "barely X", but the journalist thinks you should consider it to be a lot.

    3. Re:Objection by Anonymous Coward · · Score: 0

      N=850 is perfectly fine for this, provided that 850 grouping is representative of the population of scientists at large and the number of factors you're looking at is limited.

      Honestly, I don't get where some of you people come up with those ideas. Getting that many scientists, or really most other types of people, together to participate is not easy.

    4. Re:Objection by 0ld_d0g · · Score: 1

      It probably means that they were not sure of the exact count. Happens when you have to get your article out on a deadline..

  8. All odd numbers are prime by goombah99 · · Score: 4, Interesting

    A prime number is divisible only by itself and 1
    1 is prime (by this definition)
    3 is prime
    5 is prime
    7 is prime
    11 is prime
    13 is prime
    9 is experimental error.

    The proposition that "all odd numbers are prime" has a P value above 0.05.

    --
    Some drink at the fountain of knowledge. Others just gargle.
    1. Re: All odd numbers are prime by Anonymous Coward · · Score: 0

      The even number 2

  9. not helpful by Anonymous Coward · · Score: 0

    If the goal of a field is to develop an understanding of what's actually going on in a given domain, publishing even more marginal/unreliable results is counterproductive. The signal-to-noise ratio of the scientific literature is already getting worse by the minute....

    1. Re: not helpful by Anonymous Coward · · Score: 0

      Whatever floats your boat. Outliers of certain types are very easy to find. Over time there's less attention paid to noise or maybe the noise lessens as the experiments get better streamlined

  10. I don't know by Anonymous Coward · · Score: 0

    I picked a really bad time to stop shooting up heroin.

  11. Nope. by dohzer · · Score: 4, Funny

    Nope. I'll delete it from Wikipedia later today.

  12. Elegy in a *BSD graveyard by Anonymous Coward · · Score: 0

    *BSD is dying

    The Curfeu tolls the Knell of parting Day,
    The lowing Herd winds slowly o'er the Lea,
    The Plow-man homeward plods his weary Way,
    And leaves the World to Darkness, and to me.


    *BSD is dying


    Now fades the glimmering Landscape on the Sight,
    And all the Air a solemn Stillness holds;
    Save where the Beetle wheels his droning Flight,
    And drowsy Tinklings lull the distant Folds.
    Save that from yonder Ivy-mantled Tow'r


    *BSD is dying

  13. Obligatory XKCD cartoon by nickovs · · Score: 5, Funny
    --
    If intelligent life is too complex to evolve on its own, who designed God?
    1. Re:Obligatory XKCD cartoon by Solandri · · Score: 1

      That is actually one of the problems with statistical significance. It's only relevant if you're reporting one single result. If you're reporting multiple results, then that creates a second layer of statistical significance, where on average you expect several of those results to surpass your single-sample threshold of significance just by random chance. And so your findings are only noteworthy if you get more than a certain number of results which surpass your single-sample threshold.

      If you've got just one result, the confidence level in that result is 95%. If you've got two and one is "statistically significant", the confidence level drops to 90%. If you've got three it drops to 86%. etc. It converges on 1/e when the number of results you're reporting equals the single-confidence level. i.e. For statistical significance, since your threshold is a 1 in 20 chance of obtaining the result by random chance, the odds of getting one "statistically significant" result from 20 results is 1/e. (This might be easier to see in lottery tickets, where if you've go a 1 in 1 million chance of winning, and you buy a million random tickets, your odds of winning are 1/e.) When reporting larger numbers of results, you actually expect more than a single result to surpass your 95% threshold, so the odds of finding just one or even no "statistically significant" results begin to approach zero.

      The 1/e value is the only constant point in all these probabilities. The chance for everything else varies depending on your threshold level and the number of results. So the threshold for "statistical significance" actually varies with the number of results you're reporting. But scientists report their results using the single-sample threshold for everything, as if that's the constant point, not 1/e. Which I'm guessing is what TFA (didn't read) is complaining about.

    2. Re:Obligatory XKCD cartoon by ceoyoyo · · Score: 1

      "That is actually one of the problems with statistical significance. It's only relevant if you're reporting one single result."

      No, it's not. That's one of the problems with not knowing what you're doing. You're *supposed to* formulate a detailed hypothesis and analysis plan. That plan should include criteria for deciding what tests you'll do, and what combination of tests you will judge to be supporting each part of the hypothesis. Then you perform multiple comparisons correction based on the number of individual tests that contribute to your conclusions.

      Just because nobody bothers to do it doesn't mean the procedure doesn't exist.

  14. Obligatory by Anonymous Coward · · Score: 0

    https://xkcd.com/882/

  15. The Standards of Particle Physics by Anonymous Coward · · Score: 1

    In particle physics, (the field in which I have my Ph.D. but--full disclosure--no longer work), the standard is 3 sigma to claim evidence for an effect, and 5 sigma to claim discovery. Publication of results below 3 sigma is not only encouraged, but required...it's unethical to conceal such results. A null result can be a theory killer.

    1. Re:The Standards of Particle Physics by Anonymous Coward · · Score: 1

      Physics is an outlier, in human bio-mechanics (where I also no longer work) publishing anything but a positive and significant results is outright impossible. This poisoned the field with cooky models that have no predictive power. Consequence of this, I think Boston Dynamics will get to a full body human gait model over variable terrain before actual scientists conducting research in this field.

    2. Re:The Standards of Particle Physics by habig · · Score: 1
      I still do work in particle physics. Yes, I do understand that particles are way easier to be careful with the error propagation on than anything medical. But still, we do spend 90% of the time on any given analysis "embracing uncertainty, quantifying it, and discussing it,", as TFA says. Figuring out the error bars is the hard part, and also usually the part that referees pick at to make sure you did it right.

      (BTW: Isn't p=0.05 only a 2-sigma result? Ick.)

    3. Re:The Standards of Particle Physics by sfcat · · Score: 1

      (BTW: Isn't p=0.05 only a 2-sigma result? Ick.)

      Its a bit less than 2-sigma. It should be more like 3-sigma (about p=0.01) which would make p-hacking much more difficult as it would take 100 variations to see a probable null hypothesis. Although the exact methods of conversion are complex.

      --
      "Those that start by burning books, will end by burning men."
    4. Re:The Standards of Particle Physics by ceoyoyo · · Score: 1

      The publication of inconclusive results is a problem outside physics. Particle physics does have an advantage though: the data and analyses tend to be from only a few places. In parts of physics where Joe Anybody can ask a few undergrads a handful of questions and then write a paper, there's likely less publication of all those inconclusive results.

  16. that's a stupid idea by Anonymous Coward · · Score: 0

    this will lead to as many definitions of statistically significant as there are papers. At least know you have to hack a specific statistic if you want to cheat, and this can be spotted relatively easily. Compare that with the inevitable claim "this is significant because I say so" once p-values don't have to be reported or taken into consideration anymore.

    p = 0.05 is a pretty sensible value. It means that there is only a 5% chance that the results are random. That's a pretty safe assertion. If in some field p = 0.1 is more applicable, then well argue for it.

    The initial stupid idea of course is when a journal decides to publish based only on "proven" statistical significance. Either a research paper has interesting material in it or it does not, whether it is statistically significant is an entirely different question and should be left for the reader to decide.

    Come to think of it, of course, the whole peer review is one of the most stupid ideas of humanity. First of all, who are those peers that they think they can judgge someone's work? Usually the reviewers are neither current on some subject nor qualified to make a judgment call. Serious people know that and that's why they don't volunteer to be a peer reviewer. This essentially leaves all the morons that think they should have a say over others, people who feel better when they excert power on someone else.

    Better take the social network approach: Have people vote on a paper. Much better results.

  17. Re:p-hack? by Anonymous Coward · · Score: 0

    If capitalism is so great move to Russia. No they are not socialist anymore

  18. Quant vs Qual by Nidi62 · · Score: 1

    In my International Relations graduate program there was a big push towards quantitative research and analysis; there were two mandatory classes on it. However, I always felt that it broke things down into too simplistic a view, and while it could tell things might be correlated, it never told you why. And with human systems like societies, states, conflict, politics, etc, there are so many inputs, so many factors that contribute to why people act the way they do, what decisions they make, that to boil it down to one or two that are "statistically significant" isn't missing the forest for a tree, it's missing the forest for a bush. Complex systems very often have complex inputs.

    That's why I preferred a more qualitative approach: there was no arbitrary line of significance. It allowed you to explore more complicated or elaborate analyses. There was no worry about getting bogged down in what regression method you used or why, whether your math was wrong, or you excluded/included a variable that you shouldn't have. It gives you the chance to simply lay out your theory, your interpretation, and the evidence to back up that interpretation. And best of all, it allows you do it in such a way that it makes your research much more accessible to other people. I also prefer a more narrative style of writing anyway. Now, of course this for a humanities discipline. A more scientific discipline would require significantly more math.

    --
    The only thing necessary for evil to triumph is for it to be pitted against a slightly greater evil
    1. Re:Quant vs Qual by PacoSuarez · · Score: 4, Insightful

      And this is why there is so little truth to be found in the humanities.

      Here's a scenario: A white nationalist kills dozens of Muslims. Someone looks at this and sees evidence that the normalization of fringe views, characteristic of the way president Trump talks, is emboldening these maniacs to act violently. Someone else looks at this and sees evidence that white middle-class uneducated men have been marginalized by our economic system and are at their wits' end, which is the same phenomenon that lead to Trump being elected.

      The kind of narrative-based elaborate analyses that you advocate doesn't help us decide which of the points of view above is right, and we carry on with our preconceptions, unable to learn anything.

      Narratives allow you to explain the past perfectly using models that have no predictive value. The only way to make progress when trying to understand a complex system is to come up with very simple hypotheses and try to validate them empirically. Of course this is very hard to do, but I think people in the humanities do a poor job and fool themselves into thinking they understand things they don't understand.

    2. Re:Quant vs Qual by Anonymous Coward · · Score: 0

      A person with no basis in humanities and a expert understanding of statistics isn't capable of much. A person with an expert understanding of humanities and no basis in statistics can rule the world.

      Numbers do not carry the meaning and significance that people assign to them.

    3. Re:Quant vs Qual by Nidi62 · · Score: 1

      Here's a scenario: A white nationalist kills dozens of Muslims. Someone looks at this and sees evidence that the normalization of fringe views, characteristic of the way president Trump talks, is emboldening these maniacs to act violently. Someone else looks at this and sees evidence that white middle-class uneducated men have been marginalized by our economic system and are at their wits' end, which is the same phenomenon that lead to Trump being elected.

      The kind of narrative-based elaborate analyses that you advocate doesn't help us decide which of the points of view above is right, and we carry on with our preconceptions, unable to learn anything.

      You've proven my point: they're both right. When people in power either espouse certain views or give support (whether implicit or explicit) for those views it emboldens others who hold those same views. At the same time, it's a commonly held belief that marginalization, perceived or actual, can lead one to more extremist views. Both of those very likely factored into why the person in your scenario acted the way that he did. Using numbers tries to break everything down into black and white. With people, everything is grey. There is no one right answer, but there may be many correct answers.

      And, as I like to say around here at work with our increasing focus on measurable and metrics (which we've fortunately gotten away from somewhat), numbers lie. They may tell you what's happening (assuming you are actually measuring the correct thing) but they cannot tell you why it's happening, and any solution made based on the numbers may not be the truly correct solution.

      --
      The only thing necessary for evil to triumph is for it to be pitted against a slightly greater evil
    4. Re: Quant vs Qual by phantomfive · · Score: 1

      Best description of narratives ever. It also explains why marketers like them so much.

      --
      "First they came for the slanderers and i said nothing."
    5. Re:Quant vs Qual by Anonymous Coward · · Score: 0

      All of the 'someone's in your little scenario are pundits, not researchers.

      And the way you're using pundits' narratives in order to disparage everybody and everything in the massively broad category of 'the humanities' is in and of itself, a narrative.
      And not a very good one.

    6. Re:Quant vs Qual by Anonymous Coward · · Score: 0

      You've proven my point: they're both right.

      How do you know? They both sound truthy, but are they correct? What would be the observable difference between a world in which they were correct and one in which they were not?

      Answer that last question, come up with a test to tell the difference, run the test, and crunch the numbers to determine the result. Maybe you'll find that both are true, in some proportion, or that one is true and the other isn't.

      But if you just say "they're both right" with such confidence, then you're not doing research: you're just asserting your prejudices. Which, unfortunately, seems to be standard procedure in the social sciences.

    7. Re:Quant vs Qual by Kjella · · Score: 3, Interesting

      Narratives allow you to explain the past perfectly using models that have no predictive value. The only way to make progress when trying to understand a complex system is to come up with very simple hypotheses and try to validate them empirically. Of course this is very hard to do, but I think people in the humanities do a poor job and fool themselves into thinking they understand things they don't understand.

      A person is not a dice, no matter how much you want it to be. You can ask a fairly simple question like "Would you pose for nude art?" and get a survey answer. But if you break it down there'll be a ton of factors and the more answers you get and the more fine masked you make your model you'll only end up finding more and more differences plus the answer will not remain constant in place or time with a strong group dynamic and feedback loops. And you still will not have found a meaningful answer to why, only a bunch of correlated variables. Qualitative studies do the exact opposite, they don't generalize they ask one and one subject to explain their reasoning and try to summarize them into common sentiments. It's a much more accurate description for each person and the group as a whole. It's just really hard to compare scores because it's not on a measurable willingness scale.

      Yes, we've vaguely identified some risk factors that are usually present in a terrorist. We've got a long manifestos on why exactly that person turned into a terrorist. But everyone at risk are somewhere in between, they're not just risk factors and they're not clones of the terrorist. It's something like the Heisenberg's uncertainty principle for the social sciences, the more specific knowledge you have of an individual the less applicable it's to the group and the more general knowledge you have on the group the less accurate it's for the individual. They're both circling what nobody knows for sure, what exactly goes on in somebody else's head. Until we discover mind-reading technology that's going to be an approximation at best. Just because you can sell power tools to most Americans if you throw a dart at a map you could hit an Amish community.

      --
      Live today, because you never know what tomorrow brings
    8. Re: Quant vs Qual by Anonymous Coward · · Score: 0

      >"at their wits end"

      Here's a tip. Any path in problem-solving that leads you to 8chan is not actually a path to a solution, it is a problem finding a place to validate not finding a solution.

      It also doesn't involve wit, in the traditional meaning of the word. Not even half-wit nor dim-wit. Whatever the right word is for null-wit or null-acumen.

      Your points about narrative as explanation are well made, and there is more to explanation in the sciences, humanities and arts than can be provided by using p values as the hammer to crack the problem. Your two example narratives however, by being posed as alternatives of equivalent value, run the risk of legitimising the abhorrent murders.

      A better illustration could have been chosen.

    9. Re:Quant vs Qual by Anonymous Coward · · Score: 0

      Your statement that "a person is not a dice" is no where near beeing backed by science. I would suggest that by our current knowledge people are extremely complicated dice, but dice nevertheless. There is lots of research in favor this, such as how "choice" seems to be something that the mind constructs after the fact.

      Yet somehow you agree with parent only to go off on an excuse as to why its not concievable.

      "Youre right, but its really hard and tons of work for many people over decades so ill just write some nonsense fanfic interpretation instead"

      Could have been said by anyone in humanities.

  19. Science is hard by Sarten-X · · Score: 2, Interesting

    This way research which appears interesting but which doesn't hit that magical p == 0.05 can be published and discussed

    The significance value is essentially a measurement of how good a researcher is at their job. Unfortunately, a lot of academics feel that they shouldn't be bothered by silly things like "accountability", because they've chosen the noble ivory tower of research.

    If your experiment can't hit that level of certainty, redesign your experiment. Go get more samples, run more simulations, and grow more cultures. Alternatively, go ahead and publish, but include the note that the job isn't actually finished. Use the partial result to justify asking for more funding so you can complete the work.

    • Half of your samples died unexpectedly? If you were a better researcher with better lab practices, you'd have had someone check that the equipment stayed plugged in over spring break.
    • Nobody responded to your survey? Maybe you should try something more effective than standing in a corner of the local pub for an hour asking the drunks if you can "get something good from them real quick".
    • You can't get enough reagents for your chemical process? Perhaps you should have actually budgeted for supplies, rather than host an open-bar party celebrating that you received that grant.
    • You ran out of time on the cluster computer? Next time try asking the computer science students to review your program for efficiency, rather than trying to run a direct implementation of your whiteboard notes.

    (These are all things I saw first- or secondhand during my time in academia)

    I'd be fine getting rid of the p-value, but it would have to be replaced by something else that does an equal job of filtering out the half-assed crank "research" that makes more headlines than discoveries. The only replacement I can think of that wouldn't be vulnerable to similar "hack" methods would be to require that every experiment go through an exhaustive process inspection before, during, and after the run. That's an even more painful thing to deal with than making sure your experiment can produce significant results.

    --
    You do not have a moral or legal right to do absolutely anything you want.
    1. Re:Science is hard by Anonymous Coward · · Score: 3, Interesting

      This is absolute horseshit. There is often background noise in a measurement that you CAN NOT GET RID OF. Therefore you will never get a perfect 0 p-value. In fact, you will often be unable to reduce it beyond a certain point NO MATTER HOW GOOD YOUR EXPERIMENT IS.

      What the article is arguing is that we should not be using a blunt instrument like a p-value which is often a lazy person's (like the parent poster) substitute for quality, but instead should be assessing research on its relative merit and making judgments about quality from a deeper understanding of the problems that some experiments face. Attittudes like the one the parent poster gives are why p-hacking and its associated problems exist - dilletantes like Sarten-X substiute p-values for quality, whereas actual statisticians know it cannot be used in that way.

    2. Re:Science is hard by Anonymous Coward · · Score: 1

      They should publish p-value and effect size. I'm also a big advocate of robust statistics; the assumption of an underlying normal distribution is not always justified for real world data, central limit theorem notwithstanding.

    3. Re:Science is hard by houghi · · Score: 5, Insightful

      If your experiment can't hit that level of certainty, redesign your experiment.

      Or perhaps the thing you thought was sure, isn't at all and you just proved that your idea was wrong.

      A researcher should prove and disprove, not only prove.

      --
      Don't fight for your country, if your country does not fight for you.
    4. Re:Science is hard by AmiMoJo · · Score: 1

      Alternatively, go ahead and publish, but include the note that the job isn't actually finished.

      That's what they are arguing for. Too many scientists won't submit and too many journals will reject based on the p value alone, which means some interesting ideas and data goes unpublished.

      They are not arguing for "crank" research, just that the p value isn't the be-all and end-all, and in fact no single data point should be. Work should be considered on all its merits.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
    5. Re:Science is hard by jeff4747 · · Score: 1

      The significance value is essentially a measurement of how good a researcher is at their job

      Oh dear god no.

      Proving something is not true (aka, results are not significant) is an incredibly valuable thing.

      Demanding that all researchers produce experiments that prove their hypothesis true and only true is awful, and how you get p-hacking. And it is also what you are demanding here.

    6. Re:Science is hard by werepants · · Score: 4, Interesting

      The significance value is essentially a measurement of how good a researcher is at their job.

      This is totally wrong, and reflects the exact misconception that the article is talking about. For quite a while my job was doing experiments on hardware that cost as much as $100k per sample, where test time would cost $1000/hr or more, and you needed hundreds of hours of testing to get any kind of reasonable certainty. Budgets are finite, and at some point you have to decide how good is good enough, or even if isn't good enough, there just isn't any money left to do better. We could only estimate effects to within a couple orders of magnitude at times. However, we put error bars on fucking everything, so we were very explicit about how much slop there was in the answers. How good a researcher is at their job is determined by how much they can get done with finite resources, and how deeply they understand the limitations of their knowledge. All researchers should be trying to get maximal knowledge per dollar (or per time, in some cases), and sometimes an experiment with large uncertainty is the appropriate approach, or the only thing that is feasible within time/funding/physics constraints.

      Sure, if you are doing something basic like surveys, it's not hard to increase statistics. But if you are doing medical research on a new drug, costs can run into billions and you've got major ethical quandaries every step along the way. If you are developing a drug for a rare condition, there might only be a handful of test candidates in the world, and so you literally can't increase your sample size unless you wait a decade for more incidences to crop up. In that interval, depending on the specifics of the disease, people could be suffering or dying needlessly because you haven't gotten your drug approved.

      Yes, bad research is bad, and journals are replete with examples of terrible studies being published. But the p-value doesn't help that situation - it makes it worse, because it's treated as a binary marker of success. You can easily produce a great p-value by approaching science in the exact wrong way... look for significant correlations in a large, highly multivariate dataset and you are guaranteed to find some total nonsense correlations that look flawless (like the insanely tight correlation between swimming pool drowning deaths and Nicolas Cage movies... true story).

      What we actually need is more rigorous peer review and greater transparency and information sharing in science. If it becomes standard practice to make all of your raw data and calculations public, then it will become obvious very quickly when people are fudging numbers and inflating their stats.

    7. Re:Science is hard by Sarten-X · · Score: 2

      I have a fair coin that always lands on heads, just with about 50% background noise.

      The whole point of an experiment is to remove the "background noise", which is another way of saying "uncontrolled variables". If your experiment can't isolate the target variable, then you need to fix your experiment. In the extremely rare case that the experiment can't be fixed, like in cases where a small number of particles matters (including the very small number of photons hitting a telescope sensor), you still should be acknowledging your experimental problems. Own up to having a low p-value, and explain how you did absolutely everything possible with today's technology to pull signal from that background noise.

      [We] instead should be assessing research on its relative merit and making judgments about quality from a deeper understanding of the problems that some experiments face.

      I agree, but to do that, we'd need a good way to quickly educate every other scientist on that "deeper understanding", and why it's not possible to do any other experiment that does a better job of isolating the variables. Without that, it's easy to simply claim that an overly-complicated random-number generator with cherry-picked results is really an extremely-sensitive test apparatus supporting some pet theory.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    8. Re:Science is hard by Sarten-X · · Score: 1

      What they're arguing for, in their words:

      The editors [of The American Statistician] introduce the collection with the caution “don’t say ‘statistically significant’”. Another article with dozens of signatories also calls on authors and journal editors to disavow those terms.

      We agree, and call for the entire concept of statistical significance to be abandoned.

      TFA summarizes this as a "ban on p-values".

      I'm all in favor of evaluating work on its merits, but the p-value is still a useful tool for measuring one of the most important merits: the chance that the result was completely coincidental.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    9. Re:Science is hard by Sarten-X · · Score: 1

      As I said, go ahead and publish, but include the note that the job isn't actually finished. Use the partial result to justify asking for more funding so you can complete the work.

      For a while, my job was finding the handful of drug-trial candidates you mentioned. I understand that there are times when you realistically cannot hit a high degree of probability, but that doesn't mean a p-value isn't critically important, and it certainly doesn't mean a p-value should be "banned" from peer-reviewed journals.

      If you only ran three trials before funding ran out, that's fine... just make absolutely certain that it's clear that's what happened. In that case, your published result isn't so much a claim that "this is a discovery", but "we did this, and saw this, and more work might show a discovery".

      That's how we get the "greater information sharing" that we all love. We don't get it by sweeping poor rigor under the rug.

      --
      You do not have a moral or legal right to do absolutely anything you want.
    10. Re: Science is hard by phantomfive · · Score: 2

      The scientists in the article are complaining that people conclude two things are the same when there is no statistical difference between the two. You can't conclude that: all you can say is "we aren't sure."

      --
      "First they came for the slanderers and i said nothing."
    11. Re:Science is hard by Falconnan · · Score: 1

      Well, that's the rub, isn't it? Rare is the experiment that proves an idea true. Most experiments are designed to falsify an hypothesis. Statistical noise comes in and complicates this, and can't always be accounted for. Hell, there's a philosophical case to be made that reality is the result of noise and its cancellation.

      It all really comes down to acknowledging that there's always some uncertainty to any measurement, whether due to limits of the measuring device, random noise, or previously unknown variables. This is normal. This is life. Acting like science happens in a vacuum (the quantum physics and astronomy crowds will love that metaphor) is missing the point.

      The big problem is we don't talk about degrees of uncertainty in the core research. The next big problem is our focus on positive implications as opposed to negative results, especially as publications go. And it would probably help if we eliminated the stigma attached to retractions. "We found a problem with the study," isn't supposed to be shameful, it's supposed to improve the process. Even in science, shit happens.

    12. Re:Science is hard by Anonymous Coward · · Score: 0

      Did you test the 737 max?

    13. Re:Science is hard by Anonymous Coward · · Score: 0

      All researchers should be trying to get maximal knowledge per dollar (or per time, in some cases)

      Ahh hahahaha. So naive, you are.

      As President Eisenhower warned us in our farewell address:

      Today, the solitary inventor, tinkering in his shop, has been over shadowed by task forces of scientists in laboratories and testing fields. In the same fashion, the free university, historically the fountainhead of free ideas and scientific discovery, has experienced a revolution in the conduct of research. Partly because of the huge costs involved, a government contract becomes virtually a substitute for intellectual curiosity. For every old blackboard there are now hundreds of new electronic computers.

      The prospect of domination of the nation's scholars by Federal employment, project allocations, and the power of money is ever present and is gravely to be regarded.

      Yet, in holding scientific research and discovery in respect, as we should, we must also be alert to the equal and opposite danger that public policy could itself become the captive of a scientific-technological elite.

    14. Re:Science is hard by Anonymous Coward · · Score: 1

      As I said, go ahead and publish, but include the note that the job isn't actually finished. Use the partial result to justify asking for more funding so you can complete the work.

      No! My god, no, that's a terrible idea.

      If there is no effect, then your p-value will not show significance. If you run a larger trial, your p-value will still not show significance. If you run your trial over a longer period, your p-value will still not show significance. How much funding should you spend devising more extensive experiments to search for an elusive result?

      A statistical test like this can have two results. One is that there is a significant result, beyond some threshold (e.g. p0.05), meaning that there is probably an effect. The other is that there is no significant result, in which case you can place an upper limit (with some confidence) on the size of the effect. Both are valid results that tell us something about the world.

    15. Re:Science is hard by Anonymous Coward · · Score: 0

      Yeah, no problem, I'll just go spreading ebola some more. Or I'll reboot the universe so I can get some new readings of the quadrupole moment of the CMB. Or I'll start getting some more cosmic rays to hit the planet. Hell, I'll go out and restart the mudskippers leaving the ocean so I can look through how quadrupeds evolved again. I'll go crash some more black holes into each other so I can measure the gravitational waves emitted.

      Face it, your system sucks. You cannot change a lot of how science is actually done - we don't always, or even often in some fields, get to control things.

    16. Re:Science is hard by ceoyoyo · · Score: 1

      Meh, you just have to call it something else. Confidence interval. Likelihood ratio. Bayes factor. The dirty little secret is that these things are all mathematically equivalent, or very nearly so, for the vast majority of analyses that are actually conducted.

      People like simple solutions. A demon to exorcise. P-values fit that. The real problem is lazy interpretation. Any single result is questionable, no matter how well the data is collected and analyzed. A journal article is not truth, it's an observation. But that uncertainty bothers people.

    17. Re: Science is hard by ceoyoyo · · Score: 1

      They are absolutely right on that score. People do that ALL the time. I tell my students it is the first sin of statistics because I'm sure it's responsible for the vast majority of committed statistical fallacies.

      It's the root of the "difference of difference" error, which is apparently present in 50% of neuroscience papers that have the opportunity to make it.

    18. Re: Science is hard by phantomfive · · Score: 1

      What's the difference of difference error?

      --
      "First they came for the slanderers and i said nothing."
    19. Re: Science is hard by ceoyoyo · · Score: 1

      Here's a blog (with a link to a published paper) discussing the error and it's incidence in neuroscience: https://www.theguardian.com/co...

      Basically, imagine you've got a control group and two different treatments. You determine that treatment group A is not significantly different than control, but treatment group B is. So you conclude that treatment B works better than treatment A. Implicitly, you've assumed that the non-significance of group B means "no difference" or at least "less difference." Both of these options are fallacies.

      That *seems* like a silly example, except that it's present in so many papers. But the error can be much more subtle. Have you ever seen a pair of fMRI images side by side, with blobs on one that aren't on the other? fMRI "scans" are actually statistical maps (a t-test per pixel, hopefully mostly corrected for multiple comparisons) with the coloured blobs indicating p some threshold like 0.05. Comparing two of them is committing the difference of differences sin on a massive scale.

    20. Re: Science is hard by phantomfive · · Score: 1

      Huh, so basically they are taking .05 and pretending it's zero. That is fascinating, I will look out for that now. I'm glad we had this conversation.

      --
      "First they came for the slanderers and i said nothing."
    21. Re:Science is hard by epine · · Score: 1

      If your experiment can't hit that level of certainty, redesign your experiment. Go get more samples, run more simulations, and grow more cultures.

      Ridiculous. You have to budget before data collection. But this approach could be valid, I suppose, anywhere money grows on trees.

  20. Bio/Medical Fields by Roger+W+Moore · · Score: 4, Insightful

    Plus they are almost all from biology or medicine. Just because their fields don't seem to understand what statistically significant means does not mean that the rest of us do not. Their example when two results measure the same value but one is within one sigma of a null result and the other is not they claim that people interpret this as two incompatible results!? I do not know of any physicist who would look at those data and make that assertion.

    Their paper reads more like a "I wish our colleagues understood simple statistics". Banning certain terms is not going to address the underlying problem they clearly have. The solution to ignorance is education, not censorship as they really ought to know, working in universities!

    1. Re:Bio/Medical Fields by omnichad · · Score: 3, Insightful

      Statistics in medicine are inherently messier. We don't clone people to do experiments and they don't intentionally kill people. You don't get clean control subjects.

    2. Re:Bio/Medical Fields by oh_my_080980980 · · Score: 1

      It's about publishing Potsy. Something you would know if you were an actual researcher. The publishing game is about publishing statistical significant results. If you don't have that, your research does not get published.

      If you had RTFA you would know that no one is banning anything:

      "We are not calling for a ban on P values."

      "Nor are we saying they cannot be used as a decision criterion in certain specialized applications (such as determining whether a manufacturing process meets some quality-control standard)."

      " And we are also not advocating for an anything-goes situation, in which weak evidence suddenly becomes credible."

      "Rather, and in line with many others over the decades, we are calling for a stop to the use of P values in the conventional, dichotomous way — to decide whether a result refutes or supports a scientific hypothesis."

      Reading is fundamental and you failed. Moron.

    3. Re:Bio/Medical Fields by Roger+W+Moore · · Score: 2

      It's about publishing Potsy. Something you would know if you were an actual researcher.

      I am an actual researcher. Given your lack of understanding of statistics and reliance on ad hominem attacks, if you are a researcher too then you are clearly the target audience that this paper is trying to help by reducing your exposure to simple statistical concepts that you are likely to misinterpret.
      I never said that they were calling for a ban on p-values, I said that they were calling for an end to "statistical significance". To quote:

      We agree, and call for the entire concept of statistical significance to be abandoned.

      This is just stupid. You do not stop using a valuable and sensible concept simply because some people who should know better do not properly understand it. Drawing a conclusion from your analysis is a fundamental part of doing science and it is completely proper that an author of a paper should make a statement of their conclusions based on the data. When this is whether a particular hypothesis is correct or not then you have to address the binary nature of the result otherwise you have not done your job. How strong a statement you can make will, of course, depend on how good your data are. This can vary from "the data are consistent with the Standard Model but do not rule out the presence of new physics" to "The $EXOTIC_NEW_PROCESS is ruled out at the 95% confidence level".

      Reading is fundamental and you failed.

      Please do not project your own failings onto others.

    4. Re:Bio/Medical Fields by Anonymous Coward · · Score: 0

      You don't get clean control subjects.

      Doesn't matter. As long as you're randomly assigning your subjects between the control group and the test group, your statistics will tell you the probability that your possibly-significant result came from anything except the treatment you're applying. (For example, if it came from variation in the subject pool.)

      The problems arise when (a) experimenters try to get clever by non-randomly assigned control and test subjects, e.g. to match ages or weights, to decrease the noise; (b) experimenters try multiple trials and pick the most significant result (i.e. p-hacking); or (c) experimenters do something tricky, like converting from a categorical measure to a numerical one, by plugging values into some statistical software without having a clue what it's doing.

    5. Re:Bio/Medical Fields by ceoyoyo · · Score: 1

      Sad but true. And I do medical research.

      One time a particularly annoying research assistant came running down the hall all excited about two recently published papers that showed exactly the situation you mentioned. Look! Contradictory results! Who's wrong? Uh, those results are compatible with each other. One of them had a confidence interval that completely included the other.

    6. Re:Bio/Medical Fields by ceoyoyo · · Score: 1

      No it's not. *Data* in medicine is inherently messier. That makes good statistics more important. In most cases the actual stats are easier: measurements in medicine tend to be so much crap averaged together that the central limit theorem works quite well, Gaussian assumptions are valid, and the t-test reigns supreme.

    7. Re:Bio/Medical Fields by ceoyoyo · · Score: 1

      Yes. They basically want people to stop saying p 0.05 and instead say p = 0.xxxxx. It's a great idea. As far as I can tell, it mostly happened twenty years ago when people learned how to use computers. Every once in a while I review a paper by someone who didn't get the memo and make them include their actual p-values.

    8. Re:Bio/Medical Fields by omnichad · · Score: 1

      That's just a narrow definition of statistics. Gathering that data falls under "statistics" as well.

    9. Re:Bio/Medical Fields by ceoyoyo · · Score: 1

      I absolutely include designing the study and gathering data under the heading of statistics.

    10. Re:Bio/Medical Fields by Roger+W+Moore · · Score: 1

      Statistics in medicine are inherently messier.

      I agree that it is harder to quantify your uncertainties because you have so many variables but this is what leads to incorrect uncertainty values. What we are talking about here is the correct interpretation of a stated uncertainty which is an entirely different problem to whether the stated uncertainty is correct.

    11. Re:Bio/Medical Fields by omnichad · · Score: 1

      I don't think you could ever accurately quantify the uncertainty.

    12. Re:Bio/Medical Fields by Roger+W+Moore · · Score: 1

      That's actually a given in any experiment in any field: all uncertainties are themselves uncertain to some degree. However, this in no way stops you from being able to correctly interpret what a stated uncertainty means.

  21. This won't address the underlying problem by SlaveToTheGrind · · Score: 3, Interesting

    Even without a magical "significant/insignificant" threshold, researchers will still evaluate, judge, and compare levels of significance. The pressure will just shift to come up with results that are "MORE significant" rather than "LESS significant," and thus p-hacking will continue by those that were willing to cross that line in the first place.

    The root cause is going to remain until peer reviewers force researchers to commit to how they're going to evaluate their measurements before they take those measurements. But the likely outcome would be either a lot less research would get published at all or published research would start to lose some of the imprimatur it now enjoys, including that of the peer reviewers. So that's unlikely to happen.

    1. Re:This won't address the underlying problem by Falconnan · · Score: 1

      I agree 90%. Another thing that would help would be to reduce pressure to always get a solid result at the end. "We did the experiment as designed and approved and we got very little to show for it," needs to be acceptable to avoid fraud, as well as to improve processes. "This experiment fails because 'x'," is a beautiful thing, since it has value: This experiment for this purpose doesn't work, so skip it or improve on it, please.

    2. Re:This won't address the underlying problem by ceoyoyo · · Score: 1

      Nope. What needs to happen is we need to give up on this idea that papers must be True. Scientific papers evolved from personal letters and presentations at scientific society meetings. A published paper is basically "hey guys, I think I found this thing that might be cool. Take a look?"

      The key being the last part. Have a look and see if you see the same thing. If a bunch of us do, we might be onto something. If, instead, you get a reputation for finding random crap nobody else can replicate, well....

    3. Re:This won't address the underlying problem by Anonymous Coward · · Score: 0

      Interesting, so 'truth' is determined largely by 'researcher self preservation' - drive to keep his/her job. Who knew, huh? Sounds like a broken system stuck in
      a loop.

  22. I prefer to use an average for my mis-information by Anonymous Coward · · Score: 1

    On average, humans have one breast and one testicle.

    It's even worse when economic trends are reported in the popular press.

  23. Re:All odd numbers are prime by colinwb · · Score: 3, Informative

    1 is prime by that definition, but it's mostly called a unit and defined as *not* prime to make factorising integers into primes unique (up to the order of the factors): Prime number - Primality of 1

  24. These statisticians are idealists by plague911 · · Score: 3, Interesting

    Sure, in a perfect world we would all discuss the exact probabilities. The reality is we all (even professionals in an industry) have a limited attention span. Benchmarks are useful, even imperfect benchmarks. This is just another example of some purists thinking we should move to some idealized but impractical situation

    1. Re:These statisticians are idealists by Anonymous Coward · · Score: 0

      Sure, in a perfect world we would all discuss the exact probabilities. The reality is we all (even professionals in an industry) have a limited attention span. Benchmarks are useful, even imperfect benchmarks. This is just another example of some purists thinking we should move to some idealized but impractical situation

      Science and scientists seem to be treated as ineffable, until criticism comes that cant be parried with authority. Then we get apologies and inherently un-scientific post rationalizations like this.

      It's the primary driver of why people's belief in science is fading. Scientists who do not believe in empiricism's ideal need to understand the moral of "the emperor has no clothes."

    2. Re:These statisticians are idealists by plague911 · · Score: 1

      Nonsense. We trust in science, not because we as humans are perfect, but because its a process we use to try and get as close the truth as we can. Admitting to inherent imperfection in humans in no way takes away from the scientific process. Peoples belief is fading because we have a couple political dogmas floating around that pump out the idea that the goberment/scientists/the illuminati is lying to them. Humans are flappable and yet the scientific process is ineffable when compared to "my neighbor told me"

    3. Re:These statisticians are idealists by Anonymous Coward · · Score: 0

      Sure, in a perfect world we would all discuss the exact probabilities.

      What the f?

      This is a deterministic universe, the real "probability" of anything is either 100% or 0%. That we currently dont understand all that goes into it - or entertain naive childhood notions like chance, random or choice - doesnt make it any less true.

  25. In defense of the p-value by psychic_bacon · · Score: 2

    I'm really curious about what people think about this comment and my attempt to defend p-values and statistical significance testing as a concept. I used to hate p-values like any respectable scientist, but in teaching intro college stats class (targeted to behavioral science), I've come to appreciate them, for one major reason.

    1. We have to take uncertain science and make certain decisions about the conclusions. Science gets simplified to dichotomous decisions. You either approve the drug or not. You either eat eggs or don't eat eggs. The defendant is guilty or not guilty. In each of these cases, we take scientific and other evidence and have to make a decision: do we trust these data. Confidence intervals, odds ratios, etc, help give a picture but they don't give a clear guideline about what to accept.

    2. It's really hard to understand (and teach) Bayesian and other approaches. I think that statistical significance is a decent proxy, as long as the limitations are well-understood. I am a big believer in teaching science research to people who have no desire to ever be "researchers", and in order to evaluate their studies, statistical significance is a good proxy. If you are doing an intro biology lab testing whether there are more bacteria on your hands after washing your hands versus hand sanitizer, a t-test with a p .05 criterion is a good approach. It won't get published in JAMA, but it's good for teaching research concepts.

    3. Reviewers still want p-values. Each time I have submitted a manuscript without p-values, I get a nasty reviewer who requires p-values. Maybe I've had bad luck, but I'm guessing this is pretty common in the literature. Any time I try a statistical technique that goes beyond null hypothesis testing, there is at least one reviewer who doesn't understand the technique and gripes because there are no p-values or decision criteria. As long as this is required to publish, we need to do it.

    So these aren't very good defenses, but it's why I'm still teaching p-values and null hypothesis testing. Maybe we will get rid of it, but like some other comments here, it leaves the question of what the alternative would be.

    1. Re:In defense of the p-value by Anonymous Coward · · Score: 0

      Report both p-value and effect size. For the latter, see https://www.leeds.ac.uk/educol/documents/00002182.htm

      I'd be ok with removing the arbitrary p = 0.05 threshold. I'd also love to see more negative results published; I love PLOS One for many reasons, but their willingness to publish negative results is a big one.

      I'm also a big fan of robust statistics. I found "Fundamentals of Modern Statistical Methods: Substantially Improving Power and Accuracy" by Rand R. Wilcox to be an enlightening read. This may or may not be appropriate for an intro stats class, although if you had time to cover the basic principles of robust statistics and hypothesis testing with a trimmed mean, that would be amazing. Thinking more about it though, this topic probably deserves its own course.

    2. Re:In defense of the p-value by jeff4747 · · Score: 1

      The idea of this proposal is not to abandon p values. It's to stop using p less than 0.05 as a magical threshold.

      Loosening that limit is also far more useful in studies involving the "softer" sciences, where it's not possible to control all confounding variables. I wouldn't expect much of a benefit in astrophysics as you'd get in nutrition.

      So you'd still report p values, and something with a p-value that indicates the result is basically random probably wouldn't get published. But there'd also be more papers and discussion around things where a particular experiment only reached, say, p=0.1. And that discussion could lead to further research and better designs that get a better p-value.

      Or several different approaches to study the subject at hand, each of which with a p over 0.05 because it's studying something where controls are limited or not possible (eg, can't kill your test humans). Several studies with p a bit over 0.05 that show the same result is probably as significant as one study with a p less 0.05 when you're talking about complex and uncontrollable interactions.

    3. Re:In defense of the p-value by fropenn · · Score: 1

      You have to teach it because students need to be able to read articles and interpret what they are reading. The p-value will be reported in scientific studies for a long time to come, so this is an essential skill.

    4. Re: In defense of the p-value by phantomfive · · Score: 1

      Everyone needs to understand statistics in the modern world. People who don't get lost and very, very confused.

      --
      "First they came for the slanderers and i said nothing."
  26. The problem is more that people don't understand by Opportunist · · Score: 1

    Mostly, they don't understand that the world isn't black and white.

    People want answers. That's a given. And they used to turn to science for this. I say used to, because more and more people think that woo has better answers for their questions. The reason is less that science does not have answers, but that the answers science has require thinking and understanding. They are rarely YES or NO. There's a lot of ifs and buts attached, but people don't want that. They want easy answers.

    And reality has rarely easy answers.

    "Statistically significant" doesn't mean "resoundingly YES". But that was what was read into it, and of course that expected YES cannot be delivered.

    Yes, reading statistics requires some effort by those trying to understand them. Unfortunately that's not what people want to do when they're looking for answers.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  27. Yes by Chris+Mattern · · Score: 1

    If you understand what it means and how to apply it. If you blindly slap on the formula and use the resulting number to say, "Look, it's significant!", then, no, it isn't.

  28. Re: p-hack? by Anonymous Coward · · Score: 0

    Trump is hiding in the capital. Once he says something stupid and untrue like he gets 4 more years (as if anyone would believe such a thing after all this time) the addled brains will pounce

  29. Re: All odd numbers are prime by goombah99 · · Score: 0

    It must be odd if it's prime.

    --
    Some drink at the fountain of knowledge. Others just gargle.
  30. No. by Anonymous Coward · · Score: 0

    Correlation does not equal causation. How many times have "scientists" tried to convince us things that are obviusly false are true? Evolution, global waming, the "big bang", etc. All fake science used to push a liberal big goverment agenda. This article about Statistical Significance just further supports the fact that science is not useful.

    The only real truth to be found is in the bible.

  31. 0.051 by cheaphomemadeacid · · Score: 1

    meh just set it to 0.051 and watch 90% of "science" publication burn

    1. Re:0.051 by Actually,+I+do+RTFA · · Score: 1

      I'm guessing you don't understand p-values and statistical significance. Setting the limit of publishability to 0.051 would increase the number of papers that passed the test.

      --
      Your ad here. Ask me how!
    2. Re:0.051 by goose-incarnated · · Score: 1

      meh just set it to 0.051 and watch 90% of "science" publication burn

      You mean 0.049.

      --
      I'm a minority race. Save your vitriol for white people.
  32. Re: I prefer to use an average for my mis-informat by Anonymous Coward · · Score: 0

    I think you will find all healthy humans have two breasts, the variable being size of development of mammary tissue from gender hormones.

    As this indicates the statistic significance, or the average, is only as good as the data or research behind it.

  33. Re:p-hack? by omnichad · · Score: 1

    The wait is over - YOU did!

  34. Re:All odd numbers are prime by rossdee · · Score: 1

    "A prime number is divisible only by itself and 1
        1 is prime (by this definition)"

    When I was learning Maths (Mathematics is plural where I come from) I was taught that 1 is not prime, it is a special case.

    Anyway for 1 the statement becomes:
    1 is divisible by 1 and 1

    But 1and1 is now IONOS

     

  35. Letters by Anonymous Coward · · Score: 0

    In a nutshell, what the statisticians are recommending is that we embrace uncertainty, quantify it, and discuss it, rather than set arbitrary measures for when studies are worth publishing. This way research which appears interesting but which doesn't hit that magical p == 0.05 can be published and discussed, and scientists won't feel pressured to p-hack.

    People did that kind of publishing via folded papers in envelopes called "letters." Their purpose was to discus about interesting things and prepare the work towards a formal publication. Who knows, maybe such discussions could be possible even with today's technology?

  36. Why lie when we can look ourselves? by Anonymous Coward · · Score: 0

    Or you could have read his manifesto and not had any debate.

    He thought Muslims were invaders that caused overpopulation. He was worried about overpopulation causing worse AGW and was taking care of the problem since no one else was. Called himself a socialist, eco-fascist.

    But that bit of his manifesto didn't make the news and they are attempting to prevent people from being able to read it by pulling it down where they can.
    Story about it.

  37. Re:The problem is more that people don't understan by andrewbaldwin · · Score: 1

    And reality has rarely easy answers.

    Which is why engineers answer most questions with "it depends".

  38. Re:All odd numbers are prime by Anonymous Coward · · Score: 0

    Prime numbers have to be greater than 1 so 1 is not a prime. 2 is a prime because it's only divisible by 2 and 1. Any number who's sum of the digits is a multiple of 3 cannot be prime as it will be divisible by 3. Any number that end in 5 other than 5 cannot be a prime because it is divisible by 5.

    Here is a table of all prime numbers up to 1,000:

              2 3 5 7 11 13 17 19 23
    29 31 37 41 43 47 53 59 61 67
    71 73 79 83 89 97 101 103 107 109
    113 127 131 137 139 149 151 157 163 167
    173 179 181 191 193 197 199 211 223 227
    229 233 239 241 251 257 263 269 271 277
    281 283 293 307 311 313 317 331 337 347
    349 353 359 367 373 379 383 389 397 401
    409 419 421 431 433 439 443 449 457 461
    463 467 479 487 491 499 503 509 521 523
    541 547 557 563 569 571 577 587 593 599
    601 607 613 617 619 631 641 643 647 653
    659 661 673 677 683 691 701 709 719 727
    733 739 743 751 757 761 769 773 787 797
    809 811 821 823 827 829 839 853 857 859
    863 877 881 883 887 907 911 919 929 937
    941 947 953 967 971 977 983 991 997

  39. nothing wrong with the math by Anonymous Coward · · Score: 0

    There's nothing wrong with significance testing itself. What's wrong is when you have a strong motivation to have have significant result, which makes you have a significant result way more often than is to be expected, and when scientists do sloppy application of significance testing. As we all know, rarely are applications of statistics textbook in nature. There are little assumptions here and there that are made. These are little opportunities for bias. Worse, I'd bet that even most refereed papers are barely scrutinized well enough to confirm that the choice of statistical tests were apropos. For example, I've reviewed medical articles where everything was done right but sample size thresholds were not properly satisfied for the tests used and those are the EASY cases to spot.

  40. Re:All odd numbers are prime by fibonacci8 · · Score: 1
    --
    Inheritance is the sincerest form of nepotism.
  41. Thank you! by Anonymous Coward · · Score: 0

    Slashdot.Org is full of athiestic animals who seem to ignore that the USA is turning into a africa style shit-hole full of undesireablals all based on liberal left wing stupidity. The truth of the bible helps people avoid the evils of multiculturalism and homosexuality and shuld be paid attention to instead of fake science and left wing terroristic politics.

  42. Rutherford on Statistics by Anonymous Coward · · Score: 0

    If your experiment needs statistics, you ought to have done a better experiment.

  43. Works as advertised by Anonymous Coward · · Score: 0

    Ever roll a d20? Ever get a Nat 20? There is a 1-in-20 chance that a p=0.05 is a "nat 20". If you re-run the study and have similar findings, there is a 1-in-400 chance.

    p=0.05 indicates "try again and see".

    The real problem is that replication isn't sexy. It isn't sexy to conferences, journals, funders, or even the replicators (I'm re-doing the work of others!). As such, we get to the 1-in-20 stage and not to the 1-in-400 stage.

  44. Woosh by Anonymous Coward · · Score: 0

    the woosh is strong on this one.

  45. Nature is not always Gaussian by pz · · Score: 1

    The issue I find with nearly every single biological application of p-value testing is that either the wrong test is used, or, far more frequently, the necessary validations of the assumptions of the test have not been made. I assume that among those many articles from The American Statistician (a journal that I do not read) that point will have been made because although it is a subtle one, it isn't that subtle, and it is important.

    The most commonly used statistical tests assume that unaccounted experimental variability will be Gaussian in nature. That assumption is patently false for the general case. Noise sources for some things are Gaussian -- thermal noise in an electrical signal for example -- but many, many biological sources are not.

    When Nature is non-Gaussian, you have to be extra super careful with your tests of significance. And nearly every paper that I've read skips doing noise analysis to validate their tests. Even the lowly mean and standard deviation functions assume Gaussian variability for correct interpretation. The alternative is to have p-values that are so small that results are obvious by inspection --- and then you don't need statistics.

    That's the sort of science I strive to perform.

    --

    Put my fist through my alarm clock with its ding-dong death inside my ear. - The Blackjacks.
    1. Re:Nature is not always Gaussian by ceoyoyo · · Score: 1

      No, there's a reason most people learn how to do Gaussian stats and then stop. MOST measurements have Gaussian error because that's what you get when you average or add up a bunch of random variables. Most measurements are really composites like that, and the noise is quite Gaussian.

      It's very important to recognize situations where that's not true though. Counts, surveys, ordinal scales, data that's been transformed, etc. People are legitimately terrible at doing that.

  46. It's Elemetrary Statistics 101, from 60 years ago by guacamole · · Score: 1

    If you browse around a typical statistics textbook, you will probably find a brief discussion about the difference between statistical significance and real world significance. It seems like a lot of people in sciences, specially in the soft sciences are chasing after the statistical significance because it's now some kind of a prerequisite to get published. However, their findings can amount to very little in the real world. Imagine for example that you find out there the commute distance is statistically significant between people who drink diet coke and tomato juice. Sounds like a great title for a click-bait report. But in reality, your estimates can be 7.34 miles vs 7.36 miles, a difference of 40 meters.

  47. Just an excuse to excuse publishing crap papers. by Chas · · Score: 1

    Yay.

    So we can look forward to even MORE broken, badly researched, pointless garbage being published as academically or scientifically relevant.

    Look at the finances of any journal pushing this crap. They're probably on borrowed time, in the financial sense.

    --


    Chas - The one, the only.
    THANK GOD!!!
  48. Re:The problem is more that people don't understan by Opportunist · · Score: 1

    And this is why we don't make good politicians. Politics need easy answers. They needn't be correct or even solve anything, but they have to be easy to understand.

    --
    We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
  49. Re:p-hack? by Anonymous Coward · · Score: 0

    Congratulations, you're the pea-brain who somehow dragged Trump into this first.

  50. Not Statistically Significant by Matheus · · Score: 0

    850 scientists and statisticians spouting this idea seems like a large number but compared to their total cohort around the globe I'm sorry but...

    They are not statistically significant.

    That and they are lazy.

    1. Re:Not Statistically Significant by WillAffleckUW · · Score: 1

      To get a truly significant number, you would need 850 scientists, and 850 controls (or non-scientists). And you would need a truly randomized sample of both. If all the scientists are the same age BMI and gender, it's not even close to randomized. Throw in some post-docs.

      --
      -- Tigger warning: This post may contain tiggers! --
  51. The P-Value Song by Anonymous Coward · · Score: 1

    By Michael Greenacre & Gurdeep Stephens
    Barcelona, Catalonia & Victoria, Canada, March 2015
    [Video link]

    Statistics, logistics, cladistics seem to me
    To have a common theme scientifically,
    Economists, biologists, with PhD degrees,
    They all need some proof of their theories.
    A letter is the key, you'll see clearly,
    Not B nor G nor V -- but it's the P !

    There's no values like P-values
    Like no values I know
    Think of something that is not worth proving,
    An hypothesis that everyone calls null,
    If your P is too large to reject it
    Then your experiment is rather dull.

    There's no values like P-values,
    Especially when they are low,
    Don't be sad if your P's over point-O-five,
    Just try again with samples twice the size,
    Everything is possible, just trust in me:
    Put your faith in the P.

    The F test, the Z test, the chi-square and the T
    And other cryptic terminology
    Anova, regression, tests distribution-free,
    They all need some sort of guarantee.
    So if you find a tiny effect size
    The P-value will be a good disguise.

    There's no values like P-values,
    The frequentist's hero,
    When you get that data modeling feeling
    But results you have are not a lot,
    You will need some stats that are appealing
    To show the journals your work is hot!

    There's no values like P-values
    Especially when they're low
    Don't be sad if your P's over point-O-five,
    Just try again with samples twice the size
    Everything is possible, just trust in me:
    Put your faith in the P!

  52. Too many studies by Anonymous Coward · · Score: 0

    We have too many researchers doing too many studies about the same topic and we incorrectly view each study as a separate event. Without quantification of the number of unpublished, published, and significant studies on a given topic, an individual study's relevance is unknown. If 200 separate researchers did 50 studies each (or 2000 did 5 studies) for a total of 10,000 studies, at .05 p, we could expect 500 false positives. When a study is published without knowing the universe of all studies on that topic, we do not know if any report of a significance level is really significant. Add that there is a bias to report and document positive over negative results. There is also data mining, where an existing database is used to search for any relationship among the historical variables at a p value and then report that relationship. With a large universe of studies and with data mining of historical data, an individual studies significance level is unknown and reproducible results is very low. Combining the data of published studies does not help, since their is a bias is what is reported and published.

  53. Too many studies by rechtco · · Score: 1

    We have too many researchers doing too many studies about the same topic and we incorrectly view each study as a separate event. Without quantification of the number of unpublished, published, and significant studies on a given topic, an individual study's relevance is unknown. If 200 separate researchers did 50 studies each (or 2000 did 5 studies) for a total of 10,000 studies, at .05 p, we could expect 500 false positives. When a study is published without knowing the universe of all studies on that topic, we do not know if any report of a significance level is really significant. Add that there is a bias to report and document positive over negative results. There is also data mining, where an existing database is used to search for any relationship among the historical variables at a p value and then report that relationship. With a large universe of studies and with data mining of historical data, an individual studies significance level is unknown and reproducible results is very low. Combining the data of published studies does not help, since their is a bias is what is reported and published.

  54. if you're a scientist by Anonymous Coward · · Score: 0

    you don't use statistical significance to say "true" or "False. Only mathematicians would do that

  55. So what if it is "abritrary"? by Anonymous Coward · · Score: 0

    If you discard any choice because it is arbitrary, then that is itself an arbitrary choice and should, by your metric, be discarded and another found.

    Why not 0.1? Why not 0.05. if you can't say why not 0.05, then why does anyone have to "explain" to you why not 0.1.

  56. Boring. by Anonymous Coward · · Score: 0

    Yes, yes, yes, you're very smart for saying p-hacking live five thousand other fucking know-it-alls on slashdot. Here's the point, if p-hacking exists at 0.05, then what value will it NOT exist at?

    What was that? No value of p value can be immune to it only made harder rr easier?

    Well, then, what the fuck was the point of whining about p-hacking with a value of 0.05?

    The problem is, like with the wording of the submitters, treating p-values as determining true of false. That's not what it is, and only statisticians and idiots would do otherwise, the latter because they don't know any better, and the former because that is what their realm of expertise uses it for, frequently.

  57. SW by Anonymous Coward · · Score: 0

    Ask a stupid question ... GIGO. Social scientists give science a bad rep.

  58. Yeah, but rightwingers call Nazis socialists. by Anonymous Coward · · Score: 0

    So please explain how he meant socialist a la Bernie Sanders and not a la what he's been taught Hitler was?

    1. Re:Yeah, but rightwingers call Nazis socialists. by Anonymous Coward · · Score: 0

      Bernie Sanders supporter = Shooting GOP representatives on a baseball field
      VS
      Hitler = Killing Jews because they were jews

      Not sure there is much of a difference. Socialists seem to be mentally unstable no matter which definition you use. Unless you believe killing people based on political viewpoint is acceptable and based on religion is wrong.

  59. Re: All odd numbers are prime by Anonymous Coward · · Score: 0

    "Any number who's sum of the digits is a multiple of 3 cannot be prime as it will be divisible by 3."

    Let's consider the binary number 10101. The sum of the digits is divisible by 3. Therefore, by your reasoning, the number 13 in binary is divisible by 3.

    The problem with your logic is that our computations are done in binary, now, not in decimal. We've moved away from packed decimal a long time ago. :)

  60. Yeah, we get it wrong by Anonymous Coward · · Score: 0

    Unfortunately many scientists get this wrong and nearly everyone who inteprets results definitely make a profound mistake:

    It is easy to think the choice is between insignificant and significant; if something is not insignificant, it must be significant.

    But this is fundamentally wrong. There is a third choice: unknown. The real scale of significance is insignificant - unknown - significant.

    Statistical tests can only be used to distinguish between the two leftmost choices: insignificant or unknown. They cannot prove your data contains the signal you were looking for.

    A better way to see this kind of statistical analysis is:

    * If the p value is over *treshold chosen, often 0.05*, your data is indistinguishable from random noise.

    * If the p value is below your treshold, you data may or may not show a significant signal.

  61. S7 accuracy at CERN by Anonymous Coward · · Score: 0

    Correct me if I'm wrong, at CERN, they had to achieve S7 results, before they could conclude that they had found the Higgs-Boson particle. That was a lot of data that they worked with. This kind of accuracy would be hardly possible to achieve in other areas of science.

  62. Re:All odd numbers are prime by MooseTick · · Score: 1

    "Prime numbers have to be greater than 1 so 1 is not a prime."

    According to your definition. Like most terms, there is no king to give the definitive definition. To me, prime is a cut of meat.

    Also, remember that math is just a mental contruct that allows our human minds to interpret the universe around us.

  63. Re: All odd numbers are prime by gnick · · Score: 1

    The even number 2

    "All odd numbers are prime" does not imply "no even numbers are prime".

    --
    He's getting rather old, but he's a good mouse.
  64. Re:All odd numbers are prime by thrich81 · · Score: 3, Informative

    Actually 1 is neither prime nor composite by some deep mathematical definitions which go beyond the integers -- they go into the structure of algebraic rings which are generalizations of the integers. If you allow 1 (a unit) to be prime then you break some properties and theorems which everyone generally accepts in the algebra of the integers. The most well known such property is that of unique factorization -- any natural number is factored uniquely into prime factors. If you let 1 be prime then the prime factorization of a composite number can have any number of factors of 1 in it.

    The deeper definition of a prime (from my old abstract algebra book) is, "In the Euclidean ring R a nonunit p is said to be a prime element of R if whenever p = ab, where a, b are in R, then one of a or b is a unit in R."

    And there is a king which gives the definitive definition -- it is the accepted body of mathematical definitions by the world's mathematical community. There are sometimes differing definitions of a term, but those differences are usually well spelled out in any discussions. You can choose not to accept the definitions as the professionals in the field use them but then don't claim your definition is as good or useful as that of the pros.

  65. Reality is a bitch by Anonymous Coward · · Score: 0

    Clearly, in human medicine, we can turn everything into a coin flip. gender, heredity, environment, culture, diet, all of those can be perfectly categorized and we can get 10 thousand subjects with this fairly rare disease to make a test that meets your required level of sensitivity.

    Sure, in my branch of mathematics, I can give a very clear answer. Not so much in medicine or psychology.

  66. There's nothing wrong with p. by Anonymous Coward · · Score: 0

    p is well defined. Only idiots would want to abandon a useful tool to avoid its misuse by idiots. I think this is a corollary of one of the laws of Engineering: There's no such thing as foolproof; fools are far too clever. The fact that idiots will misuse and misinterpret it is a given. It WOULD be nice if we could keep the idiots out of the population of peer reviewers and journal editors, but I wouldn't hold my breath. The use of p-value should be discipline dependent, depending on the subject, the measurement, and the community. And most importantly, the details of the experiment.

  67. Re:All odd numbers are prime by Anonymous Coward · · Score: 0

    9 isn't experimental error it's juts the one datapoint which contradicts your premis you couldn't plausibly deny having cherry picked out.

    15 (not prime 3x5), 17, 19, 21 (not prime 3x7), 23, 25 (not prime 5x5), 27 (not prime 3x9), 29 ...

    And in the otehr direction:

    -1, -3 (not prime -1x3), -5 (not prime -1x5), -7 (not prime -1x7), -9 (-3x-3), 11 (-1x11)...

  68. It's not p, it's n by WillAffleckUW · · Score: 1

    I don't care how significant your p value is, if your n is less than 40 case/control match your values are meaningless, other than proof of concept for further study.

    Wake me up when you get 256/256 fully matched case/control with true randomization. Then we'll talk p values.

    --
    -- Tigger warning: This post may contain tiggers! --
  69. Re:Do like psychologists and forget replication by Anonymous Coward · · Score: 0

    Any major that has the word "studies" in its name is worthless.

  70. == not zero by Anonymous Coward · · Score: 0

    My old stats prof taught us to read 'statistically significant' as 'not zero'.

  71. Re: All odd numbers are prime by Anonymous Coward · · Score: 0

    "Any number who's sum of the digits is a multiple of 3 cannot be prime as it will be divisible by 3."

    Fixed for pedants....

    "For any integer n, base b > 1, divisor d of b-1, and other integer m congruent to n mod b-1, if d | m, then d | n."

  72. Lies, damned lies, and statistics by NewYork · · Score: 1

    Lies, damned lies, and statistics is a phrase describing the persuasive power of numbers, particularly the use of statistics to bolster weak arguments
    https://en.m.wikipedia.org/wik...

  73. Re:Just an excuse to excuse publishing crap papers by Anonymous Coward · · Score: 0

    Go science! Scientists, our only gods of truth we can trust! Everyone else is a trump era liar, but scientists preserved the holy altar of truth.

    There is a sucker born every minute and nobody cares. Enjoy everybody, you do nothing to prevent this so you deserve this.

  74. Doesn't "significant" mean "important"? by RespekMyAthorati · · Score: 1

    I've always thought "statistical reliability" was a better name.