Science and the Shortcomings of Statistics

Lies, Damned Lies, and Statistics. by Shadow+of+Eternity · 2010-03-17 14:31 · Score: 5, Informative

In other news math may not lie but people still can, all the honesty and good statistics in the world doesnt help end-user stupidity, and there are statistically two popes per square kilometer in the vatican.

--
A bullet may have your name on it but splash damage is addressed "To whom it may concern."

Re:Lies, Damned Lies, and Statistics. by jeckled · 2010-03-17 14:41 · Score: 3, Informative

Also, statistics are often manipulated to suggest correlations where there are none.
Re:Lies, Damned Lies, and Statistics. by Michael+Kristopeit · 2010-03-17 15:02 · Score: 2, Insightful

valid correlations are often manipulated to suggest causation where there is none.
in the end, it's only a problem if the person listening is an idiot...
Re:Lies, Damned Lies, and Statistics. by dwarfsoft · 2010-03-17 15:03 · Score: 4, Funny

As with everything, xkcd delivers. My personal favorite :)
People often get caught assuming that Correlation == Causation.

--
Cheers, Chris
Re:Lies, Damned Lies, and Statistics. by Cryacin · 2010-03-17 15:08 · Score: 5, Funny

Exactly. I would never believe a statistic that I did not make up myself!

--
Science advances one funeral at a time- Max Planck
Re:Lies, Damned Lies, and Statistics. by Opportunist · 2010-03-17 15:33 · Score: 1

While we're at it, stay away from hospitals! Most people in civilised countries die there rather than anywhere else!

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Re:Lies, Damned Lies, and Statistics. by scotch · 2010-03-17 15:34 · Score: 1

xkcd links - the new hot grits

--
XML causes global warming.
Re:Lies, Damned Lies, and Statistics. by Entrope · 2010-03-17 15:41 · Score: 1

That error at least something that is debunked fairly often. It's harder to explain to most people (those without sufficient background in mathematics or statistics) that you're going to have results that falsely appear to be "statistically significant" if you repeat a random trial often enough. For example, you should expect to find a wrong p=0.05 result within 15 trials. For whatever reason, people hear jargon like "statistically significant" and forget the truism about a stopped clock.
Re:Lies, Damned Lies, and Statistics. by Lars+T. · 2010-03-17 15:43 · Score: 1

and there are statistically two popes per square kilometer in the vatican.
But the expected value of popes per Vatican City is still one.

--
Lars T.
To the guy who modded me down from perfect to terrible Karma - Apple haters still suck
Re:Lies, Damned Lies, and Statistics. by menkhaura · 2010-03-17 15:56 · Score: 2, Funny

In other news, researches proved that water causes cancer. 100% of the cancer patients that died in 2009 drank water regularly.

--
Stupidity is an equal opportunity striker.
Fellow slashdotter Bill Dog
Re:Lies, Damned Lies, and Statistics. by Mitchell314 · 2010-03-17 15:59 · Score: 1

That since a dead clock is right twice a day, those two times cause the clock to work again?

--
I read TFA and all I got was this lousy cookie
Re:Lies, Damned Lies, and Statistics. by Mitchell314 · 2010-03-17 16:02 · Score: 2, Funny

*Only for nonzero values of pope.

--
I read TFA and all I got was this lousy cookie
Re:Lies, Damned Lies, and Statistics. by crmarvin42 · 2010-03-17 16:27 · Score: 4, Funny

That particular oversight drives me nuts. An extension of that is when someone uses orthogonal polynomial contrasts and multiple comparison tests on the same data without adjusting their alpha level. If Tukey's HSD accounts for all tests and gives you an overall alpha of 0.05, and you then proceed to run linear and quadratic contrasts, the combined alpha level is actually 0.10, not 0.05 because Tukey's doesn't adjust for contrasts and contrasts don't contain adjustments for multiple comparisons.

I'm actually at a scientific meeting and saw 7 presentations in which they "double dipped" on their statisitics before we broke for lunch.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde
Re:Lies, Damned Lies, and Statistics. by Shadow+of+Eternity · 2010-03-17 16:58 · Score: 1

Someone needs to find a way to throw Avignon into this.

--
A bullet may have your name on it but splash damage is addressed "To whom it may concern."
Re:Lies, Damned Lies, and Statistics. by jpate · 2010-03-17 17:06 · Score: 1

Depends on your definition of expectation ;)

maximum likelihood isn't the only way to go.... and sparse data is exactly where it crashes and burns
Re:Lies, Damned Lies, and Statistics. by siloko · 2010-03-17 17:29 · Score: 1

I really, really hope you're being ironic.
Re:Lies, Damned Lies, and Statistics. by Anonymous Coward · 2010-03-17 18:06 · Score: 1

As with everything, xkcd delivers.
It certainly does. It's about time someone made fun of the crowd who parrot "correlation doesn't imply causation" after finding this it in their introductory stats course. Someone should maybe remind them that causation does imply correlation.
People often get caught assuming that Correlation == Causation.
I'm not sure whether you have a super-dry sense of ironic humor, or whether you didn't get the xkcd strip.
Re:Lies, Damned Lies, and Statistics. by DynaSoar · 2010-03-17 18:13 · Score: 1, Insightful

there are statistically two popes per square kilometer in the vatican.
One does not do statistics on single data points. Statistics are for estimating results in large numbers of cases using smaller numbers of cases. When there is only one case, we use an entirely different means of reporting data. It is called a measurement. There is one vatican. Results would be reported as X per vatican. By direct measure, the answer is one pope per vatican. Your own result reflects your incorrect thinking. There is no square kilometer in one vatican. One does not use a unit of measure of area for a target smaller than that unit.

--
"I may be synthetic, but I'm not stupid." -- Bishop 341-B
Re:Lies, Damned Lies, and Statistics. by Opportunist · 2010-03-17 18:21 · Score: 2, Funny

Not only that, but it is also the key ingredient in most of today's problems. It's the core element of acid rain, it's a main ingredient in beer and many other alcoholic beverages that cause families to break apart, you find it in fattening food and it is the main ingredient in all high carbon soft drinks.
Consuming that stuff might also lead to antisocial behaviour, as it has been confirmed that all murderers, gunmen and even terrorists have consumed it pretty much all their life. When are we going to ban that substance? Doesn't anyone think of the children anymore?
Please read on. It is a serious problem and people should be informed urgently. Giving one sided, biased and doctored results is a really urgent problem in today's statistics and presentation of information. I beg you, make sure you read it and heed the warning. Think of the children!

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Re:Lies, Damned Lies, and Statistics. by the_womble · 2010-03-17 18:35 · Score: 3, Insightful

The problem is that a lot of people believe statistics produced by an expert such as a doctor. Sri Roy Meadow had people sent to prison, and lots of children taken away from their parents, by misinterpreting statistics.
Re:Lies, Damned Lies, and Statistics. by the_womble · 2010-03-17 18:37 · Score: 1

That should, of course, read Sir Roy Meadow....
Learn to read the preview!
Re:Lies, Damned Lies, and Statistics. by Hognoxious · 2010-03-17 22:06 · Score: 1

Hydroxymethylenemethide is the best cure.

--
Confucius say, "Find worm in apple - bad. Find half a worm - worse."
Re:Lies, Damned Lies, and Statistics. by nyctopterus · 2010-03-17 22:09 · Score: 1, Insightful

But a real correlation (i.e. not a fluke) does imply causation, it just doesn't tell you where the causation coming from.
Re:Lies, Damned Lies, and Statistics. by Chrisq · 2010-03-17 22:30 · Score: 1

And I thought he was a Hindu swami
Re:Lies, Damned Lies, and Statistics. by Chrisq · 2010-03-17 22:33 · Score: 5, Funny

That since a dead clock is right twice a day, those two times cause the clock to work again?
No, the clock is right all of the time, it just shows local sidereal time and is often in the wrong place
Re:Lies, Damned Lies, and Statistics. by Chrisq · 2010-03-17 22:42 · Score: 1

As with everything, xkcd delivers.
It certainly does. It's about time someone made fun of the crowd who parrot "correlation doesn't imply causation" after finding this it in their introductory stats course. Someone should maybe remind them that causation does imply correlation.
People often get caught assuming that Correlation == Causation.
I'm not sure whether you have a super-dry sense of ironic humor, or whether you didn't get the xkcd strip.
You have to be very careful in saying that correlation even implies causation. You could be looking at two effects of an underlying cause. For example you could look at people coming out of a pub and conclude that there is a strong correlation between having a stumbling gait and slurred speech. It would be wrong to conclude that one of the observed phenomena caused the other.
Re:Lies, Damned Lies, and Statistics. by imakemusic · 2010-03-17 22:55 · Score: 4, Funny

Indeed. For example: 6 out of 7 dwarves aren't Happy.

--
Brain surgery - it's not rocket science!
Re:Lies, Damned Lies, and Statistics. by schon · 2010-03-18 01:05 · Score: 1

Don't forget that before the 20th century, most people only lived to 32!
Re:Lies, Damned Lies, and Statistics. by Mashdar · 2010-03-18 01:09 · Score: 1

You are basically saying "something which exists implies that something caused it to exist". This is not what the phrase "correlation does not imply causation" addresses at all. People assume that if fat people are more likely to have heart attacks, that their fat causes heart attacks. A correlation between A and B does not imply that A causes B or B causes A, when all confounding variables C have not been ruled out (ie diet, exercise, accounting the sum of which completely remove correlation between weight and life expectancy, despite what the gossip media might imply).
Re:Lies, Damned Lies, and Statistics. by Bakkster · 2010-03-18 01:31 · Score: 1

And it's further complicated that beyond just A->B or B->A, you have additional cases such as C->(A+B), or (A+B)->D where your study inadvertanty selects for D.

--
Write your representatives! Repeal the 2nd Law of Thermodynamics!
Re:Lies, Damned Lies, and Statistics. by jimbolauski · 2010-03-18 01:54 · Score: 1

What is the clock is just slow then it's wrong all the time

--
Knowledge = Power
P= W/t
t=Money
Money = Work/Knowledge so the less you know the more you make
Re:Lies, Damned Lies, and Statistics. by Anonymous+Cowpat · 2010-03-18 02:37 · Score: 2, Interesting

I saw a fascinating presentation by an eminent professor of physics on what Meadow did wrong. It boiled down to mis-applying bayes' theorem. Meadow had got an extremely high probability of the accused being guilty out of it, what the professor did was poit out that (a) the probability put in for chance of two babies dying couldn't be taken by simply multiplying the chance of one dying by itself as the event may not be independent (b) that would be rather moot because the accused's chance of having 2 dead babies was 1.
Putting the correct numbers in and turning the handle produces a chance of guilt of about 1%.
What was most shocking was that, given the elementary error in the application of statistics, no-one called him on it in court.

--
FGD 135
Re:Lies, Damned Lies, and Statistics. by silentcoder · 2010-03-18 03:32 · Score: 2, Informative

The actual truth, as usual, is a bit more complex than the bit we all remember and quote.
Where a correlation occurs there are four distinct possible reasons:
Let say a correlation that during the time when X is known to have increased, Y showed a corelatory increase. then
1) It is possible that this is because X caused Y - e.g. the causation that way isn't implied - yes it's a possible implication.
2) It is possible that X in fact caused Y (e.g. the causation is in fact in the opposite direction of what the quoter of the stat is trying to say).
3) It is possible that X and Y were both caused by an unknown third factor Z.
4) It is possible that X and Y were caused by completely different factors and their correlation is purely coincidental.
The mere existence of a correlation does not imply any of these four possibilities more strongly than the other - they are equally likely unless additional data is presented to corroborate one.
The example from my philosophy textbook (which I'm shamelessly citing here) was this:
Between the period 1955 to 1965 the number of schools in the US where sex-ed was given increased by 75%, during the same period the amount of teenage pregnancies increased by nearly 80% (Both compared to the decade before that). Conclusion - giving sex-ed led to more teenage pregnancies.
This citation is classic example of the correlation/causation mistake in that it assumes option 1.
In this case option 2 actually seems quite likely - if teenage pregnancies were going up, that would put pressure on schools to give sex-ed to try and reverse this trend.
But what if we consider more available data. Specifically that the pill came on the market in 1953 sparking the sexual revolution.
If we consider that the pill let to a more relaxed attitude among teenagers about sex, but that this attitude probably spread a lot faster than actual usage of the pill then it explains the increase in teen pregnancies which combined with the known presence of this attitude would put pressure on the schools to give sex ed.
So then it suggests that in fact we have options 2,3 and 4 happening in a commonly reinforcing manner. The only conclusion that isn't supported by the data at all is option one.
With each bit of additional data added, including comparison with other times where there was a sharp increase in teenage pregnancy (like the early years of the current decade under the Bush administration) we find that the likelihood of X actually causing Y in this example gets smaller and smaller and in fact becomes statistically insignificantly small.
But that doesn't mean option 1 is never the right answer. Sometimes a correlation really is due to causation. You just cannot assume it without further evidence.

--
Unicode killed the ASCII-art *
Re:Lies, Damned Lies, and Statistics. by gumbi+west · 2010-03-18 03:34 · Score: 1

Is there a textbook or journal article (subscription required is okay) that describes what you are saying? Also, what field do you work in?
Re:Lies, Damned Lies, and Statistics. by Lost+Race · 2010-03-18 03:38 · Score: 1

So "miles per hour" is meaningless when you drive for less than an hour?
Re:Lies, Damned Lies, and Statistics. by skids · 2010-03-18 04:20 · Score: 1

Way over my head, statistically speaking :-) But then, I always froze up at "why always the sqrt(x^2) weighting function?" and couldn't get much further into statistics because that never made sense to me. I guess from reading up on that issue that it's just the default weighting function and while it may be right more often than another default, people don't generally consider using a different one because it changes a huge body of established techniques if they do.
The OP might have been able to squeeze another section out by mentioning that weighting functions may be inappropriately applied to the wrong types of variables, but I think there were enough foreheads hitting keyboards by the time the end finally came. Great article, but a bit dry for some audiences.

--
Someone had to do it.
Re:Lies, Damned Lies, and Statistics. by lxs · 2010-03-18 04:39 · Score: 1

Luckily, in 1901 Nikola Tesla invented the six bit age counter and we lived happily ever after.
Re:Lies, Damned Lies, and Statistics. by pjt33 · 2010-03-18 08:18 · Score: 1

I'm sure I remember reading in New Scientist in 1998 or so that there was precedent forbidding the defence from explaining Bayes' theorem because it would confuse the jury.
Re:Lies, Damned Lies, and Statistics. by bhiestand · 2010-03-18 19:11 · Score: 1

Indeed. For example: 6 out of 7 dwarves aren't Happy.
Reminds me of the joke that 4 out of 5 people enjoy gang rape.

--
SWM seeks new sig for a brief fling
Re:Lies, Damned Lies, and Statistics. by AK+Marc · 2010-03-18 20:49 · Score: 1

That error at least something that is debunked fairly often.

Then you missed the joke. Read the mouseover and the last frame again. He changed his mind after taking a class, but doesn't know whether they are related because he learned correlation doesn't mean causation. It's obviously a case of correlation equaling causation, and the mouseover also brings home the point that it isn't a proof, but that it often is strong evidence.

Even if it isn't proof, just whining "correlation isn't causation" doesn't do anything to argue against proof. It's just a whine by someone who has no legitimate alternate hypothesis.

For whatever reason, people hear jargon like "statistically significant" and forget the truism about a stopped clock.

Worse is where they hear "statistically significant" and think that it's actually significant. Say I'm testing whether idiots cause cancer. I assert that idiots give off idiot pheromones and test whether large amount of human pheromones cause cancer in rats. The answer comes back as "yes." So I publish, asserting that a study about idiots causing cancer is statistically significant. The results may be statistically significant, but not actually significant, and the findings are essentially unrelated to what's claimed.

--
Learn to love Alaska
Re:Lies, Damned Lies, and Statistics. by crmarvin42 · 2010-03-20 01:16 · Score: 1

Any introductory statistics text should cover the issues of multiple comparisons. Basically, if you have 6 treatments, that gives you 5 degrees of freedom to perform contrasts. That 5 specific comparisons, one for each degree of freedom. However, with 6 treatments, there are 6 x 6 or 36 possible comparisons. If you want to have a 5% probability that differences are based on random chance, then you use a P-value 0.05 as your cut off for statistical significance. However, if you want to do all 36 comparisons, that is 36 * 0.05 or 180% chance that one of your comparisons is a false positive. In order to keep the overall chance of false-positives below 0.05, you use an adjusted P-value for any individual comparison (0.05/36) or 0.00139, so that the cumulative chance of a false-positive is 0.05. (there are other methods for adjusting, but they are all based on adjusting the P-value for individual comparisons to achieve the desired overall P-value.

The problem comes from the fact that none of the procedures for correcting the overall P-value include a method for accounting for contrasts (linear effect, quadratic effect, etc.) run on the same dataset, so using both ends up giving 2 P-values for comparisons of the same data without any of the necessary adjustment for the extra comparisons.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde
Re:Lies, Damned Lies, and Statistics. by Anonymous+Cowpat · 2010-03-22 21:16 · Score: 1

sounds like the lack of explanation confused the jury...

--
FGD 135

Re:Its common knowledge by snl2587 · 2010-03-17 14:39 · Score: 2, Funny

How do you figure that? My latest calculations placed it at 70% [Note: Error +/- 10%].

Summery? by sincewhen · 2010-03-17 14:40 · Score: 4, Funny

It's not just statistics that people have a problem with...

--
-- Braden's law of data: All data spends some of its lifetime in an excel spreadsheet.

Re:Summery? by scdeimos · 2010-03-17 15:05 · Score: 1

I think it's meant to give you a nice, warm feeling just like on those hot summery days.
Re:Summery? by oGMo · 2010-03-17 15:18 · Score: 3, Funny

From your sig:

-- Braden's law of data: All data spends some of it's lifetime in an excel spreadsheet.

What's that law about spelling/grammar corrections inevitably having spelling or grammar mistakes in them?

--
Don't think of it as a flame---it's more like an argument that does 3d6 fire damage
Re:Summery? by martin-boundary · 2010-03-17 15:46 · Score: 4, Funny

Godwin's.

Only if the sentence misspells Hilter.
Re:Summery? by fotoguzzi · 2010-03-17 16:03 · Score: 1

Muphry's Law

--
Their they're doing there hair.
Re:Summery? by icannotthinkofaname · 2010-03-17 16:12 · Score: 2, Informative

That would be Muphry's law.
For details on Muphry's law, click on the above hyperlink. For more fun laws, click on the below hyperlink.
More fun here.

--
Let q be a radix > 1. I am in ur base-q, killing 10 d00ds.
Re:Summery? by sincewhen · 2010-03-17 18:14 · Score: 1

I dont no about that law.
Anyway, thanks for the reminder - I drive with sigs off so I'd forgotten I had one.

--
-- Braden's law of data: All data spends some of its lifetime in an excel spreadsheet.
Re:Summery? by Saroful · 2010-03-17 19:37 · Score: 5, Informative

And what's the law about spelling/grammar corrections that incorrectly correct the supposed spelling error? (Redundancy is purposefully deliberate.) "Its" is possessive. "It's" is a contraction of "it" and "is". -- This has been a message from your friendly neighborhood Spelling Nazi.
Re:Summery? by dkleinsc · 2010-03-17 23:09 · Score: 1

What do you mean, mein dickie old chum?

--
I am officially gone from /. Long live http://www.soylentnews.com/
Re:Summery? by bingoUV · 2010-03-18 00:36 · Score: 2, Insightful

No idea, but there must be a law about people assuming that editing Slashdot signature doesn't affect posts made previous to the edit.

--
Bingo Dictionary - Pragmatist, n. A myopic idealist.
Re:Summery? by ColdWetDog · 2010-03-18 03:38 · Score: 1

To you AND the mods:

One Slashdot WHOOSH! point.

--
Faster! Faster! Faster would be better!

I dated a short, summery girl once by Chess+Piece+Face · 2010-03-17 14:40 · Score: 1

She was like a little ball of sunshine.

As for statistics, does this really surprise anyone in a time when net polls are being reported as hard news?

Example: Standard Deviation by cytoman · 2010-03-17 14:42 · Score: 4, Interesting

My doctor was explaining to me that my blood sugar readings should not have a standard deviation of more than 1/3rd of the average blood sugar reading. Just to test if he knew what it meant, I asked him what a standard deviation was. Oh the fun when he tried to bullshit his way out of that one! He eventually told me that when I plot my data in Excel I can ask it to give me statistics on the column and it would mention what the standard deviation value was. But when I pressed on and asked him what a standard deviation is, he shooed me off and told me to go look it up. Never did he confess that he had no clue.

Re:Example: Standard Deviation by LingNoi · 2010-03-17 14:54 · Score: 1

I don't see this being a problem. It's not his job to know, just like it is not his job to know how to write the excel spreadsheet to come out with the values he uses to help you.
That being said it wouldn't hurt if everyone had a better understanding of statistics.
Re:Example: Standard Deviation by cytoman · 2010-03-17 14:59 · Score: 4, Informative

Standard deviation is what you learn very early in school. And this was a endocrinologist - a specialist who no doubt took a lot of Biostatistics courses and such, and used a lot of statistics all through his education. And you are telling me that it's not his "job" to know? Wow! We are talking the most basic stuff that anyone with a degree in the sciences should know. It's almost like saying that an English major can be excused if he doesn't know that 2+2=4 because "it's not his job to know".
Re:Example: Standard Deviation by ottothecow · 2010-03-17 15:14 · Score: 1, Interesting

Nah, it is his job to know how to use the calculation.
I certainly don't remember how to do all those statistics calculations by hand but I use SAS and excel almost every day and they don't seem to have forgotten...give me a few more years and I might be at the point where I wouldn't be confident trying to explain what a standard deviation actually "is"

--
Bottles.
Re:Example: Standard Deviation by cytoman · 2010-03-17 15:19 · Score: 4, Insightful

You are missing the point - he did not know what a standard deviation means! That is unforgivable for anyone with a medical degree...hell, it's unforgivable for anyone who has passed a course in statistics in school.
Re:Example: Standard Deviation by PSUspud · 2010-03-17 15:32 · Score: 2, Insightful

As a statistics teacher (HS / Tech school level), this doesn't surprise me in the least. Statistics and statistics education has become a giant game of "plug the numbers in and damn the understanding". When a student has never calculated a standard deviation by hand, how can they be expected to know what the heck a root mean square deviation from the sample mean really is?
Going further, I would say that statistics is a tool for answering questions. Like any other tool, it works well for some jobs and not for others. So far, no problem. But the problem comes from students that are just not willing to understand the questions that statistics can answer. Case in point -- a p value of 0.05 does _not_ mean that the null hypothesis has a 95% chance of being wrong. That's what stats students want it to mean, because they are not willing to ask the questions that stats can answer.
Until students are willing to actually do the work, for the sake of actually learning, I don't see any hope.

--
----- Why sig when you can sign? PGP key id 7675D05E
Re:Example: Standard Deviation by Ethanol-fueled · 2010-03-17 15:33 · Score: 2, Informative

s = sample standard deviation = sqrt((sum(x-xbar)^2)/(n-1)), where xbar is the mean
sigma = population standard deviation = sqrt((sum(x-mu)^2)/N), where mu is the mean
s is approximately equal to (highestValue-lowestValue)/4, range rule of thumb
Unusual values are outside +/- 2 standard deviations
Z = ((x-mu)/sigma) where Z is in terms of standard deviations.
Re:Example: Standard Deviation by Colonel+Korn · 2010-03-17 15:40 · Score: 1

Yeah because you never forget anything, right?
Well he should be able to at least say that it represents variability in a repeated measurement. He should at least know that.
Maybe he did know that but was stumbling because he was embarrassed for not knowing the formula.

--
"I zero-index my hamsters" - Willtor (147206)
Re:Example: Standard Deviation by Opportunist · 2010-03-17 15:43 · Score: 4, Interesting

Doctors are notoriously bad with statistics. But the real kings of bad statistics are psychiatrists.
Notice how a LOT of studies in psychiatry are essentially statistics, statistics and a bit of statistics? It might be the reason why a lot of the courses you have to pass to become a shrink also consist of a lot of statistics, statistics... you get the idea.
NOBODY who decides that his course of studies would be psychiatry decided for that because he enjoy statistics that much, though. Actually, most psych students struggle badly with statistics. Psychiatry is one of the fields where the label doesn't match the contents. It looks like you're going to do a lot of messing with people's minds (aka "solving their psychology problems") but actually, judging from the courses, you become a refined statistician who had a bit of a counceling tutoring on the side.
That's not what people become shrinks for, though. They want to sit in their office, put people on their couch (or, more modern, in a comfy chair) and get 100 bucks an hour for listening to some idiot whine. And most do just that and will do fine.
It gets bizarre when they somehow end up in a spot where they have to rely on their statistics. Hey, you got a masters in that, and that entails a buttload of statistics, so you can do it... Nobody really cares that 9 out of 10 that somehow managed to get their diploma by either learning what they absolutely needed (and forgot it right after the test, certain that they'd never need it again, because ... ya know, listening to idiots and stuff, not sitting there plotting standard deviations...) or by cribbing altogether.
And then you get studies of the usefulness of psychotropic drugs and wonder whose black hole they pulled that out of...

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Re:Example: Standard Deviation by JumpDrive · 2010-03-17 15:44 · Score: 3, Interesting

I agree with your concerns. Being a chemical engineer and a physical scientist, I have often found medical doctors understanding of chemistry and other sciences lacking. I once had an argument about chemical kinetics involved in a prescription drug I was taking, he basically told me I didn't know what I was talking about and blew me off. After another run in with him over another issue I fired him. But that's just one of my personal issues with a doctor.

Back when I was in graduate school me and my colleagues in graduate science taught pre-med chemistry and physics, which was a really watered down version of chemistry and physics which were taught to engineers and science majors. To be honest I thought it was kind of scary. All these years I was taught that medical student were supposed to be the best and the brightest, but we spoon fed them "baby chemistry" and "baby physics".

Since that time I have had many discussions with professors about this and they and I have come to the same conclusion, "the best and the brightest do not go into medical school". Thirty or forty years ago this may have been true, but economics has taken a turn and it just isn't the case anymore.

And why would they? They can make more money on Wall Street, they don't have to hassle with bureaucracy of health insurance, they don't have to hassle with lawyers, so why would the best and brightest go into medicine.

And you want to know what kind of income a hot little girl with a business degree can get. Pharmaceutical sales can pay 6 figures for one good figure. So the next time you see that good looking girl pulling that bag through your doctors office realize she is probably making a lot of money. More money than the average general practitioner .
Re:Example: Standard Deviation by cytoman · 2010-03-17 15:47 · Score: 4, Insightful

There are some things you should never be able to forget - the definitions and meanings of probability, mean, median, standard deviation and variance come to mind. You find yourself in situations everyday where you need to apply some of these things. Am I wrong about this? Do people forget basic definitions so easily?
Re:Example: Standard Deviation by Entrope · 2010-03-17 15:50 · Score: 1

Standard deviation and variance both characterize how widely distributed a sample is. When put on the spot (i.e. no reference materials allowed), could you explain the difference between the two without relying on the fact that one is the square root of the other? If not, is it really so unreasonable that a busy physician not want to recall and explain "standard deviation" to someone who may not have a good grip of the implicit context?
Re:Example: Standard Deviation by Capsaicin · 2010-03-17 15:54 · Score: 3, Insightful

Standard deviation is what you learn very early in school.
So early in fact that by you forget the details by the time you have had some serious study under your belt. Do you have any idea of the stuff you have to keep in your head to be an endocrinologist? So long as he remembers that it's a measure of variance (which he obviously does), it hardly matters whether he can explain to a mathematician how to derive it? And if OP gets off tripping up specialists with such minutae it ain't the specialist who has issues.
And you are telling me that it's not his "job" to know?
YMMV, but I would prefer to visit an endocrinologist who was an expert on the subject of hormones etc rather than stats.

--
Better to be despised for too anxious apprehensions, than ruined by too confident a security. --Edmund Burke
Re:Example: Standard Deviation by Ethanol-fueled · 2010-03-17 15:59 · Score: 1

When a student has never calculated a standard deviation by hand, how can they be expected to know what the heck a root mean square deviation from the sample mean really is?
It works both ways. If you're making them do calculations by hand when they get to linear regression[1] and Spearman's rank correlation coefficient(w/ties)[2], then you're a sadist and you'll turn a lot of young minds off to statistics.

[1] (n*Sigma*X*Y - (SigmaX)(SigmaY)) / (nSigmaX^2 - (SigmaX)^2)
[2] ((n*Sigma*x*y)-(SigmaX)(SigmaY))/(Sqrt(n(SigmaX^2)-(SigmaX)^2))*((Sqrt(n(SigmaY^2)-(SigmaY)^2))
Re:Example: Standard Deviation by Ethanol-fueled · 2010-03-17 16:05 · Score: 1

That's not what people become shrinks for, though. They want to sit in their office, put people on their couch (or, more modern, in a comfy chair) and get 100 bucks an hour for listening to some idiot whine.
No, shrinks don't even listen to whining, they just roll the pills. Listening to whining is what counselors are for, and they may cost extra depending on your insurance.
Re:Example: Standard Deviation by LingNoi · 2010-03-17 16:07 · Score: 1

And this was a endocrinologist - a specialist who no doubt took a lot of Biostatistics courses and such, and used a lot of statistics all through his education. And you are telling me that it's not his "job" to know? Wow!

All you said in your original post was "doctor". Now i'm starting to symphise with this doctor. You were probably being as non-specific as you're being here and now spinning this nonsense to make him look bad.
I have no idea what Endocrinology is so I looked it up. Seems to have a lot to do with hormones and how they effect the body. I fail to see how your criticism of his knowledge in statistics has anything to do with this doctors primary job which understanding how hormones effect the body.
Your attitude is a common problem I have found with slashdot users or programmers generally. If you don't understand every single detail of your job even though it has no impact on what you do then somehow you're a complete idiot that needs firing.
Re:Example: Standard Deviation by kevinadi · 2010-03-17 16:12 · Score: 1

Most people I know learn stat by using calculators and computers. Add that to the problem of people's fear toward mathematical formula with big sigma signs and big root squares, and you can be assured that no one would learn stat properly. I seem to get the impression (at least from people I know) that formulas with sigmas are "complex" and they just skip over it. This is a fundamental problem, and I had to explain to some very well educated people to see sigma signs as a "for" loop in computer programs. Then they think it's not so bad after all.
Stat depends on assumptions, and the assumptions must be stated prior to doing anything, otherwise the analysis in itself is useless. Most stat classes that I took used normal distribution as an assumption, and in many cases it just doesn't apply. The worst that I've seen is someone trying to use the 95% confidence interval (which is based on normal distribution assumption) on something that I know for certain is Laplacian distributed.
I think calculators and computers are the worst thing that can happen to statistics learning. It should not be used to learn stat, ever. They encourage people to be hasty, careless, and have the impression that stat is just a collection of magic formulas that gives whatever you want.
Re:Example: Standard Deviation by Jah-Wren+Ryel · 2010-03-17 16:13 · Score: 2, Interesting

And then you get studies of the usefulness of psychotropic drugs and wonder whose black hole they pulled that out of...
Indeed. Normally I would never cite an article in a McNews magazine like Time or Newsweek, but I found this explanation of the state of antidepressant drug efficacy to be one of the best I've run across so far - hundreds of billions of dollars all depending on some really, really bad math. Its like the collateralized debt securities of the drug & psychiatric industries:
http://www.newsweek.com/id/232781

--
When information is power, privacy is freedom.
Re:Example: Standard Deviation by crmarvin42 · 2010-03-17 16:44 · Score: 1

We are talking the most basic stuff that anyone with a degree in the sciences should know.
I'm sure this is going to come across as snarky, or as an attempt at being Jaded and thus cool, but I'm of the opinion that MD's and DVM's are not scientists as a general rule. I view them as biological mechanics. Their education in the nuts and bolts of the scientific method is practically non-existant in my experience as a Ph.D. student at a univeristy with a vet school, where I took more than a couple of classes. If the MD or DVM also has a Ph.D. or even an MS, then the story is usually different.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde
Re:Example: Standard Deviation by crmarvin42 · 2010-03-17 16:58 · Score: 3, Insightful

I am in the life sciences (ie not a computer programmer).

If MD's are reading medical journals and interpreting their results, which they all are expected to do (especially those with a Board Certified Specialty like Endocrinology) then there is no excuse for them to have forgotten what what the standard deviation is a measure of. They should be using the variance estimates provided in a data table to interpret the results it contains every time they read an article. If not, then they aren't worth the exorbinant fee's they are charging, because critical thinking is part of a physicians job description, and accepting whatever gets publish in the New England Journal of Medicine at face value is not.

I can accept forgetting the equation, but there is NO EXCUSE for forgetting that SD is a measure of varition (along with SEM, SED, and CV) as opposed to a measure of central tendancy (mean, mode, median). That is something they teach you in the first week of a statistics course, and is used every subsequent class because it is so fundamental to the interpretation of statistics. If I were cytoman, I'd be looking for a new Endocrinologist.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde
Re:Example: Standard Deviation by DarkSarin · 2010-03-17 16:58 · Score: 1

I'm gonna go out on a limb and say this:
if the doctor isn't an expert in statistics, how can he properly evaluate the effectiveness of the medicine he's handing you?
Can he say (with any really justified confidence) that it will actually help? Can he claim to know how the true probability that you have disease X when manifesting symptoms y & z? Does he understand the effect of false positives?
Without a solid grounding in stats, the answers are sadly, no he cannot make justified statements about the effectiveness of the medicine (sorry, relying on the pamphlets handed out by big pharma to tell him means taht i want to go elsewhere for my medical treatment).
My signature has NEVER been more appropriate.
MANY people using stats have NO CLUE what they are doing, but they do it VERY carefully.
I hate it when I see people double-dip on data sets....
For my thesis, approximately 5% of my correlations were statistically significant, and even THOSE were WEAK. I told my committee that I wasn't confident of any of the relationships.
Yes, I expect my doc to know stats. The THEORY behind them is just as important as the math.

--
"We don't know what we are doing, but we are doing it very carefully,..." Wherry, R.J. Personnel Psychology (1995)
Re:Example: Standard Deviation by rve · 2010-03-17 17:05 · Score: 3, Informative

You're mixing up psychiatrists, psychologists and psychotherapists.
A psychiatrist went to med school, got a doctors degree and specialized in problems with the brain. A psychologist went to university to learn the study of behavior of people. This involves a lot of statistics and many of them probably do consider it something they didn't go to college for, but it's a study that is supposed to follow the scientific method and prepare students for doing research, not therapy.
A psychotherapist is anyone who feels like calling themselves that. As a preparation they may have studied psychology at university, or they may have spent 20 years meditating in the Himalayas, or followed a short course at a religious group such as an institute of multiple personality disorder therapists or scientology.
Re:Example: Standard Deviation by Ethanol-fueled · 2010-03-17 17:07 · Score: 1

should not have a standard deviation of more than 1/3rd of the average blood sugar reading.
Additionally, it's unclear what was meant. Did he mean that the blood-sugar reading should not be outside +/- .33% of the population mean? Did he mean that his score should not be outside +/- .33% of the population standard deviation? Either the poster or his doctor has been talking out his ass.
Re:Example: Standard Deviation by Tromad · 2010-03-17 17:07 · Score: 1

I was in school to become a psychiatrist when I noticed the same thing and dodged a bullet. And it wasn't even the fact that everything was stats, it was also that all the stats were terrible. There is also a huge discrepancy between statistical significance and effectiveness, which the entire industry seems to not understand.
Re:Example: Standard Deviation by ShakaUVM · 2010-03-17 17:48 · Score: 1

>>There is also a huge discrepancy between statistical significance and effectiveness, which the entire industry seems to not understand.
Or that the severity of atypicals is often worse than the problems it solves.
Not always, but often enough that people will refuse to take their anti-psych meds.
Re:Example: Standard Deviation by Opportunist · 2010-03-17 18:09 · Score: 1

Well, you get what you pay for. I paid for mine, and it took him roughly 20 sessions until he said something along the lines of "Uhhuh... I think we are onto something here, these might help, if you want to give 'em a try..."
Well, they didn't help. But he certainly did.

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Re:Example: Standard Deviation by Opportunist · 2010-03-17 18:13 · Score: 1

Oh, the industry understands it just fine. They also understand that claiming something is "effective in treating X" means that they can make a fortune with that something.
That is, if the distance between cost to produce and price is big enough. If it isn't, it will certainly be not effective at treating anything.

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.
Re:Example: Standard Deviation by rve · 2010-03-17 18:21 · Score: 1

And then you get studies of the usefulness of psychotropic drugs and wonder whose black hole they pulled that out of...
Indeed. Normally I would never cite an article in a McNews magazine like Time or Newsweek, but I found this explanation of the state of antidepressant drug efficacy to be one of the best I've run across so far - hundreds of billions of dollars all depending on some really, really bad math. Its like the collateralized debt securities of the drug & psychiatric industries:
http://www.newsweek.com/id/232781
Has there been a similar study comparing various kinds of psychotherapy to a placebo? For example comparing the effects of a priest, a witch doctor and a psychotherapist?
Re:Example: Standard Deviation by hazem · 2010-03-17 18:44 · Score: 4, Interesting

There are some things you should never be able to forge.... Do people forget basic definitions so easily?
Given a couple years with little contact with people who speak your native language, you'll actually begin to forget that very language you have lived speaking all your life. So it doesn't surprise me at all that people would forget basic definitions if they don't actually think about those definitions very often.
I figure if you can forget your native language then pretty much all bets are off for the stuff you've known for a lot less time and used a much smaller percentage of your thinking life.
Re:Example: Standard Deviation by OrangeCatholic · 2010-03-17 19:22 · Score: 1

Stats is like engineering, in that it relies upon discerning reality from fantasy. The math helps, but it's a lot of hard work to apply it, probably more work than most people can bother with.
I took a stats class in college. I stayed about 3 days past the end of the semester to finish up my stats paper, and I was flabbergasted at the level of precision that could be obtained by looking at the data and seeing what it's telling you, rather than what you want to see.
In the end, writing the paper was intellectually easy because it wrote itself. But there was a tremendous amount of raw typing (i.e. analysis) required to get to the point where it was free from bias. The (relatively basic) statistical methods we used were tremendously powerful, but God knows my teammates couldn't do it, and they were B.S. candidates. The paper I turned in was about 15 pages longer than what we started with.
Re:Example: Standard Deviation by goose-incarnated · 2010-03-17 19:37 · Score: 1

You are missing the point - he did not know what a standard deviation means! That is unforgivable for anyone with a medical degree...hell, it's unforgivable for anyone who has passed a course in statistics in school.
Why? My g/friend's a doctor (surgeon) and neither she nor all her friends know what a standard deviation is. When I first realised this, I asked her about the curriculum at med-school, and yup.... it does not include stats :-/ which makes all those "studies" published in medical journals a little suspect.

--
I'm a minority race. Save your vitriol for white people.
Re:Example: Standard Deviation by OrangeCatholic · 2010-03-17 19:41 · Score: 1

It's hard to tell, since he said the "blood sugar results should be 1/3 of a standard deviation." Does that mean one result, vs the overall population? In that case, 1/3 SD would seem to mean unusually average. Or are we talking about multiple results over time? In that case, 1/3 SD makes more sense, since it implies stable blood sugar.
Re:Example: Standard Deviation by fsterman · 2010-03-17 19:42 · Score: 2, Insightful

Except, if you had read this story, you would have found that the antidepressant = placebo story to be incorrect due to poor statistical reasoning:
"Another concern is the common strategy of combining results from many trials into a single “meta-analysis,” a study of studies. In a single trial with relatively few participants, statistical tests may not detect small but real and possibly important effects. In principle, combining smaller studies to create a larger sample would allow the tests to detect such small effects. But statistical techniques for doing so are valid only if certain criteria are met. For one thing, all the studies conducted on the drug must be included — published and unpublished. And all the studies should have been performed in a similar way, using the same protocols, definitions, types of patients and doses. When combining studies with differences, it is necessary first to show that those differences would not affect the analysis, Goodman notes, but that seldom happens. “That’s not a formal part of most meta-analyses,” he says.
Meta-analyses have produced many controversial conclusions. Common claims that antidepressants work no better than placebos, for example, are based on meta-analyses that do not conform to the criteria that would confer validity. "

--
Is there anything better than clicking through Microsoft ads on Slashdot?
Re:Example: Standard Deviation by cycoj · 2010-03-17 19:50 · Score: 2

I think you don't have a clue how doctors work. Do you really think doctors evaluate the effectiveness of a medicine by reading through scientific articles and possible even recalculating the results? No, they follow guidelines which are written by other medical people who's job is evaluating this sort of stuff. That is a good thing, because the people who make the guidelines actually do know their stuff and I trust them do to the statistics more than some doctor. The doctor's job is to know what tests to run, what symptoms to look for etc. Not to do a statistical analysis about the likelihood that you have disease X or Y.
Re:Example: Standard Deviation by fsterman · 2010-03-17 19:52 · Score: 1

Odd how we have used double blind studies and statistics to tell us which treatments work the best, which is how we pulled ourselves out of the Freudian "They want to sit in their office, put people on their couch (or, more modern, in a comfy chair) and get 100 bucks an hour for listening to some idiot whine."
Oh, and most of us have to get a masters or PhD, so shove your "struggle with statistics" right where you got the rest of your information.

--
Is there anything better than clicking through Microsoft ads on Slashdot?
Re:Example: Standard Deviation by cycoj · 2010-03-17 20:07 · Score: 2, Insightful

Have you ever looked at the sheer amount of knowledge that doctors have to know (and actually do know)? Yes they are learning baby physics and baby chemistry. We have physicists and chemists to do the non-baby physics and chemistry. You could also say the same thing for other sciences. I know friends who've taught physics to chemistry students and that was baby-physics which the chemists struggled to understand. Similarly I've had to learn chemistry for my physics degree and have pretty much forgotten almost anything about it, that didn't prevent me from getting a PhD in physics.
I wonder how much of the physics or chemistry out of your field of expertise you still remember.
Back to the topic of doctors, a lot of the stuff that doctors do is purely knowing things, but they need to do a lot of it. They don't necessarily know exactly how a drug works, they just know when to give that drug. So a lot of their work could be done with a very big flowchart, except for the fact that quite a lot is actually observation not just what you tell them.
Re:Example: Standard Deviation by treeves · 2010-03-17 20:29 · Score: 1

It was pretty clear to me that the doctor said that the SD should be no more than 1/3 of the mean, e.g. if the mean was 100, the SD should be /= 33. Where such a rule came from I don't know, but the idea is that you don't want your blood sugar to fluctuate a lot. To me, a std deviation of 1/3 of the mean seems like a lot of variation since the readings can easily go up or down 2 std deviations (~95% of the readings will fall between +/- 2SD for normally distributed data) so could range from 33 to 167, a factor of about five, which seems a lot to me.
That said, the doctor should have been able to say its a measure of the spread or variability of the data.
A doctor should also be able to explain (at least in a basic way, no pun intended) what pH is.
And what a partial pressure is.
And a lot of other things not specific to his precise field of medicine.
That's why medical school is so demanding, and one reason why they "get paid the big bucks."
I was probably expecting too much when I commented on another /. story on the same topic a while back that doctors should understand Bayesian statistics, so as to be able to understand the expected likelihood that I have X, given a particular test result, the frequency of X in the population, and the rate of detection and false positives by the test. Most doctors can't estimate that very well either.

--
...the future crusty old bastards are already drinking the Kool-Aid.
Re:Example: Standard Deviation by geoffhall · 2010-03-17 20:41 · Score: 1

My doctor was explaining to me that my blood sugar readings should not have a standard deviation of more than 1/3rd of the average blood sugar reading. Just to test if he knew what it meant, I asked him what a standard deviation was. Oh the fun when he tried to bullshit his way out of that one! He eventually told me that when I plot my data in Excel I can ask it to give me statistics on the column and it would mention what the standard deviation value was. But when I pressed on and asked him what a standard deviation is, he shooed me off and told me to go look it up. Never did he confess that he had no clue.
I think you a being silly. The core of your doctor-patient relationship should be you trust he has the knowledge and judgement to treat your condition in a professional and considered manner. If he does not understand the statistics, it should not be a concern as many primary practitioners are not specialists, especially arcane things like deriving standard deviation. Doctors are typically foremost clinicians, not chemists, mathematicians, grammar freaks, computer geeks etc... If you are not comfortable with your doctor, find another. Anyway, doctors should follow sound scientific principals in their treatment but should NOT treat on the basis of numbers alone. It is enough other trusted and respected specialists distil the relevant information and say "this is best practice in this situation". This is usually called CME (continuing medical education). Access to literature on the internet should allow you read up on whether the regimen you are on is reasonable or not. Be aware, you WILL find published articles FOR and AGAINST any kind of treatment in medical journals and the REAL skill is knowing the context and the clinical biases that can occur and whether a particular therapy is appropriate. Diseases or conditions rarely exist in total isolation or as a single entity. Doctors have enough on their daily plate without other concerns in other areas. We TRUST nurses will take accurate readings of temp, BP etc, we trust pharmacists will dispense correct dosage, we trust OT to sterilise equipment adequately and we trust EXCEL to do SD. So???
Re:Example: Standard Deviation by jpmorgan · 2010-03-17 20:50 · Score: 3, Insightful

There's a reason why you keep getting modded up and those disagreeing with you keep getting modded down.
You're exactly right. Modern diagnostic medicine is predicated on interpreting statistical studies to make diagnoses. It is practically incompetence for a practicing medical doctor to not know what standard deviation means.
Re:Example: Standard Deviation by ShakaUVM · 2010-03-17 21:16 · Score: 1

I agree with your concerns. Being a chemical engineer and a physical scientist, I have often found medical doctors understanding of chemistry and other sciences lacking. I once had an argument about chemical kinetics involved in a prescription drug I was taking, he basically told me I didn't know what I was talking about and blew me off. After another run in with him over another issue I fired him. But that's just one of my personal issues with a doctor.
That's because you're talking to a doctor when you should be talking to a pharmacist. Preferably a clinical pharmacist, if it's important. The reason hospitals hire clinical pharmacists is because doctors know just enough about drugs to be able to prescribe them correctly, but when they get into trouble (try dosing a renally insufficient diabetic kid some time) they call in the big guns for advice.
Clinical pharmacists are more common on the west coast than the east coast (having originated in large part at UC San Francisco), but most major hospitals should have them if you have a question.
You could always pose a question to your community pharmacist or the person shoving drugs at you through the window at the hospital, but depending on how long they've been out of college, they might have forgotten their PK and physical chemistry and such.
Re:Example: Standard Deviation by Ihlosi · 2010-03-17 21:35 · Score: 1

That's not what people become shrinks for, though. They want to sit in their office, put people on their couch (or, more modern, in a comfy chair) and get 100 bucks an hour for listening to some idiot whine. And most do just that and will do fine.
You're talking about psychologists, not psychiatrists. Psychologists are the ones who are highly paid for listening to you, psychiatrists will fill you up with risperdal, prozac, ritalin or whatever.
Re:Example: Standard Deviation by Anonymous Coward · 2010-03-17 21:37 · Score: 1, Informative

Are you referring to PSYCHOLOGY or PSYCHIATRY (or both?).
The former do a lot more stats than the latter during their training. Psychology students in Australia (me) do at least 4 years of statistics (and often 6) before working in the field. Of course we all have a love-hate relationship with stats, but there is no way to get through the course(s) without working hard and actually learning the material.
Psychiatry students essentially study medicine (and do intermediate stats), and then specialise afterward. They require a completely different skill-set to Psychologists, and have different training.
Re:Example: Standard Deviation by u38cg · 2010-03-17 22:08 · Score: 1

Indeed. A particular irk of mine is that people often quote a standard deviation when talking about some variable that is not normally distributed, and then make probabilistic claims based on the cumulative distribution of the normal!

--
[FUCK BETA]
Re:Example: Standard Deviation by Trax · 2010-03-17 22:31 · Score: 1

As a medical student, we are taught chemistry and biochemistry (not so much physics) including chemical kinetics in both our undergraduate training and in medical school. Your doctor either forgot the basics since he hasn't put them to use or is a dolt.
As one of the other posters mentioned, we get sporadic lectures on statistics and often it is in 60 minutes or less. Unfortunately, there are more important things that we need to concentrate on and stats just falls on the backburner of "things to do".
Nonetheless, statistics is useful to glean important information from medical journal articles and justifying whether a particular study is correct or incorrect in its assumptions and conclusions and by association how you can better treat and manage patients.
Re:Example: Standard Deviation by TheTurtlesMoves · 2010-03-17 22:53 · Score: 1

It's not his job to know..
Yes it is. Especially if he/she make some medical recommendation based on it.

--
The Grey Goo disaster happened 3 billion years ago. This rock is covered in self replicating machines!
Re:Example: Standard Deviation by Chrisq · 2010-03-17 23:01 · Score: 1

Do you really want "there's a 80% chance that you have a cold, but a 7% chance this is flue, a 0.1% chance that it is early stage HIV infection, and a 0.01% chance it is lung cancer". This is based on the assumption that your chances of having these diseases is in line with the general population.
Re:Example: Standard Deviation by demonlapin · 2010-03-17 23:06 · Score: 1

Not being able to define it with the same precision as someone who works daily in the field is not the same as having no clue what it is, but enjoy your word games, and next time hire a statistician to analyze your blood sugar. He'll be able to tell you all about your statistics, but unless he's diabetic himself, probably won't have a clue what to do about it.

I've read your downthread comments, and FWIW biostats in medical school is usually a course involving 36 hours, tops, of instruction. Most physicians do not need to understand statistics in order to do their jobs, any more than they need to understand how lab equipment generates its results. I get statistics better than 90% of physicians, and I'd wilt like a flower in front of a first-year PhD student of the subject. Why? Because I don't do it all day. You can, and do, forget over time.

The ignorance of physicians on statistics is mostly rational ignorance - they've realized that most studies are crap, so they ignore anything that doesn't have lots of patients. It's only indefensible when they go around trying to spout off like they know what they're talking about - and that is, mercifully, rare.
Re:Example: Standard Deviation by demonlapin · 2010-03-17 23:08 · Score: 2, Informative

Med schools hire statisticians for this. Read the thank-yous. Even small studies will thank the biostatistician. The biostatistician will be an author on a major study.
Re:Example: Standard Deviation by demonlapin · 2010-03-17 23:12 · Score: 1

Can't speak for DVMs, but MDs mostly do not consider themselves scientists and will freely admit it. (Those that do, generally are - they have a PhD, or a research-heavy background.) It's biological engineering.
Re:Example: Standard Deviation by Tim+C · 2010-03-17 23:20 · Score: 1

Forgetting exactly how to calculate a standard deviation is one thing, but forgetting what it actually represents is entirely another, and would certainly shake my confidence in the person's ability to interpret the results they're getting out of their tools.

--
It's official. Most of you are morons.
Re:Example: Standard Deviation by colonelquesadilla · 2010-03-18 00:14 · Score: 1

You are way less cynical than me, I had the impression they prescribed the newest most expensive drug the hot pharm-rep chick came by to plug. That being said efficacy studies are regulated by the FDA, and I do feel relatively confident that most of the drugs on the market actually work. Just sometimes I wish I wouldn't have to argue to get the older, just as effective, drug that has a generic for 5 dollars.

--
It's either false dichotomies, or the terrorists win, you decide.
Re:Example: Standard Deviation by colonelquesadilla · 2010-03-18 00:15 · Score: 1

I sincerely doubt it is a flue.

--
It's either false dichotomies, or the terrorists win, you decide.
Re:Example: Standard Deviation by necro81 · 2010-03-18 01:00 · Score: 1

It's not his job to know
I disagree. If your doctor is telling you to make sure the stddev of blood sugar readings should be within a certain range, the doctor should know what the meaning of standard deviation is. If the doctor doesn't know, then how can that doctor make any sensible statement, diagnosis, or recommendation when the patient comes back with blood sugar levels that have a large standard deviation. How would that doctor know if it's a problem? How much of a problem is it? Are there physiological reasons to expect a standard deviation that large? Are there reasons, other than patient compliance, why it can't be smaller? If a patient has one particularly high reading, how might that effect the standard deviation and, more importantly, is that one reading clinically significant?

It may not be the doctor's job to know how to write a program to calculate the standard deviation of a dataset - that would just be reinventing the wheel - or even to express it in a compact mathematical form. But you'd better believe that the doctor should know what it is, where it comes from, and what it means.

To use an analogy, when Han Solo says that Millenium Falcon made the Kessel run in less than twelve parsecs, the only way you'd know if he's genuinely boasting or just bullshitting you is if he can explain why measuring the Kessel run in a unit of distance is better than a unit of time.

Another analogy: overclockers and computing enthusiasts put much stock in benchmark tests. But if you don't know what the test is actually testing, and what factors influence it, then all you are doing in the end is quoting some number from the magic box. You may not be expected to code your own benchmark test, but if you don't know its background, how can you expected to know how to improve the numbers? Do you need more RAM, a faster CPU, or a different motherboard?
Re:Example: Standard Deviation by tibit · 2010-03-18 01:43 · Score: 3, Insightful

I believe that forgetting something usually means you never really understood it. I don't think that if you really understand something, you'll ever forget it. There are things I do rather rarely yet I don't forget them because I understand them and could re-derive them from first principles.
Usually if I forget something, it means I never quite understood it in the first place.
I think that real understanding implies almost indefinite retention, and lack of retention can be usually be explained by lack of understanding.
It's very easy to forget something if all you know about it is a memorized definition and an equation and two.
If you use statistical terms and concepts daily, you should be able to explain them after being woken up in the middle of your sleep, after several hours of partying with lots of booze. Anything less probably means you're acting things out rather than understanding them.
Feynman often talked about the issue of real understanding. One could summarize his view thusly: if you cannot explain it to a non-specialist of decent intelligence, you probably don't understand it.

--
A successful API design takes a mixture of software design and pedagogy.
Re:Example: Standard Deviation by Entrope · 2010-03-18 01:52 · Score: 1

You've got that backwards. Inability to "detect small but real and possibly important effects" is a problem inherent to trials with few data points, not of meta-analysis.
Problems with meta-analysis include studies that don't use the same definitions (labels) for the characteristics being studied; studies without enough overlap in the subgroups they examine; determining the acceptance criteria for hypotheses (especially if you are testing more than one hypothesis in the meta-analysis); confirmation bias (studies with negative results often don't get published); and probably some others that I, as a humble software engineer, don't know.
If you want an example of how haphazard grouping of results can hide effects that are pronounced when you look at the right subgroups, read up on Simpson's paradox.
Re:Example: Standard Deviation by Gorath99 · 2010-03-18 02:30 · Score: 1

There are some things you should never be able to forget - the definitions and meanings of probability, mean, median, standard deviation and variance come to mind. You find yourself in situations everyday where you need to apply some of these things. Am I wrong about this? Do people forget basic definitions so easily?
Yes and yes. I can't read music either, even though that's another thing I learned in high school. Turns out that if I don't use it, I can forget pretty much any skill. Particularly if that skill was learned over a couple of weeks, and I never had to revisit it.
Do you really regularly use standard deviation outside of a professional context? What for, if I may ask? The fact that most people have never even heard of the concept suggests that it's really not all that critical.
(That's not to say that it isn't ridiculous that a doctor who bases his diagnoses on the concept doesn't have a solid grasp on it.)
Re:Example: Standard Deviation by ColdWetDog · 2010-03-18 04:09 · Score: 1

Then be afraid, be very afraid. Having attempted to teach residents the rudiments of basic statistics for a number of years, I would hazard an educated guess that the number of practicing physicians that could extemporaneously come up with a reasonable definition of 'standard deviation' is rather low. In fact, I just might test this at our next medical staff meeting. Pop Quiz!

--
Faster! Faster! Faster would be better!
Re:Example: Standard Deviation by ColdWetDog · 2010-03-18 04:16 · Score: 1

YMMV, but I would prefer to visit an endocrinologist who was an expert on the subject of hormones etc rather than stats.
A problem with this (common) scenario is that said statistics naive endocrinologist is at a great disadvantage when he / she goes home and reads the latest literature. Unfortunately, the TFA has a good point that much of the current medical literature gets statistics wrong - especially the new fad of meta analysis whereby groups of different studies are ground together in an attempt to troll for answers.

Unfortunately, the level of statistics needed to really understand these sorts of reviews are quite beyond the vast majority of physicians and so one is relegated to believing the interpretation of the authors (who, in their defense, do typically have statisticians on the paper). The fact that they still get it wrong is perplexing and very uncomfortable.

--
Faster! Faster! Faster would be better!
Re:Example: Standard Deviation by tignet · 2010-03-18 06:01 · Score: 1

You're mixing up psychiatrists, psychologists and psychotherapists. A psychiatrist went to med school, got a doctors degree and specialized in problems with the brain...
You're mixing up psychiatrists, psychologists and psychotherapists. Think of a psychologists as a general term, like "doctor," for which there are many specializations. A psychiatrist is a classification of psychologist, one of 56 defined by the APA.

One of the differences between a psychotherapist and a psychiatrist, as you note, is the medical degree and therefore, the ability to write prescriptions.

A marriage counselor could certainly have a Ph. D. but not have a medical degree. This falls under the umbrella of psychotherapist (or further subcategorized as a clinical psychologist). It also falls into your definition of anyone "who feels like calling themselves that." Boy, will those doctors have egg on their face once they realize they've earned a meaningless title!
Re:Example: Standard Deviation by rve · 2010-03-18 06:29 · Score: 1

A marriage counselor could certainly have a Ph. D. but not have a medical degree. This falls under the umbrella of psychotherapist (or further subcategorized as a clinical psychologist). It also falls into your definition of anyone "who feels like calling themselves that." Boy, will those doctors have egg on their face once they realize they've earned a meaningless title!
Certainly, but crackpot therapists like psychoanalysts, regression therapists, multiple personality therapists etc. also call themselves pshychotherapists.
Where I studied, psychiatrists only came from med school, not from the psychology department. There are similarities between the two, but the aim is different: treating patients vs. studying behavior. Maybe the definitions are different where you grew up?
Re:Example: Standard Deviation by tignet · 2010-03-18 07:17 · Score: 1

Certainly, but crackpot therapists like psychoanalysts, regression therapists, multiple personality therapists etc. also call themselves pshychotherapists.
Where I studied, psychiatrists only came from med school, not from the psychology department. There are similarities between the two, but the aim is different: treating patients vs. studying behavior. Maybe the definitions are different where you grew up?
You're absolutely correct, psychiatrists always have med school training. And after med school, they then undergo a significant amount of training in mental health. As you mention, the aim is to treat patients (applied psychology).

And, psychologists do generally take a different training route: getting a Ph. D. in clinical or counseling psychology, followed by an internship. But while psychologists can be research-oriented, they can also be people-oriented. a neuropsychologist, for example, is a psychologist that certainly doesn't study behavior.

I've heard it said that one difference between psychologists and psychiatrists is the medical degree; but some psychologists certainly do have medical degrees. I'd be more inclined to say that a psychiatrist deals with mental illnesses, whereas a psychologist deals with emotional ones. There's certainly room to argue, but it's an approximation.

Anyway, a clinical psychologist might be more likely to tell a severely depressed person, "Tell your wife that you are gay; it's time to come out of the closet." By using psychological methods to solve problems, that by definition makes them psychotherapists (practicing psychoanalysis).

I'll agree that psychotherapists can certainly be crackpots, especially since there isn't regulation (at least in the United States) needed to call yourself one. However, the idea that all psychotherapists are crackpots, or that all psychologists are research-oriented is incorrect.

I'm not so sure our definitions are different, but perhaps our perspective is.
Re:Example: Standard Deviation by panderso · 2010-03-18 09:06 · Score: 1

Cytoman -- I have to disagree with your assertion that your doctor didn't know what a standard deviation was. Your own account of the original story belies what you are saying. You said he wanted your blood sugar within 1 standard deviation of the mean. He had the essence of it when he told you that meant that the test value was within the first 1/3 of the values off the mean. As you know, the 1st standard deviation has approximately 1/3 of the values above the mean and 1/3 of the values below the mean. So what did you really want him to say. Did you want him to write you a formula?
Your endocrinologist just wanted to make sure your blood sugar wasn't too high. You wanted to talk about standard deviations.
He probably wanted to talk about making sure your diet was OK and your medications were at the right dose etc. Weren't those the things that were supposed to really matter at your appointment?
Re:Example: Standard Deviation by panderso · 2010-03-18 09:31 · Score: 1

I wish there were more patients like you who actually wanted generics. The way I figure, the more [insurance company] money is wasted on a more expensive drug that does the same thing as a cheaper generic drug, the less money is available to treat another patient (whose care then got denied by the insurer); or the sooner that insurer will raise rates and cause another employer to drop insurance coverage for their "team members". Unfortunately, for a large of patients, the only question they have is "Will my insurance cover it?", and if the answer is yes, then they don't care about the cost.
Re:Example: Standard Deviation by Unkyjar · 2010-03-18 11:14 · Score: 1

The question is, why would a Doctor (when talking to a patient about how to interpret one's blood sugar reading) use a term he doesn't properly understand? Doctors, when operating in their professional capacity, should know exactly what they're talking about. Why? Because when they smudge it people can end up dead.
Re:Example: Standard Deviation by JumpDrive · 2010-03-18 14:07 · Score: 1

You should remember that Engineers are just English majors who couldn't hack it.
Re:Example: Standard Deviation by JumpDrive · 2010-03-18 14:10 · Score: 1

Oh and I actually made it through graduate school, maybe I should tell UT I want my money back.
Re:Example: Standard Deviation by JumpDrive · 2010-03-18 14:26 · Score: 1

The doctor was describing to a colleague how he was measuring a byproduct of the medication he was giving me. I had read up on the drug and realized that he was measuring one of the byproducts of the chemical reaction occurring in my blood. I think he thought it a little insulting that I a mere mortal thought there was a simple explanation for what was going on and it wasn't really magic. So I really didn't have a need for a pharmacist, although that is usually my first source when I have a question about what it going on with a medication.
Re:Example: Standard Deviation by politovski · 2010-03-18 15:42 · Score: 1

well, not all of us are inept at the physical sciences. and actual m.d.'s are mostly the folks who were always top 5th percentile in everything. but, as for drug kinetics, unfortunately, that rarely if ever predicts how people really respond to a medication. and, most of us really don't understand statistics at all. and as for pharma reps, well, they employ more ex-college cheerleaders than any other industry. and god bless them for that...
Re:Example: Standard Deviation by ShakaUVM · 2010-03-18 18:13 · Score: 1

>>So I really didn't have a need for a pharmacist, although that is usually my first source when I have a question about what it going on with a medication.
Fair enough!
Most people have never heard of clinical pharmacists, and since I'm friends with several of them, I thought I'd get the word out. They're pretty bad ass when it comes to drugs.
Re:Example: Standard Deviation by AK+Marc · 2010-03-19 07:41 · Score: 1

Where I studied, psychiatrists only came from med school, not from the psychology department. There are similarities between the two, but the aim is different: treating patients vs. studying behavior. Maybe the definitions are different where you grew up?

Psychiatrists are MDs who are also psychotherapists. They get to prescribe medicines. Many psychologists become clinical psychologists, which are the same without the ability to prescribe drugs (and because of that, generally with an attitude that you can cure more things with talk and less with drugs). And anyone can become a "therapist." But my impression is that more people are treated by clinical psychologists because they are cheaper and more plentiful than psychiatrists.

--
Learn to love Alaska
Re:Example: Standard Deviation by AK+Marc · 2010-03-19 08:05 · Score: 1

I know friends who've taught physics to chemistry students and that was baby-physics which the chemists struggled to understand.

Then fail them out. It would be like if you taught baby calculus to physicists and they couldn't do it. You fail them and move on. They'll get another major or they'll take it until they get it. Chemistry is applied physics. You have to be a good physicist to understand electrons, and you have to understand electrons to understand chemistry. Though, you don't have to understand physics to be a chemical engineer, that's more applied chemistry.

Similarly I've had to learn chemistry for my physics degree and have pretty much forgotten almost anything about it, that didn't prevent me from getting a PhD in physics.

Electrical engineers have to learn the basics in mechanical engineering (trusses for sure, and maybe fluid dynamics), and if they are just doing electrical drawings or chip design, then they'll never use it, not anything related to it. But an doctor uses chemistry every time a prescription is written. If they don't understand it when they are prescribing it, there is a very real increase in the chances they will harm their patient.

I wonder how much of the physics or chemistry out of your field of expertise you still remember.

A crapload more than those that never took them. I worked for an ISP that started doing satellite work. I'm the only one at work who took physics, so I "got" the satellite part much faster and became the satellite guy. It may not have come up directly in class, but it made a difference in what I understood and could calculate on day 1, and seemed to carry over into what I was able do 4 years later, where everyone in the department with a question on orbital mechanics or such would still come see me.

--
Learn to love Alaska
Re:Example: Standard Deviation by AK+Marc · 2010-03-19 08:25 · Score: 1

So early in fact that by you forget the details by the time you have had some serious study under your belt.

Then the complaint is: "Because doctors are unable to admit error or ignorance, they will proceed without proper thought because to stop and ask for help would reveal their fallibility." Either he did know what it meant and couldn't express it because it's now just a foggy idea - "it's the average of how far things are from the mean" which could be considered correct or wrong, depending on the attitude of the listener so he didn't want to voice it, or he honestly didn't know, in which case he's dangerous.

Why is a doctor dangerous that doesn't know the meaning? Because he's making life or death decisions without understanding of the probability of any particular consequence. "High" or "low" isn't good enough. The difference between "low" of 1% (which many would consider a high probability, if the result is death) and 0.00001% is less than a 1% difference, so he isn't off by much. But then it's also a difference of 5 orders of magnitude, which is huge. If he can't understand and articulate this to patients, how can they be involved in their treatment?

YMMV, but I would prefer to visit an endocrinologist who was an expert on the subject of hormones etc rather than stats.

I'll go to the one that understands the difference between something 1% effective and 99% effective, after all, that's just stats...

--
Learn to love Alaska
Re:Example: Standard Deviation by AK+Marc · 2010-03-19 20:43 · Score: 1

The doctor's job is to know what tests to run, what symptoms to look for etc. Not to do a statistical analysis about the likelihood that you have disease X or Y.

How can he treat me if he doesn't know what I have? I hate to turn to fiction, but if you look at an early episode of House, that's what doctors do. They look at the symptoms, decide what fits those symptoms, then when more than one thing fits, they use statistics to determine the most likely. If there's a problem treating one (say one you give them aspirin and the other you transplant a kidney), you consider the statistics of the outcomes of the treatments for whether you might want to treat a lower probability disease first because the treatment for the higher one is damaging. If you can't do statistics, you can't do the job. Every disease isn't a perfect match, it's a likelihood of having it, and you gauge that with the probability of complications with treatments to figure out the best course.

If you just look for symptoms and ignore the statistics, you will give worse care. I wouldn't want you for a doctor.

--
Learn to love Alaska
Re:Example: Standard Deviation by AK+Marc · 2010-03-19 20:49 · Score: 1

Well, yes. If it presents like it could be HIV, even at 0.1%, then he should bring that up. And for lung cancer, he should have a history. If I don't smoke and none of my family does, that number will be small enough to be not mentioned. If I'm a smoker, then he should bring that up as well.

The problem with the US is that if the doctor tells you more than the bare minimum, then the patients get dumber. I don't know why, but if you say "it could be X or Y" then the patients assume the doctor is dumb. Rather than working with the doctor to get to the appropriate conclusion as fast as possible and start treatment (if any), people start working against them, second guessing them, and only listening to what they want to hear and ignoring everything else.

So should a doctor say all that every time? Probably not, at least until patients get some common sense. But do *I* want to know all that? Yes.

--
Learn to love Alaska

Re:Long winded troll by Anonymous Coward · 2010-03-17 14:47 · Score: 1, Insightful

Statistics is terrible for proving things, but rather good at disproving them.

Re:Maths anxiety by Nefarious+Wheel · 2010-03-17 14:50 · Score: 4, Informative

How to Lie with Statistics by Darrell Huff. Recommended reading.

--
Do not mock my vision of impractical footwear

It's a tough situation by robbyjo · 2010-03-17 14:50 · Score: 1

Actually, it's a tough situation. There is no real life experimental data can 100% fit the assumptions of commonly used statistical models. Real life data is messy. There is some degree of simplification. In addition, resorting to whiz-bang fancy methods that "fit" the real data may not be easily interpretable. Ease of result interpretability is what medical scientists want. There are other issues as well, such as computing time, equations derivability, etc.

In addition, many many medical scientists use statistics as a tool to filter things (e.g. candidate genes, target enzymes, treatments, etc). In this case, 100% accuracy is not really important. Once the scientists narrow down the genes, they can test the validity directly in either test animals or real people.

--

--
Error 500: Internal sig error

Re:It's a tough situation by crmarvin42 · 2010-03-17 17:13 · Score: 1

There is no real life experimental data can 100% fit the assumptions of commonly used statistical models. Real life data is messy.

Hence, "All models are wrong, but some models are useful" that every stat's professor has on at least one slide the first day of any experimental design class. I gave a presentation a couple of days ago at a scientific meeting in which I harped on the practice of blindly accepting a model as being appropriate in the face of evidence to the contrary simply becuase "We've always done it that way" and "It'd be inconvenient to use any other model". Anyone with any statistical training should be aware of this issue when performing their own statistical analysis, reviewing a manuscript, or reading a journal article. If not, then they are not doing their job.

In addition, many many medical scientists use statistics as a tool to filter things (e.g. candidate genes, target enzymes, treatments, etc). In this case, 100% accuracy is not really important. Once the scientists narrow down the genes, they can test the validity directly in either test animals or real people.
This is why microarrays are waining in popularity in research. Less than 10 years ago they were being touted as the "Big Thing" that was going to revolutionize investigative genetic research. However, I was sitting at a table with 3 geneticists last night and they were talking about how everyone was moving away from microarrays toward hyperfast sequencing. Their explanation was that microarrays gave soo much information (and consequently so many chances for false positives/negatives) that the reproducibility was very poor. Sequencing on the other hand has excellent reproducibility. That improved accuracy is worth the much higher price tag.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

Two weeks of six sigma classes... by ctmurray · 2010-03-17 14:52 · Score: 2, Funny

Our company six sigma training included two weeks of collecting and analyzing data with a stats package. I got enough experience to even train me how to use the program. I can still do a few things that come up regularly. Probably the best thing to come out of six sigma (for me at least).

Re:Two weeks of six sigma classes... by Bemopolis · 2010-03-17 18:03 · Score: 2, Funny

So, you got twelve sigma-weeks of statistical training?

--
"I guess the moral of the story is, don't paint your airship with rocket fuel." -- Addison Bain
Re:Two weeks of six sigma classes... by NeoSkandranon · 2010-03-18 03:07 · Score: 1

Out of curiosity, did you see that your fellow trainees had serious math issues? Out of the trainees (admittedly, few) my company has put forward the majority have come away from the class still having amazingly poor abilities at analyzing data.

--
If you can't see the value in jet powered ants you should turn in your nerd card. - Dunbal (464142)

Re:Long winded troll by khallow · 2010-03-17 14:53 · Score: 1

The entire article can be summed up by the tiresome cliche "correlation != causation".

That misses a lot of the problem. For example, observer bias through poor statistical design of the experiment or throwing out data can cause the appearance of correlation or causation in data that isn't so.

Personal experience by nanoakron · 2010-03-17 14:53 · Score: 5, Interesting

As a doctor myself, I feel I should add my $0.02...

Throughout med school we had the odd scattered lecture on statistics, and later when reading papers I used to skim over most of the maths just to look for the P value at the end (one representation of how statistically significant a result is).

However, I then took a formal stats course and was amazed at how little I understood - Monte Carlo techniques, Markov models, and even something as trivial yet important as the difference between a parametric versus a non-parametric test.

And then it struck me - most of the research I had read had applied parametric statistical tests to their data - that it, the researchers made an assumption that the underlying distribution of results would fall on a normal curve. Yet this simple assumption may be all it takes to skew the data when they should have chosen a non-parametric test instead.

So yes, stats are vitally important, badly taught, and focus too much on the maths rather than the concepts. Remember that we're doctors, not mathematicians - the last set of sums I did were in high school. If I need to analyse data, I'll probably plug it into SPSS - although now with my eyes open.

-Nano.

Re:Personal experience by Cryacin · 2010-03-17 15:13 · Score: 1

I figured that marathons are just running on a much higher level. Makes sense when you think that the sport was invented by some Greek guy running towards Sparta with an army of soldier with pointy spears chasing after him. That sort of thing would make anyone run long and hard.

--
Science advances one funeral at a time- Max Planck
Re:Personal experience by spasm · 2010-03-17 15:18 · Score: 1

And that, my friend, is why the NIH's constant push to produce more 'physician-researchers' continues to drive me nuts. Because they rarely insist K awards and other early-career training mechanisms require physicians intending to do research in areas where stats are important actually get any stats training..
Re:Personal experience by Frequency+Domain · 2010-03-17 15:24 · Score: 5, Insightful

...And then it struck me - most of the research I had read had applied parametric statistical tests to their data - that it, the researchers made an assumption that the underlying distribution of results would fall on a normal curve. Yet this simple assumption may be all it takes to skew the data when they should have chosen a non-parametric test instead.
So yes, stats are vitally important, badly taught, and focus too much on the maths rather than the concepts. Remember that we're doctors, not mathematicians - the last set of sums I did were in high school. If I need to analyse data, I'll probably plug it into SPSS - although now with my eyes open.
That's a good insight. I'm a statistics professor, and some of the problems I see are a) people generally get exposed to a single course in statistics; b) they're usually mathematically unprepared for it; c) so much gets squeezed into that one opportunity that heads are exploding; d) because of (a) - (c), everybody wants you to "just give 'em the formula"; e) since statistics is so widely used, there's a plethora of courses that are being taught by people who themselves are victims/products of (a) - (d), and are very happy to "just give 'em the formula"; and so e) most people plug and chug data through a stats package with no idea of the applicability, limitations, and interpretation of the results. The sheer volume of bad analyses is enough to make you weep, and contributes to the widely held perception about "lies, damned lies, and statistics". And that completely ignores the intentional falsehoods propagated by people who are trying to support various advocacy viewpoints, and will happily mislead the public with biased samples, Simpson's paradox, invalid assumptions, etc.
Re:Personal experience by fermion · 2010-03-17 15:52 · Score: 1

Remember that we're doctors, not mathematicians
Would a doctor admitted that he or she could not read past a tenth grade level? I think not. Yet I am amazed at the number of apparently educated people who are willing, even to the point of being proud, of the fact that they cannot do anything basic high school math.
Statistics is a very difficult subject. I have taken several courses and still cannot tell you the when to use a Paisson or Binomial distribution, but I do know the basics. For instance, most naturally occurring variable be frequency distributed according to a normal distribution. The key idea is that the variable is random. What this means in terms of medical studies is that the participants are chosen randomly. Defining how random a variable is a particularly hairy, yet important, part of statistics. If the sample is random, and representative, then the result are crap.
I know enough doctors and medical researchers to know that the statics illiteracy is universal. There are relatively simple books that explain much of what a researcher must know(I can't recall the names, but researchers around the hospital probably can recommend one). And, like I tell students, it is possible to know whether the results of a calculator, or SPSS, is reasonable.

--
"She's a scientist and a lesbian. She's not going to let it slide." Orphan Black
Re:Personal experience by martin-boundary · 2010-03-17 16:00 · Score: 3, Interesting

There's a certain level of historical baggage as well.
Even ten to fifteen years ago, students in Statistics courses had very little computer exposure, and that of course means any practical analyses would imply the use of approximations - hence the widespread use of chi squared tests and normal distributions for everything, whether appropriate or not.
If the statistician -> textbook -> student/scientist -> textbook -> scientist process is factored in, I have no doubt that it will take another generation or two before the old style of statistics is replaced sufficiently widely to be only a memory.
Re:Personal experience by Idiomatick · 2010-03-17 16:08 · Score: 1

It was to Athens. And no one with spears was chasing him. The Greeks beat the Persians at Marathon. And he was running back to bring the good news. He was just ... like REALLY excited about it. As the story goes, after he arrived in Athens he died of exhaustion.
Re:Personal experience by WeirdJohn · 2010-03-17 16:22 · Score: 2, Informative

It's the approach that you can just pump the numbers into SPSS or Statistica, and then call on a battery of tests until you get a "significant" result that results in the kind of errors the article (and a disturbing number of /. readers) fall into.
Unless you're dealing with large samples, all z and t tests assume normality in the population, with insignificant skew or kurtosis. Yet by definition, if we have enough data to be sure we have a normal population, we have enough data that the central limit theorem makes the differences moot. Even more extreme, if we have a complete description of the population (a census) we have no need to use any inferential statistics.
Meanwhile students are told to test the data for normality, homoscedacticity and linearity, to the point where the repeated tests on a single data set make the chance of a Type II error better than even. But by saying "SPSS said so" and burying assumptions beneath a mountain of waffle and misunderstood jargon we can still get these "results" published.
No-one who can't perform a balanced block design ANOVA by hand, or explain what transforming data does to residuals under assumptions of a linear additive model, should be allowed near statistical software in my opinion. And the so-called statistics packages in popular spreadsheets should be banned, and any student relying on them should be failed.
Re:Personal experience by failedlogic · 2010-03-17 18:09 · Score: 2, Interesting

I think the term "Statistics" has become too general that people don't understand how complicated it can be. People think of Statistics as Bob saying to Alice - "Get me the stats on this weeks' sales." Alice just goes digging around and gets Bob the total sales in $, # of units sold .... etc. People don't understand or know of the concepts that are involved in polling, they just thing they called 2,000 random people and that's it. That's statistics to the public and many college graduates.
I had to take a few stats courses for my BA. Learning stats is humbling - I know I really know nothing about it now after taking a few courses. The classes I took assumed you have no inkling of the basics of calculus or algebra (much past grade 9 level). I didn't know any calculus - I took some 1000 level Algebra but, after graduating a few years ago, I'm teaching my self Calc now and I'm realizing how much less I really understood about Stats at the time.
When people really don't understand the underlying mathematical principles they shouldn't use SPSS or Excel. Heck, if you ask people what 2+2 is, they know the answer. But you tell them to apply X or Y to such and such data set with SPSS, they probably won't investigate the results. Print it out with the report. Done! If you use a Stats program you should understand what your data means, what is happening to your data, what it means when X is applied to your data and what the end result means. I don't think a lot of people are humble enough to say they don't really understand. Little white lies!
Re:Personal experience by ShakaUVM · 2010-03-17 18:18 · Score: 1

That's a good insight. I'm a statistics professor, and some of the problems I see are a) people generally get exposed to a single course in statistics; b) they're usually mathematically unprepared for it; c) so much gets squeezed into that one opportunity that heads are exploding; d) because of (a) - (c), everybody wants you to "just give 'em the formula"; e) since statistics is so widely used, there's a plethora of courses that are being taught by people who themselves are victims/products of (a) - (d), and are very happy to "just give 'em the formula"; and so e) most people plug and chug data through a stats package with no idea of the applicability, limitations, and interpretation of the results.
I work as an evaluator, doing data analysis of reams of data for school districts, and I won't pretend to be a "stats wizard" - my background is in computer science, though I have a fair bit of stats background - but I get appalled at what I see other evaluators doing.
On guy, who was presenting at the Federal Department of Education conference had his experimental group of teachers write a test, which they then took. Amazingly enough, they did better than the control group of teachers who took the same test. Other evaluators have teachers listen to a lecture and take a quiz based on the subjects covered on the lecture, and then give the same quiz to a control group. They also do better! Weird, huh? And this is probably the most common methodology used to demonstrate program success in DOE-funded programs today.
In other words, it is more methodology that appalls me than the stats - in part, because I believe all statistics are somewhat suspect. Do you think that the distribution of people that take tests really fall on a Gaussian? And yet most of the common statistical tests assume Gaussian distributions. Teachers will post the standard deviation of a test result (and use that to "grade on the curve") when the distribution doesn't resemble a bell curve at all.
Education, in general, doesn't work like administering a drug to the population.
Re:Personal experience by hazem · 2010-03-17 18:59 · Score: 1

And then it struck me - most of the research I had read had applied parametric statistical tests to their data - that it, the researchers made an assumption that the underlying distribution of results would fall on a normal curve.
It seems I heard a radio interview recently that discussed how many of the mathematical models used on Wall Street make the same error. That the probability of events don't actually follow a normal distribution, so high-impact anomalous events that would normally be in the very thin parts of the normal curve are actually in a thicker part of some other curve... and thus more likely to happen.
I wish I could remember the interview.
Re:Personal experience by dookiesan · 2010-03-17 19:02 · Score: 1

I believe that many of these quantities still are tabled. Computers are getting quicker, but data sets are growing even more rapidly and these approximations still matter. There are many good methods that don't find use because the answers can't be tabled and take too long to compute through permutation or simulation.
Re:Personal experience by OrangeCatholic · 2010-03-17 19:54 · Score: 1

I doubt it. More likely, there are cases where polling x% gives you a more interesting result than polling 100%. But more accurate? How could it be?
Of course, polling x% is supposed to give you the same results with less work, which is the whole point.
Re:Personal experience by ShakaUVM · 2010-03-17 21:30 · Score: 1

>>I wish I could remember the interview.
http://en.wikipedia.org/wiki/Black_swan_theory
Re:Personal experience by u38cg · 2010-03-17 22:15 · Score: 1

Actually, I'd disagree with that, to be honest. Many real world applications of statistics are well suited to the standard assumptions. Where people go wrong is understanding the theory behind the assumptions that are made. How many people understand that the efficacy of the central limit theorem depends on the variance of the data? Thirty or forty years ago, probability usually began with some fairly intensive set theory. Nowadays, you're lucky if you get any. The existence of numerical recipes on demand has removed the requirement to think about your data, and hence the abuse.

--
[FUCK BETA]
Re:Personal experience by sebaseba · 2010-03-18 00:20 · Score: 1

Hey, would you maybe recommend a book on that: statistics which focuses more on concepts than formulas?
Re:Personal experience by Rockoon · 2010-03-18 00:56 · Score: 1

Hello Mr Profession.

If you had to recommend only one textbook to a person more interested in the understanding (as opposed to the application) of statistics and statistical methods (they wish to "grok"), what would it be?

--
"His name was James Damore."
Re:Personal experience by Frequency+Domain · 2010-03-18 04:20 · Score: 1

Depends on what you want. If you want to know what to be on guard against when somebody is trying to lead you down the garden path to false conclusions, I recommend How to Lie with Statistics, which has been mentioned by a few other folks. The examples are dated, but the principles haven't changed and the price makes this one a steal. If you want more of a flavor of statistical thinking, but without the math, then you might consider Statistics by Freedman, Pisani and Purves. People either love it or hate it, and it's substantially pricier.
Neither one of these is adequate preparation to go on and take a second-tier look at the field, but that's not what they were written to do.
Re:Personal experience by wfolta · 2010-03-22 07:06 · Score: 1

That's a good insight. I'm a statistics professor, and some of the problems I see are a) people generally get exposed to a single course in statistics; b) they're usually mathematically unprepared for it; c) so much gets squeezed into that one opportunity that heads are exploding;...
I'd add that the head 'sploding part, for me, was probability. I wasn't really able to overcome probability's counter-intuitive nature in the first half of Probability & Statistics, so I managed to create a really-sketchy-but-pulled-my-class-grade-average-up-from-dismal picture of statistics that turned burned me on my next two or three exposures to stats in other areas... I was well into graduate school before I finally started getting it.

Pirates cause cool weather by wisnoskij · 2010-03-17 14:53 · Score: 1

Funny Stat correlation: http://www.seanbonner.com/blog/archives/piratesarecool.jpg

--
Troll is not a replacement for I disagree.

Re:Pirates cause cool weather by symbolset · 2010-03-17 14:57 · Score: 1

Current meme trends indicate an 87% chance this will become a global warming thread.

--
Help stamp out iliturcy.
Re:Pirates cause cool weather by Nefarious+Wheel · 2010-03-17 15:04 · Score: 1

Careful, sir or madam, with that graph you are treading dangerously close to a theological argument here. Global Warming is the Flying Spaghetti Monster's way of telling us we need more pirates. If you want to know exactly how pirates and global warming correlate, please send money, and we will lease you an AVOM (Awesome Volt-Ohm Meter) with a blank face with which you can scare yourself until your midichlorians take over your reflexes.
"Luke Skywalker's a Jedi of course;
And he's prone to have much intercourse;
So he calls up his Princess, to beg for some incest,
Grabs a blindfold and uses the Force.

--
Do not mock my vision of impractical footwear
Re:Pirates cause cool weather by sycodon · 2010-03-17 15:20 · Score: 1

It's funny that you say that because is not global warming a statistical creature?

--
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
Re:Pirates cause cool weather by Idiomatick · 2010-03-17 16:10 · Score: 1

Only amusing because of the resurgence of piracy in Somalia over the last 10 years.... There has also been global cooling.
Re:Pirates cause cool weather by caerwyn · 2010-03-17 16:13 · Score: 1

No, actually. It's a strongly (validated) model-based creature, which has its own shortcomings, but it's not really a statistical creature.

--
The ringing of the division bell has begun... -PF
Re:Pirates cause cool weather by RightwingNutjob · 2010-03-17 16:51 · Score: 1

Hmmm.
Warming->torrential rains->good crop yields->less pirates
Cooling->drought->poor crop yields->bad economy->pirates
Flawless
Re:Pirates cause cool weather by NoMaster · 2010-03-17 17:37 · Score: 1

"For instance in Australia where they too[k] readings from several surrounding stations and came up with numbers for imaginary stations or corrected numbers for real stations... that's not stats?"
No, the first is modelling; specifically interpolation. And the applicability (I want to say 'validity', but that's over-simplifying it) of the derived results is highly dependent upon the specific model used. Some models may derive interpolated results statistically, but it's far from a given.
The second? Well, I'd like to comment more, but in high school science 30 years ago I was taught never to throw away data, no matter how 'wrong' it looked. Much more recently in my science degree it was heavily intimated that unless such outliers either prove you right or prove you wrong, you might be better off coming up with a good excuse why that data point is wrong and excluding it ;-)
(Yes, it was one of the 'softer' sciences; specifically, ecology & environmental science. I sort of agree with their point, because it's damned near impossible to get clean, steady, reliable data, and there's just no way to account for some of the weird outliers you see without starting up a whole new research project to study them - and that's the job of the person who reads your published paper and says "hey, let's test his hand-wavey excuse and see if it stands up!".
I do agree with the poster above on the excessive reliance & misuse of transformations to normalise data - I always felt it was better to suck it up, accept the fact your data has distributions known to no man and is skewed out the wazoo, and use the least crappy test available to see if there's any significance in it. Which is why there's a significant, if slow, flow of Eco & Enviro people eschewing non-parametric statistical tests for Monte Carlo and Bayesian analysis. Most of the old school smile and make fun of us, but like the results we get ;-)

--
What part of "a well regulated militia" do you not understand?
Re:Pirates cause cool weather by hazem · 2010-03-17 19:09 · Score: 1

The model merely jiggers with the numbers, but the statistics ultimately tells you the trend.
I suspect most climate models are actually very large systems of differential equations that express the relationships between various aspects of the climate. They would then be solved numerically in discrete time steps to predict the future state of the climate. The statistical analysis of observed data would then be used to set values of constants and to establish initial conditions.
Indeed, climate modeling uses statistics, but it's only one part of it and there's a lot going on that is not statistics.
Re:Pirates cause cool weather by caerwyn · 2010-03-18 02:32 · Score: 1

You take thousands of temperature readings over hundreds of years and come up with trends and that is not statistics?
No. It's data.
Past trends in data are not being extrapolated from to provide evidence directly for global warming. Instead, that data is being used to generate and validate models and theories on how the climate reacts to various changes; those models are then being used to make predictions. The statistics is only really used in a first pass at the data to remove any data bias, and then at the end to determine confidence intervals for the model outputs. Most of the work is a whole pile of differential equations.

--
The ringing of the division bell has begun... -PF

Re:Long winded troll by Nefarious+Wheel · 2010-03-17 14:53 · Score: 1

The entire article can be summed up by the tiresome cliche "correlation != causation"...

The logical fallacy is called "post hoc, ergo propter hoc" - "after this, therefore because of this".

Sort of like - I get a headache every time someone turns on the television, therefore headaches are caused by the television.

Oh, hang on...

--
Do not mock my vision of impractical footwear

Rats and Stats by Anonymous Coward · 2010-03-17 14:56 · Score: 1, Interesting

When I did my BA in psychology Statistics was the core of the degree. It was the one subject that you could not escape and had to take for the full year every year of the degree. I heard later that the Psychology department at that Uni was sometimes disparagingly described as teaching Rats and Stats psychology.

Re:No surprise here by initdeep · 2010-03-17 14:56 · Score: 1

this is why people now consider master's degrees to be the equivalent of a high school diploma.

if you want real fun, take the average master's degree idiot and start having them manually add fractions without changing them to decimals. such as adding a bunch of measurements off a tape measure together.

hilarity ensues....................

Statistical assumptions are often ignored by Anonymous Coward · 2010-03-17 14:57 · Score: 1, Insightful

Statistical methods are typically developed for fairly specific mathematical models. A practitioner may error greatly by using a statistical method outside of its intended purview. For example, many statistical tests assume that different groups of observations are independent or correlated in a specific way. If this isn't true then the resulting inferences can be very inaccurate.

Unfortunately the spread of "easy to use" statistical software is making this problem worse. Many scientists just enter their data and select an analysis from a drop-down menu - thinking that just because their data is in the right format that the results will accurate. It would be better if people had to think about what analysis to choose rather than just treating the choice of a test like the choice of a visual effect in photoshop.

IAAS (statistician), for what it's worth...

Re:Statistical assumptions are often ignored by solanum · 2010-03-17 15:57 · Score: 3, Informative

and IAAB (biologist) and I can tell you that most scientists don't have access to statisticians or don't have the grant money to pay for them. I also don't have time to learn SAS and code my own tests, therefore I use stuff like SPSS or Genstat (both of which do allow you to code your own tests as well). Just because they are easy to use doesn't mean I do or do not understand the tests, the assumptions or their results. I would say my grasp of stats is above average for my peer group, below where I would like it to be and obviously limited.
One thing that is interesting to me is that throughout my education and career I have been warned off using multiple means comparisons and LSD in particular (I understand why and have avoided where I can and the latter always). Yet the only actual statisticians I have dealt with in recent years have recommended me to use LSD on means comparisons with 10s of means. I would be hard pressed to publish those results.
In summary, whilst statisticians like to blame easy to use stats programs for bad stats the reality is they are just a tool and if statisticians can't agree on the acceptable use of the simplest procedures I'm not sure what chance the rest of us have of getting it right.

--
Si hoc legere scis nimium eruditionis habes.
Re:Statistical assumptions are often ignored by oldhack · 2010-03-17 16:11 · Score: 1

I do remember this bit. There is an assumption made of the underlying mechanics, represented as the distribution, and you're supposed to loop back to verify the assumption.
Stats is one subtle beast.

--
Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
Re:Statistical assumptions are often ignored by Guppy · 2010-03-17 19:11 · Score: 1

I also don't have time to learn SAS and code my own tests
And unless your employer or institution has some sort of site license, there's the issue of the rather high and repeating cost of licensing SAS.
Getting SPSS at least is easier to deal with, although they're getting more and more restrictive with their licensing as well, the newer versions expire. Luckily, there are cheap alternatives, stuff like R and PSPP (but tons of biology folks just shoehorn everything into Excel, bleh).
Re:Statistical assumptions are often ignored by demonlapin · 2010-03-18 00:55 · Score: 1

Si hoc legere scis nimium eruditionis habes.
Has it been too long, or shouldn't that be potes instead of scis?
Re:Statistical assumptions are often ignored by clintonmonk · 2010-03-18 03:14 · Score: 1

...throughout my education and career I have been warned off using multiple ... LSD
Avoiding LSD is good advice for anyone.

Fair and Balanced: Fox quotes the Bible as saying by vandelais · 2010-03-17 14:57 · Score: 2, Funny

that there are only 3 kinds of scientists: those that are good at math and those that aren't.

--
Game: Player 'Donald J Trump' now has AI skill level 'experimental'.

Re:Long winded troll by Homburg · 2010-03-17 14:58 · Score: 1

science is not in the bussiness of proof

So what is it in the business of?

Re:No surprise here by Homburg · 2010-03-17 15:00 · Score: 5, Funny

I think your example would be more persuasive if it involved algebra, though.

The problem is statisticians by BrokenHalo · 2010-03-17 15:01 · Score: 5, Insightful

In other news math may not lie but people still can...

Usually (in science at least) it's not even a matter of lying. Part of the problem is that the multi-headed monster that statistics has become has a tendency to lead people to over-use numerical "answers" vomited up by stats packages, without really understanding what they are for, or how to interpret them.

Statistics are very useful for predicting certain things, but all too often they are submitted as "proof" of a given condition, which is dangerous. Sometimes we need to throw away statistics and start applying common sense.

Re:The problem is statisticians by caerwyn · 2010-03-17 15:35 · Score: 4, Interesting

Actually, one of the most dangerous uses of statistics is exactly predicting with them inappropriately. Curve fitting is especially prone to this error- attempting to make any predictions outside of the central mass of the points used to *produce* the curve is completely bogus, and yet people do it all the time.

--
The ringing of the division bell has begun... -PF
Re:The problem is statisticians by hoytak · 2010-03-17 16:52 · Score: 1

Maybe, but often common sense will lead even the most diligent scientist astray. Many times the answer that "just can't be right" is; the problem comes when we "throw away the statistics" instead of figuring out why and how it gave the answer it did.
Furthermore, variances and probabilities and confidence intervals are often discarded in favor of a point answer. It's an unfortunate reality; properly done statistics very nicely captures real life uncertainties, but the untrained eye or the popular media doesn't work that way.
I think the ideal solution is for the general technical culture to become both more knowledgeable about basic probability and less accepting of bad statistics (e.g. the world WILL be 5 degrees warmer in 2050). This will encourage people to really understand the procedures they are using. And yes, there are many fronts to this problem -- as a phd student in statistics I'm well aware of the complexities involved -- but I do trust statistics more than my common sense.

--
Does having a witty signature really indicate normality?
Re:The problem is statisticians by Martin+Blank · 2010-03-17 17:38 · Score: 3, Insightful

Many times the answer that "just can't be right" is; the problem comes when we "throw away the statistics" instead of figuring out why and how it gave the answer it did.
I've adopted in my life a truism I learned from my flight training: deal with things as they are, not how we would wish them to be.
In my work in network security, I often come across some oddities, which I present to management. They can present some uncomfortable episodes, and management sometimes wishes to just sweep them under the rug instead of addressing the problems. Now that we have a newly-upgraded IDS, we're seeing things that we never noticed before, and I suspect that we're going to be getting new guidelines on what is important.
I hope that's just cynicism leaking through the rum, but I've been there long enough to thing it might be reality instead.

--
You can never go home again... but I guess you can shop there.
Re:The problem is statisticians by dcollins · 2010-03-17 17:49 · Score: 1

"... attempting to make any predictions outside of the central mass of the points used to *produce* the curve is completely bogus, and yet people do it all the time."
Well, no, not completely bogus, just more risk-prone. As one example, look at how close Guillaume Amontons got to predicting the value of absolute zero in 1702 by extrapolating from his mercury-based thermometer: http://en.wikipedia.org/wiki/Absolute_zero#History

--
We know where leadership by an anti-intellectual "strongman" who scapegoats minorities and likes boisterous rallies goes
Re:The problem is statisticians by Capsaicin · 2010-03-17 18:26 · Score: 3, Funny

*Ahem* Cue carbon dating.
To be fair the problem with carbon dating is not merely curve fitting. A larger problem is the when God created the universe in Oct 4004BC (or thereabouts), He created Adam with a belly-button.

--
Better to be despised for too anxious apprehensions, than ruined by too confident a security. --Edmund Burke
Re:The problem is statisticians by the_womble · 2010-03-17 18:38 · Score: 3, Funny

I feel somewhat vindicated for being no good at econometrics when I see where the people who were good at it have landed us.....
Re:The problem is statisticians by Aceticon · 2010-03-17 21:49 · Score: 2, Funny

Here's a good example (credits to Nassim Taleb and his "The Black Swan" book) on the risks of extrapolation (of which curve fitting is one method):
- Based on previous experience, a turkey will confidently predict that he will wake up every morning be fed during the day and go to spleep in the evening. He can be easilly extrapolate this from the fact that it has happened every day of it's life. At some point before Christmas this turkey is going to have a big surprise ...
Re:The problem is statisticians by Hatman39 · 2010-03-17 22:10 · Score: 1

Being a PhD student who mostly works with statistics (even though I have little formal training in it), I can attest to the truth of this. Sometimes you get results that do not vibe with what you are seeing, or that are doubtfull in some other way. Of course, we can blindly listen to the stats, or we can find out why the results are as they are. I try to do the latter, but the 'SPSS effect' tends to promote the former.
Re:The problem is statisticians by umghhh · 2010-03-17 23:08 · Score: 1

which just proves that scientists especially the 'social' brand are not real scientists. What a surprise.
Re:The problem is statisticians by tophermeyer · 2010-03-18 01:20 · Score: 1

which just proves that scientists especially the 'social' brand are not real scientists. What a surprise.
Social science is unlike Physical science to be sure. Social scientists are not always able to draw perfect conclusions or make perfectly accurate predictions, but that is not to say they are not scientists. Keep in mind that the entities that make up social groups are in fact individuals. These individuals have their own motivations and their own decision making processes. When you consider the whole system, it is incredibly complex and impossible to fully understand.
This like programming code can be understood if all of the conditions can be understood. Granted, its not easy to understand how the complete code for a program will interact, and what effect that will have on hardware, and how that hardware may degrade etc. It is much harder to do that with social groups, because we can't open up a person to find out exactly what makes them tick. We can observe and postulate, and make increasingly better decisions. But those individuals are capable of acting in ways that outside observers simply cannot understand.
IMO a great comparison is Meteorology. Earth weather patterns are incredibly complex. Meteorologists make predictions about what patterns will likely develop, but there are simply too many variables to be perfectly precise.
Re:The problem is statisticians by PachmanP · 2010-03-18 04:15 · Score: 1

IMO a great comparison is Meteorology. Earth weather patterns are incredibly complex. Meteorologists make predictions about what patterns will likely develop, but there are simply too many variables to be perfectly precise.
I worked in a weather office. The prediction method involved roulette.

--
You're thinking small. Why miniaturize the laser, when we could instead enlarge the sharks? -John Searle
Re:The problem is statisticians by Carnildo · 2010-03-18 08:45 · Score: 1

Actually, one of the most dangerous uses of statistics is exactly predicting with them inappropriately. Curve fitting is especially prone to this error- attempting to make any predictions outside of the central mass of the points used to *produce* the curve is completely bogus, and yet people do it all the time.
Actually, you can predict outside your data set. The rule of thumb is that for a dataset that produces a simple curve (say, an exponential or linear fit), you can extrapolate up to 10% of the width of your dataset on either side (eg. given world population data from 1900 to 2000, you can extrapolate the population in 2010 or 1890). The problem is that people tend to extrapolate much further out than that (say, predicting the population in 2100).

--
"They redundantly repeated themselves over and over again incessantly without end ad infinitum" -- ibid.

Current Data by Dripdry · 2010-03-17 15:02 · Score: 1

Does that mean that we should send people who know what they're doing to sort through results and draw more meaningful conclusions? Or just rerun the tests?

This seems obvious, so please don't waste mod points here, people who know what they're actually talking about will probably chime in.

--
-

Re:No surprise here by skine · 2010-03-17 15:02 · Score: 2, Informative

It's perfectly reasonable that someone use a calculator for sales tax (if an exact answer is desired).

Also, sales tax is multiplication - not algebra.

Re:No surprise here by Lunix+Nutcase · 2010-03-17 15:03 · Score: 1

And what are we supposed to make of your post where your supposed case for people not knowing algebra has nothing to do with algebra?

Study says by oldhack · 2010-03-17 15:04 · Score: 1

They're all buncha crap, and I say this with 95% confidence interval, or sum such stat shit that I wish I can remember.

--
Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.

Re:No surprise here by coolsnowmen · 2010-03-17 15:05 · Score: 5, Insightful

You are a jerk.
You are insulting your sister because she is bad at mental math? It is a skill; one not required for extensive knowledge of the social sciences. Additionally, maybe if sales tax is simple in your state like 10%, but where I live it is 4.5% which is not always easy to get exactly right in your head.

I had a roommate who was brilliant,funny, a singer and an artist, and yet, he couldn't calculate tip to save his life, but I don't certainly hold that against him.

Countless? by andr00oo · 2010-03-17 15:06 · Score: 1

> countless conclusions in the scientific literature are erroneous

Number of Publications: Finite
Number of Conclusions: Finite
Time taken to count erroneous conclusions: Finite

Countless Conclusions? I don't think so!

A large but unspecified number of conclusions in the scientific literature are erroneous: Not so compelling

Re:Countless? by martin-boundary · 2010-03-17 16:22 · Score: 1

Time taken to count erroneous conclusions: Finite

One, two, three, ...., four thousand seven hundred and sixty two, four thousand seven hundred and sixty three, four thousand seven hundred and sixty five, four thousand seven uh, wait that's not right sixty uh oh damn I'd better start again. One, two, three ...

Excellent by zoso1132 · 2010-03-17 15:08 · Score: 2, Insightful

One of the best articles I've seen on stats (and their misuse). I'm taking a data analysis course at the moment and I've spent at least a dozen hours simply computing confidence intervals, testing the null hypothesis, and determining significance. It really has changed how I view statistics because it keeps pounding in these very key but oft-ignored principles.

--
"Everything is linear if plotted log-log with a fat magic marker."

bad title by obliv!on · 2010-03-17 15:10 · Score: 5, Interesting

It is not a shortcoming of statistics that other people, like various scientists who aren't statisticians, don't know how to use or properly interpret statistics. It is a shortcoming of their knowledge.

It is not a shortcoming of the Copenhagen interpretation of quantum mechanics or the Chicago school of economics if I don't understand or know how to correctly interpret their results. It is my shortcoming and fault for not knowing enough to connect the dots.

I do statistical research some of that is through interacting with researchers in the biosciences. Often when I go to talk to a researcher and ask them if they could use some statistical or mathematical or computational assistance with their research it has almost always been a fruitful starting point to long conversations and getting into the research. Now sometimes it was simply a matter of looking at their F-test results or ANOVA scores and telling them what it meant (like with a regression model relating proportions of certain characteristics between taxa), more useful interactions for me often mean working on new algorithms or estimators or working with fitting a model from their empirical data because there isn't a reliable standard model to work off of (like intergenic distance between genes in an operon) that kind of challenge makes less engaging work worth the hassle. Maybe I'm odd because I've worked hard to have a good background in both statistics and biology, but I shouldn't be.

Although here is an observation that perhaps supports some of the intent of the article from my own experience. I was speaking with a biology graduate student and it came up that they had a biostatistics course in the department. Of course as a statistician my mind goes towards survival function, failure rate, life tables, censored data, bioassy, epidemiology, microarrays, clincal trials, topics along those lines. It turned out their course focused z tests, t tests, f tests, confidence intervals, point predictions, least squares regression, multiple regression, ANOVA, and things along these lines just with simulated problems in a lab setting. That is not necessarily a bad thing, but much of the core math was under played or missing like model assumptions and alternate formulations or things like dummy variables. The worst part was that even though they were doing well with the class they had no confidence in actually using the statistics and didn't understand how to interpret the meaning of something like a confidence interval, they knew how to calculate one, but it wasn't clear what it actually meant to them.

The corollary to the notion in the summary I'd rant and claim is that scientists overall have less than desirable skills in mathematics, statistics, and computation than those who studied those disciplines principally and that's hurting science. However many in those three disciplines really know little beyond basic results in any of the sciences which hurts the applicability of these mathematical fields to the sciences and likely hurt our ability to develop certain types of discipline specific results that can be generalized from work in application problems.

In either case whether you're a typical scientist or a typical math/stat/comp person in order to become proficient enough in the other areas it requires going an awfully long out of the way compared to any counterpart who simply does not care and goes straight through as many before have. While in some areas of research on either side it is no problem to do as has been done and not further knowledge into those other areas. Increasingly results that have the highest levels of impact are coming more and more from truly interdisciplinary research. In order to further encourage that for those who are interested in such fields (aside from making more clear what areas in any of the fields fringe to such interdisciplinary work) we need more incentive to study more than one field and/or better ways of enabling fruitful cooperation between the camps.

Re:bad title by As_I_Please · 2010-03-17 16:26 · Score: 1

Do you have any comment on Bayesian statistics? Would using that have better results?
Re:bad title by Daniel+Dvorkin · 2010-03-17 19:41 · Score: 1

Bayesian statistics are a tool, preferably one of many in the statistician's toolbox; one of my many beefs with TFA is that it presents the conflict between Bayesians and "frequentists" as something new and vital, rather than a largely settled argument. People who insist that one approach or the other is the One True are thankfully pretty rare these days.
In any case, no matter what flavor of analysis you're talking about, the simple fact is that statistics is a hard subject to learn; there are only so many years in a human lifetime. It's not impossible to learn enough statistics to call oneself a statistician, and also (for example) learn enough cell biology to call oneself a cell biologist -- it's just damned difficult. As a bioinformaticist, coming mainly from the math/CS/stats side of things, I know a hell of a lot more biology than most people do, but when I work with biologists I'm constantly reminded of how much I don't know. And they have the same experience from the other side.

--
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Re:bad title by obliv!on · 2010-03-18 10:04 · Score: 1

As Dvorkin put it, it isn't as simple as always use classical methods or always use Bayesian.

I also think sometimes whenever possible you need to use both to help give you more evidence. Particularly if the results are somehow unclear. If they both give similar or approximately the same results that might be pretty good evidence that the final result is reasonable.

If you get wildly different conclusions then you need to consider why you got different answers and maybe it is because one method better models the situation than the other (and there are some tools to help figure that out in some cases) or perhaps the irregularity is identifying a problem with the data collected that would have gone undetected if you only used one or the other (and there are tools for when this happens too in some cases).

Very careful model building, very careful experimental design, very careful analysis, and very careful model adjustment isn't a guarantee that the results will be correct with any methods, but it is the best that can be done and is critical to always follow.

Re:Long winded troll by Anonymous Coward · 2010-03-17 15:13 · Score: 1, Interesting

Evidence. Big difference.

Probability Theory: The Logic of Science by DuncanFoley · 2010-03-17 15:14 · Score: 1

The clearest discussion of the logic of probability reasoning I know of is E.T. Jaynes' Probability Theory: The Logic of Science. (Cambridge University Press). Many of Jaynes' excellent papers on statistics are downloadable from http://bayes.wustl.edu/etj/etj.html.

Re:Long winded troll by obliv!on · 2010-03-17 15:14 · Score: 1

Science is in the business of probably knowledge. So they really need to improve their probability and statistics knowledge.

Re:PhD Candidate in Biostatistics Here by MindlessAutomata · 2010-03-17 15:16 · Score: 2, Funny

I don't have to be a statistician to know that the above post is 97% bullshit.

Re:Long winded troll by TapeCutter · 2010-03-17 15:17 · Score: 1

Yes, it's rarely mentioned that causation implies correlation.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

50% of all statistics are useless by davidwr · 2010-03-17 15:18 · Score: 1

+/- 50%*.

*confidence interval=100%

--
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.

Re:No surprise here by Anonymous Coward · 2010-03-17 15:18 · Score: 1, Informative

Arithmetic is not algebra. Arithmetic is "What's 10% of $24.45?" Algebra would be "On a given day i, John sells n_i apples to Peter at x_i dollars each, and this price includes sales tax which is a constant proportion 0p1. Let x_1= .. x_2= ... ... What is the tax on the apples sold on days 1 to 12 inclusive?"

The difference is 24.45 . 10/100 versus p\sum_{i=1}^{12} n_ix_i. Granted, there isn't much difference there really, but come on, there is a time and a place for everything, calculators included.

only in medicine by rook166 · 2010-03-17 15:20 · Score: 5, Interesting

In reading a couple of these types of articles recently I've noticed that the articles always talk about this being a problem across all journals, but only seem to mention a couple of different disciplines - medicine usually chief among them. Has anyone heard/read anything naming a hard science (e.g. chemistry or physics) as full of bad stats? My hunch is that this happens most often in medicine because you have the combination of controlling for a lot of variables as well as inadequate mathematics training.

Re:only in medicine by kevinadi · 2010-03-17 16:24 · Score: 1

The article also mentions economics.
I think it's so problematic in medicine due to the cost involved to perform proper trials. Stat is a shortcut way to make what you're doing look good in the shortest amount of time possible.
Re:only in medicine by Idiomatick · 2010-03-17 16:47 · Score: 1

The softer the science the shittier the stats. Likely for the reasons you listed "the combination of controlling for a lot of variables as well as inadequate mathematics training". Not to offend any social science majors but they often are unable to or unwilling to learn the math required. Not that I think they are stupid, but that the mathematical mind and the social science mind are specced differently. This gap results in terrible stats.
Re:only in medicine by physicsphairy · 2010-03-17 17:15 · Score: 1

Looking at my school's course catalog, I'm not sure where people in any of the hard science programs would be getting this kind of knowledge. Even the probability course--which students grumble about if they have to take instead of the "easier" statistics class--is not that in depth. There are too many areas to cover in a single semester to focus on ironing out all the "gotchas" rather than simply introducing the core material. And no statistics related courses are required beyond that one class.
In other courses professors are rather more intent on teaching their own material. They'll teach t-tests, etc., if they have to, but not at the depth which would address the problems described in the article. In my experience the physics department has been the only place where they don't play fast-and-loose with mathematics in general.
I think you probably have less problems in the published research of the hard sciences in that that is often interested in applying an actual theoretical model. Misapprehending the finer points of the statistics involved is not as relevant as it is when you are just datamining the genome for any correlation you can grab at.

--
When things get complex, multiply by the complex conjugate.
Re:only in medicine by daver00 · 2010-03-17 19:31 · Score: 3, Funny

Physics (yes, Physics, THE hardest of hard sciences) is full of terrible mathematics, absolutely terrible, shockingly bad stuff. The good ones know it, some will say it doesn't matter because their butchery comes up with "accurate" results. If they can't even get their analysis right, what can we expect of the softer sciences? That said physics is not so much concerned with statistics as it is probability, none the less, they have some serious problems, for example they often simply decide highly non-convergent things should converge because the experiment says it should...
The greatest tragedy in modern science (in my eyes) is the loss of physics as a hard science, currently these guys are way off with the fairies and producing nothing of worth, string theorists are the worst. We'll see what the CERN guys manage to come up with, but right now the mathematicians have taken the ball and run with it. It has been said that physics has become too hard for the Physicists...
I am not trolling, I am quite serious about Physicists playing dodgey games with mathematics.
Re:only in medicine by ShakaUVM · 2010-03-17 21:35 · Score: 1

>>Physics (yes, Physics, THE hardest of hard sciences) is full of terrible mathematics, absolutely terrible, shockingly bad stuff. The good ones know it, some will say it doesn't matter because their butchery comes up with "accurate" results.
Are you talking about things like normalization?
Or are people fudging lab results and such?
Re:only in medicine by Anonymous Coward · 2010-03-18 00:25 · Score: 2, Insightful

Give some examples. I mean, real, specific examples of mathematical practices or mathematical theories that are invalid and why they are such. Based on what you said, my suspicion is you are basing your claim on a smattering of slashdot comments and no understanding of any of the physics you are referring to. Several points give you away:
1) You speak of physics but your two vague examples are (I'm guessing because your description is almost unrecognizable) renormalization theory, and string theory. You, and many others besides, forget that the many of physics sub-disciplines are not directly unconcerned with the former, and almost no one outside of high energy physics is involved in the latter. In other words, your examples leave out the bulk of physics being done.
2) Renormalization theory involves demonstrating that apparent divergences will exactly cancel. You do not just discard them. There was a saying that was popular in the 50's when people were developing the mathematical foundations for it: "Just because it is infinite, does not mean it is zero!". It was an extremely important milestone when Freeman Dyson showed in the early 50's that all such divergences - obeying certain, explicit criteria - occurring in quantum electrodynamics were renormalizable. In case you weren't paying attention, Dyson was a mathematician. In the following decades a lot of work was done to explore the mathematical properties of renormalizable theories, contrary to your assertion.
Now many theories are not - in the strict mathematical sense - renormalizable. In these cases, cutting off divergences is physically meaningful(condensed matter physics, where matter is discrete at small length scales), or physicists actively and openly discuss and search for ways to formulate theories that possess no divergences or are strictly renormalizable. One may also ask, what if the correct theory is *not* renormalizable? In other words, what if our theory, while mathematically sound, is physically inaccurate (which is the opposite of the bizzare paradigm you suggest)? This is something actively discussed (and even widely assumed) in the search for new physics, but if true, the effects are too small to be currently detectable. In other words, we are back to discarding things because they are small, which is standard practice.
3) String theory - which again, is actually a very small part of physics - is actually almost entirely mathematical, which you concede. The mathematics is fine; the question is what, if anything, does it actually mean? Your criticism makes no sense here - are you suggesting by having math taking over the physics, the math becomes bad?
4) You put accurate in quotes, as if to suggest it was a dubious claim. This is disingenuous - in fields where a physicist is liable to claim this, it is demonstrably true; theories are able to predict many constants (such as the magnetic moment of the electron) to experimental precision. Many general, quantitative phenomena that are predicted as a result of the mathematics have been experimentally verified. (BCS superconductivity, Bose-Einstein condensates, Bohm-Aharanov effect, Quantum hall effect, etc).
5) More generally physics has often been less then mathematically rigorous as new theories are developed and refined. Calculus - the basis for Newtonian physics - was not put on firm mathematical footing until the 19th century. And even then the intuitive form of calculus that Newton and Leibniz were thinking of was not formally developed until the 1960's(nonstandard analysis). Part of the maturation of physical theories is the introduction of mathematically rigorous foundations.
Seriously, make some specific claims rather than casting blanket aspersions. What physical theories today lack rigorous mathematical underpinning that physicists ignore?
Re:only in medicine by Vornzog · 2010-03-18 05:24 · Score: 2, Interesting

I've had my name included on several 'hard science' papers that had horrible statistical assumptions. I fought, and lost, because my professor had a big grant to maintain, and nobody else understood the underlying assumptions (we used an absolute scaling function, guaranteeing that our distribution was not normal, then tried to assume that it was normal). The second half of my thesis refutes the math in the last three papers I was on. Not one single person who read it understood it, which is sad because it wasn't actually all that impressive.
The only reason I'm not completely ashamed to admit that is that the bad stats don't actually change the conclusions in this case. They do invalidate the confidence intervals, though...
The training in stats required for 'hard science' is essentially nil. Most of the hard science folks I know who are not into high-end mathematical modeling just assume a normal distribution for their data, do a bit of analysis, and publish. I was in an analytical chemistry lab, where that sort of thing normally works, and to a very high precision. However, we were working with sloppy biological assays, where being within a factor of two is a miracle. Under those conditions, you need to know a lot more statistics.
Basically, the people who know enough math are working on well defined systems and theories, and the medical and biological communities don't know much math at all, but are working on very sloppy systems that need a lot of math to analyze correctly. It is therefore easier to spot the mistakes in those communities, but don't assume they aren't there in the 'hard science' papers.

--
-V-

Who can decide a priori? Nobody.
-Sartre

Re:Long winded troll by TapeCutter · 2010-03-17 15:20 · Score: 2, Informative

It's a troll because it implies scientists don't know about those things.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

Re:No surprise here by shermo · 2010-03-17 15:24 · Score: 1

I had a friend who is doing a PhD in maths and he can't calculate basic arithmetic to save his life. It's a redundant skill for pretty much everyone.

--
Insanity: voting in the same two parties over and over again and expecting different results

Re:PhD Candidate in Biostatistics Here by dorpus · 2010-03-17 15:25 · Score: 1

You think I'm full of it? Wait till you hear professors at seminars, making up whatever theories they like. I've witnessed professors from household-name schools acting like this.

Bad outcomes due to statistics? by scdeimos · 2010-03-17 15:26 · Score: 1

From TFA:

“There is increasing concern,” declared epidemiologist John Ioannidis in a highly cited 2005 paper in PLoS Medicine, “that in modern research, false findings may be the majority or even the vast majority of published research claims.”

One has to wonder, though: how much of that is due to misuse of statistics and how much is because it's paid research expected to get certain results in favour of those paying for the research?

Re:Its common knowledge by Opportunist · 2010-03-17 15:31 · Score: 3, Insightful

And 77.335% of all statistics claim more accuracy than their expected deviation warrants.

--
We used to have a Bill of Rights. Now, with the rights gone, all we have left is the bill.

Re:No surprise here by Antique+Geekmeister · 2010-03-17 15:31 · Score: 1

Given that sales tax varies based on type of purchase in some states, and is weird numbers like 6.5% in others, it can vary quite a lot. And oh, my dear lord, try dealing with "valua-added-tax" in Europe....

Re:Not Scientists by dorpus · 2010-03-17 15:31 · Score: 1

On the other hand, plenty of very smart physicists, mathematicians, etc. have approached medicine spouting much the same rhetoric as you. They very quickly became embarassed when they tried to apply their fanciful theories to medicine. If you have a better idea on how to tell apart correlation from causation in a medical context, let them know.

A Well Known Fact by rlp · 2010-03-17 15:34 · Score: 1

87.24% of statistics are made up.

--
[Insert pithy quote here]

Re:A Well Known Fact by ld+a,b · 2010-03-17 16:16 · Score: 1

Good try, but actually it is 92.951% +-0.0003. You can find the source here.

--
10 little-endian boys went out to dine, a big-endian carp ate one, and then there were -246.
Re:A Well Known Fact by RightwingNutjob · 2010-03-17 16:54 · Score: 1

Good try, but actually it is 92.951% +-0.0003. You can find the source here.
In my sampling, the probability of over-reporting of precision by extraneous sig figs is 1.00000000.

Re:Long winded troll by TapeCutter · 2010-03-17 15:34 · Score: 1

No it can't, what?

"These are subtler issues than the true-but-trivial—and tiresome—cliché you refer to."

Actually the subtler issue here has nothing to do with statistics, they are implying peer-review does not work.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

best class ever by Barleymashers · 2010-03-17 15:35 · Score: 1

I had taken a stats class in undergrad... did not really pay attention as I thought it had no use. While getting my masters I was obligated to take an advance statistics class. Going in, for the life of me I thought it would be a waste - it was the best class I ever took. I was able to use it in my job almost every week if not more ( most of the other classes were theoretical at best and had no real world application ). Ten years later, I still rely on things I learned in that class. Statistics should be mandatory for all in college regardless of major because it can be used for so many things.

What it actually said by williamhb · 2010-03-17 15:38 · Score: 5, Informative

Contrary to the parent poster's claim, the article does not focus on correlation vs causation. It focuses on people getting the correlation wrong in the first place. It lists several common mistakes scientists make when writing up research studies. (Not all scientists are very good at stats). These include:

If you run enough studies you are almost certain to find a difference that appears statistically significant at the p<0.05 level through chance alone. (It is incredibly unlikely that you will win the lottery; but across the whole pool of tickets someone wins it most weeks.) That makes studies that bulk analyze large amounts of data against many different factors, actively hunting for something that is significantly different, erroneous.
"p < 0.05" does not mean there is a 95% chance of your result being "true"; it just means that someone else rolling dice has a 5% chance of achieving the same result through chance alone.
Tests are often combined in ways that are mathematically inconsistent
Finding a statistical effect does not mean it is a strong effect
You cannot simply compare effect sizes between two studies because the results of their control groups may differ ("effect size analysis" is usually wrong)
Failing to find a significant effect does not mean there is no effect ("we found there was no significant effect on..." is misleading because "no satistical significance" is "no information" [your study didn't tell anybody anything] not "no effect" -- to prove "no effect" you need a different statistical test)

And lots of others. It then suggests Bayesian reasoning as an alternative to traditional statistical tests.

Most post-PhD scientists are aware of the common mistakes, but being aware that we make mistakes doesn't necessarily stop us from making them. If you chose a random set of conference proceedings, it is almost certain you will find at least one paper (and I suspect usually a dozen or more) that have statistical mistakes in them.

Re:What it actually said by RightwingNutjob · 2010-03-17 16:39 · Score: 4, Insightful

People who deal with raw physical measurements (radar engineers, astronomers, the guy who makes airspeed sensor of the B2--er,um...) have had this problem figured out for a while.

The result, repeatedly proven mathematically and by experience, is that the magic number is always Signal-to-Noise-Ratio. You can't get good information from crappy, scant, data.

Humanities and social-"science" types, and unfortunately the med school set, are by and large composed of people with varying degrees of pathological fear of mathematics, computation, and computer programming. I'd be willing to bet that a largish portion of even the post-PhD scientists who 'know' how to make a proper calculation for a statistical test don't really understand the physical meaning of the numbers they're copying and pasting in and out of excel.

When your attention and skill set are focused on looking through a microscope, or cutting up lab rats, or synthesizing chemicals, you probably never have the experience of being up to your eyeballs in noise estimates and P_FA's that bludgeon in the fact that your data really sucks because it's too noisy, and never need to answer fundamental questions like 'what's the probability that the ruskies will fire off a missile and this radar won't see it'/[insert biologically relevant example here], which *requires* learning the right way to do statistics.
Re:What it actually said by crmarvin42 · 2010-03-17 16:40 · Score: 1

If you chose a random set of conference proceedings, it is almost certain you will find at least one paper (and I suspect usually a dozen or more) that have statistical mistakes in them.
As someone who is sitting in a hotel room after the Midwest ASAS meeting in Des Moines, IA, I can personally attest to seeing improper statistics in the majority of the presentations I saw between 9 and 11am. There were at least 7 presentations in which they "Double Dipped" by running orthogonal contrasts (linear & quadratic) and mulitple comparison of simple effects (Tukey's HSD) on the same dataset with alpha = 0.05 for each type of test.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde
Re:What it actually said by tabdelgawad · 2010-03-17 17:30 · Score: 2, Insightful

Good summary, but I call bullshit on the article. Most of the problems you mention and the others in the article are common popular misinterpretations of statistical results, but that doesn't mean they're common mistakes made by researchers in the studies themselves. Any rookie peer-reviewer would spot them immediately if they ever make it into a manuscript.
This doesn't mean that there aren't a lot of bad statistics-based studies out there, especially in medicine. But the problems are usually much more subtle than the article implies. Standard statistical methods require many regularity and sampling assumptions to be valid, and a lot of times researchers take these assumptions for granted when even a little probing would show that they're violated. A lot of advances in recent econometrics have been in the development of robust methods (valid when standard assumptions are violated), and those advances unfortunately take a long time to filter down to the 'applied researcher' level. If you're an applied researcher, it's generally unlikely you'll use statistical advances you didn't learn as a grad student.
And frankly, I have no idea what the Frequentist/Bayesian debate has to do with any of this. To suggest that using Bayesian methods is some sort of solution for the problems listed in the article is ridiculous.

--
Imposing Libertarian views on everyone online since 1992.
Re:What it actually said by TapeCutter · 2010-03-17 18:39 · Score: 2, Interesting

"Contrary to the parent poster's claim, the article does not focus on correlation vs causation. It focuses on people getting the correlation wrong in the first place."

Fair point, I only skimmed the TFA but I still stand by my assertion that it's a troll of the "scientists don't understand statistics" genre, it even starts by claiming statistics is a "mutant form of math". Had they ommitted that drivel and not refrenced discredited papers then maybe I would have read the whole thing.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
Re:What it actually said by Ardeaem · 2010-03-17 21:55 · Score: 1

"p < 0.05" does not mean there is a 95% chance of your result being "true"; it just means that someone else rolling dice has a 5% chance of achieving the same result through chance alone.
Even this is not quite correct - it is wrong in a critical way. A p value includes the probability of obtaining not just the data you obtained, but all data which is "more extreme" than the data you obtained. For continuous distributions like the Normal distribution, the probability of achieving the "same result through chance alone" is literally 0, because the area under the normal curve at a point is 0. Misunderstanding this fact causes all sorts of problems.
Re:What it actually said by TheTurtlesMoves · 2010-03-17 22:51 · Score: 1

Most of the problems you mention and the others in the article are common popular misinterpretations of statistical results, but that doesn't mean they're common mistakes made by researchers in the studies themselves.
I work with the med school and vet school and a group of biologists. I assure you that claiming that these mistakes are common is an *understatement* of galactic proportions.

--
The Grey Goo disaster happened 3 billion years ago. This rock is covered in self replicating machines!
Re:What it actually said by thepotoo · 2010-03-18 00:01 · Score: 1

Riiight. I'll just leave this here (PDF).
(P.S. Things haven't gotten better since then.)

--
Obligatory Soundbite Catchphrase
Re:What it actually said by Anonymous Coward · 2010-03-18 00:27 · Score: 1, Insightful

I'm not a PhD and what you said is blatantly obvious to me from my 2 years in biology. That said, I've seen PhD's with 30 years experience engaged in magical thinking when it comes to numbers. It really does take an engineer to fully appreciate "garbage in, garbage out."
To give you insight to some of the things one does in biology, take a machine that you have only a very very basic understanding of. (Let's say, you only know it spits out a piece of paper that says "0.452" when you put something red in it). Now create an application and make it do something useful without killing anybody. The end result is, you'll get a thing which you still know almost nothing about, but can be statistically shown to be safe and give you the desired result under stringient conditions. That is until it doesn't for reasons which you may never completely understand. A radar is easy by comparison, because it isn't a black box. You know all the bits, you know how the bits are connected, and you should be able to replicate conditions easily. I can't exactly open up a cell and see everything that is going on.
Re:What it actually said by ColdWetDog · 2010-03-18 03:55 · Score: 1

Hey, I'm in medicine you insensitive clod....

But actually you're spot on. I was talking to some old friends from grad school the other day - solving the problems of the universe when we decided that much of what's Wrong In the World could be improved upon (but probably not actually solved) by forcing everyone attempting to get an advanced degree in anything to pass a moderately advanced statistics course.

That would likely leave me herding sheep as a career but it just might make a dent in the problem.

--
Faster! Faster! Faster would be better!

Re:Maths anxiety by reverseengineer · 2010-03-17 15:41 · Score: 3, Funny

Unfortunately, it is hard to break a viscous cycle. The high viscosity makes it easy to get stuck.

--
"FDA staff reviewers expressed concern about the number of patients who were left out of the study because they died."

Great article on similar topic by Peter Norvig by snowwrestler · 2010-03-17 15:45 · Score: 1

Warning Signs in Experimental Design and Interpretation

http://norvig.com/experiment-design.html

He does an excellent job of describing and illustrating common research mistakes, statistical and otherwise.

--
Build a man a fire, he's warm for one night. Set him on fire, and he's warm for the rest of his life.

Re:Long winded troll by TapeCutter · 2010-03-17 15:46 · Score: 1

It's in the bussiness of providing the best explaination for the available evidence. Proof is confined to axiomatic systems such as maths and generally you can't prove the axioms of axiomatic systems. Science is not an axiomatic system. See epistomology for further details.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

Missing: (huge) systematic error by harvey+the+nerd · 2010-03-17 15:50 · Score: 1

What is missing in this discussion is systematic error, which is often very large and often dwarfs the analyzed random error or even the result itself. Systematic error is frequently a basic problem in biological research and in emerging technologies with crude tools and poorly understood cofactors. The human factor can hugely inflate systematic errors where legal, marketing or politics are involved. The systematic error may not be uncovered for years or decades, if ever.

One can design "tests" that are beautifully reproducible and precise, but absolutely, and deliberately, absurdly wrong. And get away with it, nay, be be rewarded handsomely as a salable skill. It happens behind the scenes. I have direct experience in science and engineering where politics have butted in, but I see this as more common in medicine, pharmaceuticals and the medical journals. Multiple, blatant design and interpretation errors in any single article that are extremely hard to assign to mere stupidity and/or ignorance, that involve authors with clear conflicts of interest to victimize cheap (defenseless) generic drugs and supplements, and to promote their product.

How blatant does it get? I have industrial experience where a big name, big $ university consultant was given free reign to do a "political assignment". On a literature comparison of two materials' figure of merit, even after fair warning, he missed reality by 9+ ORDERS of magnitude, over a billion fold by avoiding data in equal test environments. The results of internally published, correct tests were later deliberately ignored. This did eventually lead to his backers' catastrophic failure and his dismissal. Millions wasted. This is one of dozens of such situations I've seen in intercorporate wars with NoAm and European companies (no names here). The pharmaceutical and medicine situation appears blatantly worse in terms of number of fundamental test errors in a given high profile paper, resultant damage and duration. But big profits are made!

Re:Long winded troll by Lars+T. · 2010-03-17 15:51 · Score: 2, Funny

Yes, it's rarely mentioned that causation implies correlation.

Interestingly, I have observed a correlation between people who cite that "correlation != causation" and those who ignore "causation implies correlation" in their arguments.

--

Lars T.

To the guy who modded me down from perfect to terrible Karma - Apple haters still suck

The problem is with statistics itself by Z8 · 2010-03-17 15:57 · Score: 3, Informative

I see a lot of posts bashing people for being idiots, and I'm sure that's often the case, but IMHO there are some big problems with statistics itself.

The most common school is the "classical" school, which is extremely counterintuitive. For instance, most people think that if a 95% confidence interval is 5 to 10, then the parameter has a 95% chance of being between 5 to 10. This would be true with Bayesian statistics, but exactly backwards for classical statistics. For classical statistics, it's that your 5 to 10 interval has a 95% chance of being around the parameter! This is a subtle difference that most statisticians don't even understand, and it screws up almost everyone. Furthermore the classical statement is much less useful than the intuitive statement that people think it is.
Relatedly, other schools which make more sense such as Bayesianism and likelihoodism aren't taught. Furthemore, nonparametric statistics are usually not taught to undergrads (unless they are statistics majors probably). In the real world, non-parametric statistics are often more useful because no parametric model is actually true (for instance, basic regression assumes that the Truth is in your model, and it almost never is).
Finally, a lot of statistics as it is normally taught depends on the central limit theorem. Any result that depends on the central limit theorem (or the law of large numbers) is often useless in real applications due to data poverty. The basic reason is that the average of i.i.d. random variables only converges to a normal distribution as 1/sqrt(n). Everyone knows this, and it's obvious that something that converges to 1/sqrt(n) is much much slower than the typical 1/n convergence, but people still rely on the central limit theorem.

Statistics is changing slowly (mostly because computers and R make non-classical statistics more practical) but the way it's taught still leads to problems.

Re:Fair and Balanced: Fox quotes the Bible as sayi by X0563511 · 2010-03-17 15:58 · Score: 1

... and those that may or may not be good at math. :P

--
For large sets, this will be our guide even unto death, for the LORD will work for each type of data it is applied to...

Re:Not Scientists by glwtta · 2010-03-17 16:00 · Score: 1

Wait, I'm sorry, biologists make the lesser contribution to medicine when compared to physicists and engineers? You do realize that all of medicine is biology, right?

People in the real Sciences would have been forced to take enough Mathematics and/or Statistics to be able to properly interpret Statistics.

You would think so, if you've never worked with Real Scientists. Most biologists and chemists (can't speak to the other ones) know just enough statistics to get by, and make exactly the kinds of mistakes TFA is describing - there's only so much you can "force" people to learn.

Then there's the whole not being able to tell the difference between causation and correlation. I could go on.

You seriously think this is a common problem in biomedical research? I mean the actual research, not the media spin on it.

--
sic transit gloria mundi

Good podcast on the topic by syousef · 2010-03-17 16:02 · Score: 1

http://www.americanscientist.org/science/pub/everything-is-dangerous-a-controversy

--
These posts express my own personal views, not those of my employer

Re:Long winded troll by williamhb · 2010-03-17 16:03 · Score: 3, Insightful

Actually the subtler issue here has nothing to do with statistics, they are implying peer-review does not work.

"Peer review" is another of the things that has been over-sold to the public. A science research group spends six months and a hundred thousand dollars conducting a research study using highly specialised equipement. They submit a paper to an academic conference or a small journal. It gets put out to review by three people who each spend about four hours reading it and reviewing it, and who usually do not have access to the equipment or the original data that was used in the study. Do you really think we're likely to catch every mistake at review? We certainly can't check the stats (except for the most egregious errors) because we don't have the full data tables they analyzed.

Scientists actually accept that inevitably some incorrect results will be published. More often in the smaller conferences than in the most prestigious journals, but even the journals have to publish a retraction every now and then. We also accept that most studies are never repeated, and so the "objective repeatable experiment" is rarely really tested for being either objective or repeatable. However, science has long had the "many eyes" effect at work. There are hundreds of thousands of scientists reading papers and using them in our own experiments. If some theorised effect out there is wrong, usually we'll find out eventually.

Re:Maths anxiety by caerwyn · 2010-03-17 16:08 · Score: 1

A cow is actually a rather topologically interesting beast.

Nah, it's just a funny torus.

--
The ringing of the division bell has begun... -PF

Re:PhD Candidate in Biostatistics Here by DamnStupidElf · 2010-03-17 16:10 · Score: 1

So call them out. If you don't, you're just a part of the problem you describe.

Re:Long winded troll by glwtta · 2010-03-17 16:11 · Score: 1

So what is it in the business of?

Disproof.

--
sic transit gloria mundi

Real Data by rozthepimp · 2010-03-17 16:12 · Score: 1

Research shows that 78% of all people who use the term "research shows" are just making s**t up.

Re:No surprise here by skine · 2010-03-17 16:17 · Score: 1

Basically, the point of higher-level math is what can be done without a calculator. A basic example is binomial probability. Say someone rolls six dice. A computer could be used easily to find the probability that exactly three dice are 5's by counting how many outcomes there are (46656), and how many outcomes have exactly three 5's (2500). What math does is to create an equation (n C r) p^r q^(n-r) and prove it works for any values of n, r, p and q. In the example, n = 6, r = 3, p = 1/6, q = 5/6, and P = 625/11664. (Thank you, Maple)

Sure, when one has spent enough time working with numbers, they'll have an intuition as to whether an answer makes sense. But not too many can add a 7.25% sales tax to a $64.38 purchase in their head.

Re:Long winded troll by TapeCutter · 2010-03-17 16:24 · Score: 1

You are conflating repeatability with peer-review. Peer-review is the formalised, first cut of the "many eyes effect" and will usually pick up obvious flaws in statistical methodology. To be sure peer-review is not perfect but I do not think it's being "oversold to the public", in fact I think it's been quite the opposite recently, especially in the field of climate science.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

Re:Long winded troll by Homburg · 2010-03-17 16:26 · Score: 1

It's in the bussiness of providing the best explaination for the available evidence.

That sounds right, but isn't part of being the best explanation, that such an explanation is true? Establishing the truth of an explanation would fit into one of the commonly used meanings of "proof."

I don't think your restriction of proof to axiomatic systems coincides with the way that that word is usually used. Axiomatic systems allow for a "proof" in the mathematical sense, but other kinds of demonstrations of truth are also possible, and could reasonably be called "proof."

Re:Long winded troll by Homburg · 2010-03-17 16:32 · Score: 1

W. V. O. Quine would like a word.

Re:Long winded troll by crmarvin42 · 2010-03-17 16:36 · Score: 4, Informative

Peer review is not about catching mistakes, although it can on occation. Peer review is about clear communication, such that the experiment can be repeated as identically as possible and that the readers can understand the authors justification for their conclusions. At least that's what every journal article I've read on the topic indicateded was the reason for the peer review processes creation. One of my advisors asked me about it on my written preliminary exam and I needed to do a lot of reading to be prepared for the oral exam. There were several different societies that claimed to have originated the idea, but no one claimed that the purpose was to catch mistakes, fabrications, or data manipulations.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

Lawyers, politicians, psychologists, & salespe by deodiaus2 · 2010-03-17 16:51 · Score: 1

Yes, as critical intelectuals, we are able to look at ourselves in this critical manner. However, really successful people, e.g. lawyers, politicians, psychologists, & salespeople never have this drawback. I remember Ronald Reagan talking about various issues and being absolutely wrong. However, he said it with such conviction and determination that I had to go back and check the facts. But apparently, he never did.
Another time, I remember reading about using DNA to cross check previous serious crime conviction. Judges and politician refuse to open closed cases, because doing so undermines the fact that maybe the justice system might be quite faulty. Rather than worrying about incarcerating innociant people, the legal profession was more worried about protecting their own future revenue stream.
Now, salespeople, no matter how professional and honest they might seem, are taught to never let a sucker get an even break. Doctors too, are often taught that you should never allow the impression that you might be wrong to be formed in people's minds.
Last year, NOVA had a episode about the practice of performing lobotomies on mentally ill people. One part of the story focused on treating one of the Kennendy girls during the 1960s. The girl had definite problems. However, the real tragedy of the story was how Harvard and Johns Hopkins cream of the cream doctors turned a girl with an IQ of a little girl into that of a vegatable. Although there were no scientific cases of a lobotomy of curing anyone with her problems, the doctors went ahead and preformed the procedures anyway. Well, the biggest irony of this was if these were the best doctors that money can buy, I shutter to think what would happened to people in mental institutes for the indigent and politically unconnected run by doctors graduating from state universities and military institutes.

Re:No surprise here by cortesoft · 2010-03-17 16:57 · Score: 1

Yeah... everyone knows sales tax is multivariable calculus!

Re:Long winded troll by Capsaicin · 2010-03-17 17:11 · Score: 2, Funny

Interestingly, I have observed a correlation between people who cite that "correlation != causation" and those who ignore "causation implies correlation" in their arguments.

Ah yes, but can you suggest any causal relationship between those two observations?

--
Better to be despised for too anxious apprehensions, than ruined by too confident a security. --Edmund Burke

MDs versus PhDs by Boawk · 2010-03-17 17:17 · Score: 1

I have come to the same conclusion, "the best and the brightest do not go into medical school".

Interestingly, at the medical school I attended, during the graduation ceremony the PhDs are called up prior to the MDs. This of course implies more deference to the PhD degree. I wonder if this is standard everywhere...

Re:No surprise here by crmarvin42 · 2010-03-17 17:17 · Score: 1

The ability to math in your head (rapidly), and the ability to perform statstical analysis that is appropriate and interpret it in a way consistent with the limiations of the model/design used are not even remotely related. I cannot do math in my head quickly. I tend to drop a digit somewhere and come out with the wrong answer. However, I have extensive training in statistical design and interpretation, and can spot errors in both much easier than my peers. The skills involved, while both mathematical, are not necessarily connected.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

Re:No surprise here by crmarvin42 · 2010-03-17 17:25 · Score: 1

this is why people now consider master's degrees to be the equivalent of a high school diploma.

Who are these "people"? I am a research scientist and I don't know anyone who thinks of a MS as an equivalent to a HS diploma. Hell, as biased as I am (and I know I am), I don't even consider a humanities MA as an equivalent to a BS.

As I stated in response to FrozenGeek (the OP), the ability to add fractions and the skills acquired in the pursuit of a research MS degree are not connected. My step father is a carpenter and he can add fractions without even thinking about it. He deals with them every day. I OTOH, need to write out any remotely complicated math that I do if I want it done quickly and acurately. He would be the first to tell you that he doesn't know shit about higher math, while I am considered by my peers to be above par in my statistical knowledge and understanding (as indicated by their frequent visits to my desk for help with their statistical work).

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

I went to read tf article... by timsch · 2010-03-17 17:25 · Score: 1

... and I noticed it is dated March 27. OK I guess it is when the magazine comes out but still it was a little ironic in this instance.

There are lies, damn lies... by cbope · 2010-03-17 17:33 · Score: 1

and statistics.

Seriously, the problem with statistics is that they can be manipulated to mean whatever the presenter wants. Taken out of context, which is how a lot of statistics are presented, enhances the problem. I wouldn't trust any statistic unless I can examine the data behind it.

Statistics are not inherently bad, but I think they are over-used in many areas and often present a purposefully distorted view of something. Statistics do not address causality.

Re:There are lies, damn lies... by u38cg · 2010-03-17 22:43 · Score: 2, Insightful

I'm sick of this bullshit. There is statistics, and there is lies. Statistical operations are mathematical procedures, which may or may not be appropriate. They are not, however, lies. They may be errors, deliberate or accidental. Lies, on the other hand, are what you introduce when the data does not fit the hypothesis you want to put forward. Blame the liar, not his smoke and mirrors.

--
[FUCK BETA]

financial advisors by deodiaus2 · 2010-03-17 17:35 · Score: 1

Better yet, last night on "Mad Money with Jim Cramer", someone pointed out that Goldman-Sacks research recommended selling all assets of HOG, yet looking at insider activity of their holdings, GS increased its holdings of HOG from .5M to 4M. Cramer attributed this to different divisions of GS advocating different positions. However, I think most viewers thought that GS is trying to get everyone to sell its shares of HOG so that GS can get them on the cheap.

Re:Not Scientists by crmarvin42 · 2010-03-17 17:44 · Score: 1

I have to say I agree with the OP. I got my Ph.D. in Animal Sciences from a University with a Vet School. Several of the courses I took were taught over in the Vet School and later I taught in a couple of classes attended by Vet Students. MS and Ph.D. students (ie scientists) are expected (and in turn expect) to learn concepts, apply critical thinking skills, and reason out problems from day one. Vet Students (the AnSci equivalent to Med Students) are expected to memorize facts for 2 to 2.5 years. There is no expectation of critical thinking, no reasoning out complex problems, no application of previously learned concepts to novel situations until year 3 in Vet School.

They are very highly trained, but the training they recieve is very different from the training I recieved. I very quickly started thinking of MD's and DVM's as biological mechanics by default. Nothing I've seen in my interactions with them has led me to believe that I am mistaken to think of the individuals that way until they proove otherwise on a case by case basis.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

Re:No surprise here by coolsnowmen · 2010-03-17 17:45 · Score: 1

Jesus, fucking, christ...I'm not even going to speed tomorrow after reading that.

I've Seen It All by DynaSoar · 2010-03-17 18:01 · Score: 1

I was specializing in methodology during my doctorate work and so had to not only have a good grasp on stats as performed but also able to at least estimate how well the analyses I was developing worked. We had a top notch stats professor who'd started in psychology and so ended up not only teaching all graduate courses for our department but also served as top level consultant for any and all of our projects. Since some of my work was in nonlinear phenomena and therefore stats, I spent many an hour trying to absorb everything he could offer.

When I'd gotten on top of the material, some of what I saw going on made me disturbed, angry and/or disgusted.

In EEG research it was common to go through an analysis system wherein one first does a test on all electrodes together to determine if there's a difference between conditions. Fine. But then to localize, one first divided the electrodes in half and tested left vs. right. Then one tested both left and right according to front and rear. And so on, until individual electrodes are compared. I as told this reduced false positives and retained power. I was told to do it in my dissertation. I was told who started this process. I wasn't told it was bullshit; I figured that out on my own. I looked up the reference. There was no mention of this process in the article. As is common when I tracked down such rituals, the article said to do what you could justify doing but to know what you could and could not justify due to your own ability. I also found an article that said such processes did not retain power nor reduce errors.

The stats prof pointed out that each collection of electrodes in each test was arbitrary. There was no reason that every possible combination should not be included in their ritual. A "real" result from the process should require that. I pointed out that our software localized electrical sources in the brain down to 1mm voxels (to work with fMRI data) making surface electrode analysis extraneous. I took these points and the articles back to the department and was told finally to "do what I had to" for my diss. I ended up using a nonlinear running t-test to analyze time series of signals in 2 msec windows and produced a 'movie' of dopamine effects on the frontal lobe across the first 20 msec post stimulus. Nobody on my committee could understand the analysis, but they all loved the movie. I didn't tell them I'd adapted the analysis technique used in fMRI because some of them had done fMRI research and thought they knew what they were doing. Had I had to explain my workings I'd have had to tell them they didn't understand what they were doing, and at that point I wanted to get done and get to my first job offer. NIH. Invited and non-competitive. They understood my work. Besides, by this time I'd already studied at Santa Fe Institute and had learned the difference between learning from people who knew more than I likely ever would and jumping through hoops for people who I'd already passed in ability.

I also saw colleagues doing fMRI work who had no clue they were pushing statistical testing so hard that due to the necessary correction factor they were trying to find individual data points with p values with up to 22 zeroes between dot and data, a certainty they could never realistically achieve, and a cut off level they'd never even consider trying to look at in any study where they knew at least some of what was going on. I've seen entire poster sessions at conferences on brain mapping where maybe 2 out of 200 could accurately and factually explain how their analysis worked (typically they worked with a biophysicist who could, but none of which understood the phenomenon under test well enough to describe it, meaning together they could produce results but not knowledge as they couldn't pass the latter back and forth between them).

And I've seen researchers who did understand fMRI and SPS (statistical probability mapping, the analysis technique used for fMRI). And they refused to use the technique for the reasons given. My boss at

--
"I may be synthetic, but I'm not stupid." -- Bishop 341-B

Re:I've Seen It All by u38cg · 2010-03-17 22:48 · Score: 1

I'm sure you've seen this, yes?

--
[FUCK BETA]

Re:Not Scientists by NoMaster · 2010-03-17 18:03 · Score: 1

People in the real Sciences would have been forced to take enough Mathematics and/or Statistics to be able to properly interpret Statistics.
You would think so, if you've never worked with Real Scientists. Most biologists and chemists (can't speak to the other ones) know just enough statistics to get by, and make exactly the kinds of mistakes TFA is describing - there's only so much you can "force" people to learn.

My favourite example of puncturing the "Real Scientists (tm)" who think they're above making these sorts of mistakes?

So You Think You Have a Power Law - Well Isn't That Special?

--
What part of "a well regulated militia" do you not understand?

Re:Long winded troll by TapeCutter · 2010-03-17 18:25 · Score: 1, Informative

I'm not talking about original intent, I'm talking about contempory practice, the first peer-review policy I looked at to check your assertion was the journal Nature. It doesn't say anything about clarity or repeatability, it appears to back up what I said, quoth the policy...

"Nature journals receive many more submissions than they can publish. Therefore, we ask peer-reviewers to keep in mind that every paper that is accepted means that another good paper must be rejected. To be published in a Nature journal, a paper should meet four general criteria:
* Provides strong evidence for its conclusions.
* Novel (we do not consider meeting report abstracts and preprints on community servers to compromise novelty).
* Of extreme importance to scientists in the specific field.
* Ideally, interesting to researchers in other related disciplines."

....[snip]...

"The editors then make a decision based on the reviewers' advice, from among several possibilities:
* Accept, with or without editorial revisions
* Invite the authors to revise their manuscript to address specific concerns before a final decision is reached
* Reject, but indicate to the authors that further work might justify a resubmission
* Reject outright, typically on grounds of specialist interest, lack of novelty, insufficient conceptual advance or major technical and/or interpretational problems"

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

Looking for a good book on statistics by steveha · 2010-03-17 18:37 · Score: 3, Interesting

I'm interested in learning the essentials of statistics. What would be a good book to start me out?

I got The Manga Guide to Statistics and it did introduce me to the very basics. However, there are many places where it just gives you an equation, without deriving it or even explaining it. After reading this book, I now know how to calculate standard deviation, but I'm still a bit vague on how people actually use it. I would like to see some examples of how people use statistics in (for example) science experiments.

My ideal book would explain the basics, with examples, and show how the math works. Ideally it wouldn't be a thousand pages long, either, but that's a secondary consideration.

Recommendations, please?

P.S. Those of you who know about statistics: how good are the Wikipedia pages on statistics?

steveha

--
lf(1): it's like ls(1) but sorts filenames by extension, tersely

Re:Looking for a good book on statistics by Daniel+Dvorkin · 2010-03-17 19:53 · Score: 2, Informative

Devore's Probability and Statistics for Engineering and the Sciences is probably the best one-volume, undergrad-level intro to statistics out there. Get a copy (I think it's on the sixth or seventh edition now; you can pick up a fifth edition for cheap) and work your way through that, and you'll have a pretty good idea of where all those formulae come from and how they're used. Get a copy of R and check out the "Devore*" packages in the package list too. If you want to learn more after that, I recommend Kutner et al.'s Applied Linear Statistical Models for applications, and Casella and Berger's Statistical Inference for theory.
The Wikipedia stats pages are pretty good for most things, but many of them are written with the assumption of a lot of background knowledge. If you open up a page on a particular stats subject and you comprehend it, great; if not, be prepared to do a lot of digging outside of Wikipedia, because trying to figure out the subject from the links to other WP pages is an exercise in circularity.

--
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Re:Looking for a good book on statistics by Chemisor · 2010-03-18 01:47 · Score: 1

Probability: The Logic Of Science by Jaynes. Although it is in part a rigorous text, you can skip the derivations and just read the examples. Most of the book is about how to think about probability, emphasizing the methods of correctly formulating the problem and explaining why most people fail at that (admittedly quite complex) task. Even if you don't understand a single equation in the book, you'll still benefit from reading it.
Re:Looking for a good book on statistics by obliv!on · 2010-03-18 04:12 · Score: 1

Like Daniel Dvorkin has said Devore's book Probability and Statistics for Engineering and the Sciences is an excellent starting point.

Definitely learn to use R since its free you don't have to worry about paying licensing fees. It is also widely used (no matter what you here from SAS, Minitab, SPSS, etc).

Books I would recommend that I think fit his other suggestions are Bowerman/O'Connell Linear Statistical Models: An Applied Approach and Wackerly et al Mathematical Statistics with Applications

Devore talks about Bayes Rule as does Wackerly and Wackerly's last chapter talks about some Bayesian techniques, but these are merely primers for what is typical in a Bayesian course. So I recommend these two books as analogous with Devore's: Bolstad Introduction to Bayesian Statistics and to Wackerly's: Hoff A First Course in Bayesian Statistical Methods

Some things you need from mathematics are the ability to integrate, work with matrices and matrix operations, and algebraic manipulation. Familiarity with transformations and operators especially linear ones is useful since many procedures in statistics are linear operators. The highest levels of statistics will get even more math intense using mathematical results from areas like ODE/PDE, Galios Theory, or general Measure Theory.

The wikipedia's statistics articles are pretty good overall, but as Dvorkin noted some are more technical than what would be friendly to those that are new to statistics. When you feel that's the case try using the sources linked as citations in the article or google confusing parts and it is generally possible to find an explanation for almost any background level.

However if you can get through these texts you're background would be pretty strong.

Re:Not Scientists by Secret+Rabbit · 2010-03-17 18:50 · Score: 1

Not rhetoric, fact.

But, telling correlation from causation is an easy one. Pay attention to the details and learn a lot more Maths. That'll give them some actual critical thinking ability instead of following a checklist.

It really is a rarity to see a study that has been designed properly. The only one that I know of is one studying Recurrent Brief Depression. The study used an entire practice of patients and filtered out people with other disorders to get "pure" RBD people. Then they were paired with healthy controls that were the same age and gender. That's a good experiment. Of course, it's still limited as socio-economic differences between the controls and there RBD counter-parts would change things. Then there's the statistics problem because it really isn't a random sample because all the people involved would live in the same area of the city. I could go on.

But, that's the best one I've found. And I've looked at *lots* of studies.

Also, when it comes to applying "their fanciful theories to medicine", what do you call MRIs? How about X-Ray machines? How about medicines? These things are made, by and large, Physicists, Engineers and Chemists respectively. The Biologists come into play more on the "practicing Medicine" side of things because they are actually working directly with biological systems. So, if the Medicine people want to do any good, then they'll use that to start.

However, this is exactly what the Medicine people aren't doing. They are so concerned with the short game of finding out X for Y, that they ignore the long term benefits of having an overall theory of the human body. They'd be a long way beyond where they are if they would spend even a little time on that.

But, then again, they aren't Scientists. So, they don't think that way. They are practitioners and as such, only see what is directly in front of them. That might have benefits, but it also has some serious drawbacks. Some of which I have listed.

Don't shoot the messenger.

Significance is NOT probabilty by drewhk · 2010-03-17 19:01 · Score: 2, Insightful

.. or at least not the probability of the hypothesis. This is one of the errors that people make. Having 0.95 significance do NOT imply having 95% chance for the hypothesis being true! The significance is the probability of the test outcome assuming the hypothesis is true (in other words it is a likelihood value). You have to multiply it by a prior to obtain real probabilities.

Significance values will not even add up to 1 over the two hypothesises!

The root of the problem is that frequentists can not use probabilities for statements -- only for events. In frequentist terms you have to have a sigma algebra over some Omega state space which is measurable. Bayesians on the other hand can talk about the probabilities of any statements using probability theory as an extension of formal logic. I really recommend reading the books of E. T Jeynes and David McKay.

Other false assumptions people make with statistics:
- Everything is normally distributed
- Everything has a variance
- Everything has an expected value
- Hypothesis testing is without bias (in fact it is equivalent to give 50% prior probability to both hypothesises)
- Variance means average distance from mean
- Empirical variance does not have a variance

Re:Significance is NOT probabilty by whoisisis · 2010-03-17 21:02 · Score: 1

> Other false assumptions people make with statistics:
> - Everything is normally distributed

This is almost always a true assumption, at least with a big enough data set:
http://en.wikipedia.org/wiki/Central_limit_theorem
Re:Significance is NOT probabilty by drewhk · 2010-03-18 07:28 · Score: 1

The central limit theorem holds only when some strict conditions are met -- like finite variance of the contributing probability variables.
There are other stable distributions than the normal. Check this:
http://en.wikipedia.org/wiki/Stability_(probability)
"Important special cases of stable distributions are the normal distribution, the Cauchy distribution and the Levy distribution"
The Cauchy has neither variance neither expected value. Levy distributions also do not have higher moments.

Re:Not Scientists by Secret+Rabbit · 2010-03-17 19:13 · Score: 1

*posted as is without editing, worts and all*

There is a difference between Medicine and Biology; they are NOT the same thing. Medicine is the biology of the human body. Period. End of story. Biology concerns itself will ALL life. In short, Medicine is the APPLICATION of Biology to humans. Different. But, if I'm wrong, go ahead and explain to me how those two domains are the same thing and the same size.

When it comes to Biology's contribution to Medicine, why don't you actually look up what the Engineers and Physicists have done compared to the Biologists before commenting. Biology has really only come into play recently.

Furthermore, the more you get away from Maths, obviously, the less will be known. However, if you've look at the modern Chemistry curriculum, and consider what needs to be known to understand the typical *required* Quantum Chemistry course... that's a fair bit of Maths. Btw, there's a reason why I mentioned Biology's relatively limited contribution to Science. It's because they've really only come into there own, as a Science, recently. Another couple decades or so, and they might be where Chemistry was a couple decades ago. Most of Chemistry today is actually quite good.

When it comes to the causation/correlation problem, yes it is a BIG problem. Just look through PubMed if you don't believe me. It is *very* common to have papers on there that calculate CIs with 20-30 patients (or less) like it means something. Sorry, but if they think that, they're clueless. It takes a statistically significant number of patients studied to make a CI meaningful. That's why I only really pay attention to survey studies (and view others with extreme scrutiny). They are the ones that have the highest possibility of being worth reading.

Finally, I have worked with Scientists. Physicists in particular. I also have payed attention to what the other disciplines have put out. Chemistry is meh, Biology is lesser (to one degree or another depending on the specific field within it) and Medicine is a joke. It might be politically incorrect to say such things. But, it is the honest truth. There's not really any shame in it as the more applied one goes, the more complicated things get. But, to ignore ones place is inviting disaster. That's really the point. To get them to know there place. Enough people have died due to there god complexes, overconfidence and not really understanding things (and not knowing it). They really need to acknowledge the limitations of what they do and who they are.

When it comes to the MDs that I get along with and respect. It's those that explicitly state what they are comfortable doing and what they aren't. It's those that are willing to work /with/ me not the ones who think its OK to tell me what to do when it's something that I care to be involved in. Etc. Guess which type is more rare and the average age of the ones that are more humble.

Re:Long winded troll by Marble1972 · 2010-03-17 19:13 · Score: 1

science is not in the bussiness of proof

So what is it in the business of?

Excluding mathematics, science is generally in the business of disproof.

The use and abuse of statistics. by AliasMarlowe · 2010-03-17 19:18 · Score: 3, Interesting

I'm actually at a scientific meeting and saw 7 presentations in which they "double dipped" on their statisitics before we broke for lunch.

Double-dipping is bad enough, but the medical field is rife with multiple-dipping. Each dataset is plumbed to test dozens of hypotheses, without appropriately adjusting the acceptance criteria. Even with separate datasets, if you test 20 hypotheses and discover that each one is just valid at the 95% confidence level, then there is a very good chance that there are some false positives. In the medical alleged-sciences, however, all 20 would be blindly proclaimed as truth.

And then there are the social nonsenses^W sciences... If practitioners of some discipline do not understand how to use quantitative methods, they should limit themselves to qualitative argument only. Unfortunately, in statistics as in other fields, those who are ignorant or incompetent are generally unaware of the extent of their ignorance and incompetence.

--
Those who can make you believe absurdities can make you commit atrocities. - Voltaire

Re:The use and abuse of statistics. by Peter+Mork · 2010-03-17 22:20 · Score: 4, Insightful

And then there are the social nonsenses^W sciences... If practitioners of some discipline do not understand how to use quantitative methods, they should limit themselves to qualitative argument only.
Has it ever been demonstrated that social scientists have a worse understanding of statistics than physical scientists? I ask because my observations are the opposite. The physical scientists run a t-test and declare the matter resolved (significant or not-significant). Given the complexities of social sciences, these scientists check the assumptions required to use a test (e.g., normalcy) and have a good understanding of the statistics involved. (The obligatory exception is statistical genetics: physical science with a solid statistical basis.)
Re:The use and abuse of statistics. by Fallingcow · 2010-03-18 04:40 · Score: 1

Has it ever been demonstrated that social scientists have a worse understanding of statistics than physical scientists?
You raise a good point. The problem is that the people performing such a study would be social scientists, and we can't trust their results since they're so bad at statistics.
Re:The use and abuse of statistics. by gumbi+west · 2010-03-18 06:45 · Score: 1

I worked as a physicist for a long time and I met very few really great statisticians and almost none who were good but not great statisticians.
The problem with physical scientists is that they don't really need statistics--they just draw lines based on tons of data and "noise" is just a passing problem because you just do everything over again until s/n is huge.
In contrast, social scientists deal with a million times more issues: in situ experiments, non-normal noise, censored data, survey non-response, etc.

And 50% is pulled out of ... by freaker_TuC · 2010-03-17 19:35 · Score: 1

.. a place where the sun doesn't shine (often - statistically), does that mean 100% of those are stinky?

--
--- I am known for the ones who want to find me on the net. Is that a privacy risk or a privilege? One might wonder..

LSD .. Of'course... because of the bad trip? by freaker_TuC · 2010-03-17 19:42 · Score: 1

One thing that is interesting to me is that throughout my education and career I have been warned off using multiple means comparisons and LSD in particular (I understand why and have avoided where I can and the latter always). Yet the only actual stati

I think hippies tried to warn us too in the seventies of avoiding bad trips (LSD) .. Didn't know there was any math involved in that ...

--
--- I am known for the ones who want to find me on the net. Is that a privacy risk or a privilege? One might wonder..

Indeed very tough by freaker_TuC · 2010-03-17 19:50 · Score: 1

That would mean your 100% is truely only valid for 99% at most?

Why to people ask .. are you 100% sure? while the answer is mostly "I think so" ?

To stretch this ... are you correct about your statement, even if it is statistically only for 99% correct?

--
--- I am known for the ones who want to find me on the net. Is that a privacy risk or a privilege? One might wonder..

Re:No surprise here by goose-incarnated · 2010-03-17 19:51 · Score: 1

So, work out 5% and subtract 10% of that from your answer. Not too difficult to do mentally if you want a rough approximate. 5% is merely half of 10%, so if the amount is X, then 4.5% of X is 1/2(X/10) - 1/2(1/2(X/20)).

For example X=234, 10% is 23.4, 5% is roughly 11.7, 10% of that is 1.17 so 4.5% of 234 will be 11.7 - 1.17 which is roughly 10.6. Yes, that was done mentally before I actually wrote it down. Some general rules to use for rough mental arithmetic:
finding a multiple of 5% is easier if you first find 10% (eg 35% = 10% * 3 + 10%/2)
multiplying by 10 is easy
multiplying by 2 is easy
So, break everything into sums of multiples of 2 and 10, followed by an addition
(eg. multiplying by 5 = multiply by ten, divide by two,
multiplying by 6 = multiply by 10, divide by two then add one (original value)
multiplying by 7 = add (multiply by 5 and multiply by 2)
multiply by 8 = multiply by 2, 3 times
etc...)

Well, you get the idea

--
I'm a minority race. Save your vitriol for white people.

Good book about this... by KingOfSpainIII · 2010-03-17 19:54 · Score: 1

Irrationality by Stuart Sutherland. Talks about irrationality in general, with a focus on how statistics are generally misunderstood and misused by the public, and particularly health officials. He also recommends Innumeracy by John Allen Paulos. As a good start to learn about statistics and probability theory.

My two cents by identity0 · 2010-03-17 20:07 · Score: 1

I've always thought teaching a good understanding of statistics should be a requirement for high schools, since statistics are so often (mis)used to justify public policies and legislation. We need a citizenry that can see through the bullshit, or at least think a bit critically on the subject.

I think a firm understanding of statistics is more useful than the entry level calculus and the entry-level science courses like chemistry and biology(not that those aren't good too, just not as relevant to citizenship).

Here's a nice book on statistics called "How To Lie With Statistics" that covers a lot of the ways statistics are misused. (not a referrer link or anything like that)

http://www.amazon.com/How-Lie-Statistics-Darrell-Huff/dp/0393310728

Re:Not Scientists by OrangeCatholic · 2010-03-17 20:19 · Score: 1

>You seriously think this is a common problem in biomedical research?

Of course it is. Medical studies are often condensed to a flashy headline in a newspaper. "Scientists said X is true so it must be now." Then the talking heads run off with it for the next three days. Nobody - certainly not the journalist - reads the paper itself, and generally it's behind a paywall so there's virtually no point in ponying up the $30 to be the only person with an accurate assessment.

If anything, the media "spin" makes it drop-dead easy to have a medical paper say whatever you want, since nobody is going to check it. Peer review? Here's what happened when the FDA looked at the Vioxx (COX2) data. It turns out the "peer review process" omitted the 12 and 15 month data points:

However, when the Food and Drug Administration (FDA) later presented more complete data from the CLASS and VIGOR trials on its web site, the results were less certain. The CLASS trial was revealed to also have twelve and fifteen month time points which had not been discussed in the JAMA publication; in this segment of the trial, the number of ulcer-related complications for Celebrex caught up to the control NSAID group. Similarly, the complete VIGOR study data revealed that in fact, when all adverse events, not just gastrointestinal, were tabulated, the patients receiving VIOXX had suffered (barely) significantly higher incidence of adverse events overall than the control NSAID group. In particular, the risk of serious cardiovascular thrombotic events, e.g. myocardial infarction, was 1.7% in the VIOXX patients versus 0.7% in the control group, and there were significantly more withdrawals in the Vioxx group for causes including hypertension, edema, hepatotoxicity, heart failure, or pathological laboratory findings. The mean increases in systolic and diastolic blood pressure in the Vioxx group were 4.6 mmHg and 1.7 mmHg respectively, compared to 1.0 and 0.1 mmHg in the control NSAID group. An estimated 43,000,000 Americans, nearly one out of six, suffers from arthritis. However, 42% (18 million) of these also suffer from hypertension. Therefore, the promise of better patient outcomes and lowered medical costs from use of COX-2 inhibitors may not be as great as previously hoped.

Re:Only if you restrict it to live ones... by Shadow+of+Eternity · 2010-03-17 20:56 · Score: 1

So your saying if you were to measure the popes and antipopes by weight there is just barely not enough antipope-tons to cause a significant reaction?

--
A bullet may have your name on it but splash damage is addressed "To whom it may concern."

Re:Personal experience (central limit theorem) by whoisisis · 2010-03-17 20:57 · Score: 1

> And then it struck me - most of the research I had read had applied parametric statistical tests to their data - that it, the researchers made an assumption that the underlying distribution of results would fall on a normal curve. Which in cases with lots of samples is a perfectly valid assumption. See http://en.wikipedia.org/wiki/Central_limit_theorem

Medicene / Science for money by SomethingOrOther · 2010-03-17 21:18 · Score: 1

And why would they? They can make more money on Wall Street

Think you are missing the point dude.
We (mostly!) didn't become doctors / scientists to make money.

If people are only motivated by money.... then have you ever wondered why kids climb trees ?

--
Anyone quoted by a reporter knows how little they understand
Don't believe what you read is the truth.

Re:Medicene / Science for money by JumpDrive · 2010-03-18 14:35 · Score: 1

I like science and math. I started working on a physics degree, but switched to engineering when I learned that I had a better chance of making a living in engineering. I eventually finished my physics degree, but continued in engineering.

Never wondered why kids climb trees, but have wondered why I always want to climb mountains.
Re:Medicene / Science for money by Uzuri · 2010-03-31 04:52 · Score: 1

Because money grows on trees. Duh.
Doesn't it?

--
I'm a she-slashdotter... but I make up for it by living with my folks.

Re:No surprise here by ShakaUVM · 2010-03-17 21:25 · Score: 1

>>if you want real fun, take the average master's degree idiot and start having them manually add fractions without changing them to decimals. such as adding a bunch of measurements off a tape measure together.

My question is, what kind of idiot uses fractions these days?

Just use decimals for everything.

I tend to keep running totals in my head when doing stats or budgets, just to make sure the excel spreadsheet hasn't auto-adjusted itself to miss a row. But adding 13/16 + 78/11 + 4 3/2 without converting to decimal? Pfft. You never do that once you leave elementary school. (Mixed fractions, lol.)

Re:No surprise here by Marcika · 2010-03-17 22:08 · Score: 1

And oh, my dear lord, try dealing with "valua-added-tax" in Europe....

500 million people try and succeed every day. The secret: By law, it is included in retail prices, so it does not matter whether there are 50 different rates across the EU, you pay what the sticker says. (If you are a business, your accounting software will apply the right rate and calculate the right amount for you. If you are a retailer, it is not too hard remembering the 1 or 2 rates that you have to add on the sticker.)

Re:No surprise here by imakemusic · 2010-03-17 22:24 · Score: 1

Maybe it proves his point more than he intended.

--
Brain surgery - it's not rocket science!

Re:Long winded troll by TapeCutter · 2010-03-17 22:38 · Score: 1

"That sounds right, but isn't part of being the best explanation, that such an explanation is true? Establishing the truth of an explanation would fit into one of the commonly used meanings of "proof.""

Sure, but then you fall into circular reasoning since you need proof to assert truth. I would go as far as to call well established science "beyond reasonable doubt" but that is neither proof nor truth.

The strength of scientific philosophy is that it is never 100% certain about anything and is willing to change it's explaination if provided with compelling evidence that an alternative explaination is a better fit for the observations. This usually doesn't mean the first explanation was wrong, mearly incomplete (see: Asimov's insightfull essay The relativity of wrong).

Most other philosophies (especially religious ones) view uncertainty and imperfection as weaknesses and hold up dogma and blind faith as virtues, my pet theory on that is that those philosophies seek to control people rather than inform them.

"I don't think your restriction of proof to axiomatic systems coincides with the way that that word is usually used."

Maybe, but that would be because few people are ever taught the basics of epistomology and science itself is generally taught as a grab-bag of usefull factoids rather than a coherent worldview. Also I would love to take the credit for that idea, but I am not that bright.

--
And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.

Re:Its common knowledge by Chrisq · 2010-03-17 22:47 · Score: 1

I'm waiting for some clever clogs to take the 77.28% and perform a baysian analysis based on your 70% observation and tell us what the modified expectation level should be.

Re:Its common knowledge by Chrisq · 2010-03-17 22:49 · Score: 1

And 77.335% of all statistics claim more accuracy than their expected deviation warrants.

Luckily, only 34.48% of the public ever pays attention to statistics. Only 54.13% of which can properly understand what they mean.

The world of the average Joe is mean.

And the result is only relevant to 27.765% of those.

The press are just as bad... by Endophage · 2010-03-17 23:28 · Score: 1

Apologies that I can't remember the exact details but I read about the case of a university professor in the US who lost his job for allegedly saying there were more men in science because men were more intelligent than women. The issue revolved around the press not understanding standard deviations. What the professor had actually said (in fewer words) was that the standard bell curve for intelligence is slightly difference by gender. For men it is shorter and fatter but the tails don't extend very far while for women the curve is taller but with very long tails. It boils down to there being more intelligent men but equally, more stupid men while women have the potential to be both significantly more intelligent but also significantly less intelligent than the bulk of the male population.

All the details are in the book Super Crunchers which is incidentally a fantastic read for anyone interested in the application of statistics in a very general, non-mathematical sense (it covers the use of statistics by baseball scouts, medical computers, predicting changes in flight prices and predicting wine vintages to name a few scenarios that are covered). Unfortunately the professor lost his job because of the furore generated by the misinterpretation in the press.

Re:No surprise here by qc_dk · 2010-03-17 23:52 · Score: 2, Interesting

I have the same problem. In school they were considering putting me in remedial classes because I had trouble doing basic arithmetic with even single digit numbers(I still have trouble with anything above 6). I can and could do a reasonably accurate estimate, but not the real result (possibly has something to do with me also having a bad short term memory). As soon as we got to the abstract bit (i.e. real math) I had no trouble. I can do integration with coordinatesystem shifts(e.g. cartesian->polar) in my head, but I will have to check my constants with a calculator.

Conclusion... by ilitirit · 2010-03-18 00:46 · Score: 1

Scientists should start working with statisticians.

Re:PhD Candidate in Biostatistics Here by dorpus · 2010-03-18 00:56 · Score: 1

How do you prove that isotopes stay in the same place in the mud for millions of years?

Shortcomings of of the math? No... by Mashdar · 2010-03-18 01:11 · Score: 1

Shortcomings of statistics? More like shortcomings of humans *attempting* to use statistics.

not specific to statistics by drfireman · 2010-03-18 01:45 · Score: 1

Misuse of statistics is well-represented in scientific articles. Other things that are well-represented are poor knowledge and reasoning in the area of the subject discipline, inept writing, misleading or unhelpful graphics, poor scholarship, etc. Sturgeon's Law applies across the board.

Having read a fair number of sky-is-falling articles about statistics in science, and having worked with my share of researchers (MDs and PhDs in a variety of fields) who think everything is rosy, I'm pretty sure that the truth is somewhere in between. A minor saving grace is the fact that getting the statistics wrong is not the same as getting the answer wrong. Although it's certainly quite common to find published articles that make claims with no support whatsoever, in my experience it's much more common to find articles where the inappropriate statistics just mean the support isn't nearly as strong as claimed. Spurious results tend, though not as reliably as we'd like, to get weeded out by the literature. I rarely read an article that isn't specifically about methodology in which the methods/statistics are really solid, but I also rarely read an article in which unsound statistics undermines the entire contribution.

statistical mechanics by pigwiggle · 2010-03-18 01:46 · Score: 1

I do stat mech. Most of the papers I read pay very little attention to assigning a level statistical significance to their "measurements". When they do, assumptions of uncorrelated measurements are always made - and probably incorrectly. I struggle with the statistics myself. I find myself working out of my undergraduate stats text mostly. I feel I'm more concerned with understanding how statistically meaningful my measurements are than most of my colleagues. And I worry about my understanding of the statistical methods I use.

--
46 & 2

Re:Long winded troll by Svartalf · 2010-03-18 01:50 · Score: 1

Indeed... If you can find causation, you'll find correlation.

There statistics will help confirm you're on the right path...after a fashion.

But so many people misuse the tool in question to go the other way around- and try to prove causation with correlations. You might be able to do that, through dumb luck. As often as not, you'll get all sorts of wild assumptions come up as theory due to that attempt as as likely as not you've missed something. That's why statistical analysis and meta-analysis used solely to validate a premise should be viewed as the hokum it typically ends up panning out to be (Just look at the past- it's replete with people thinking the most ludicrous things based off of "the statistics"...).

--
I am not merely a "consumer" or a "taxpayer". I am a Citizen of the State of Texas

Re:PhD Candidate in Biostatistics Here by penguinchris · 2010-03-18 02:06 · Score: 1

That's irrelevant, because isotope studies in geology are not done on "mud". You don't take the lid off a mountain and find liquid mud that's been sitting there for millions of years (in which case the isotopes would have circulated as you suggest).

The "mud" hardens into a rock. If it's really mud, then it might be mudstone or shale. Anything that's in the mud, like isotopes, is trapped in place in the chemistry of the rock. And believe it or not, geologists do realize that things "leak out" or are otherwise mixed up over time, and this is taken into account.

That said, as a geologist myself I do often think results based on isotope studies are bullshit, but not because the science of isotopes is bullshit. It's because of the problems described in TFA - misunderstanding of statistics - and misapplication of isotope-related techniques.

Your disagreement with public health bullshit is understandable, and I agree to some extent with that. However, I really don't think you understand the chemistry of isotope studies and the principles of geology that make these things valid (when used properly).

Re:PhD Candidate in Biostatistics Here by PineHall · 2010-03-18 02:08 · Score: 1

It sounds like you are a Truth Seeker who has become jaded because of the basic assumptions underlying science and because our broken human nature does not always treat the scientific results properly. You are pointing out the mess we are in and how Naturalism does not solve the problems. I would encourage you to continue to seek to understand Reality/Truth. It is important. I found that the Christian Faith fits reality best. Consider it. There are presuppositions/assumptions also with Christianity but I believe it does explain reality best and science can fit into that Christian framework.

MY common conversation by kenp2002 · 2010-03-18 02:29 · Score: 4, Informative

The largest demographic in american prisons are black americans. Real statistic but is it true?

Given a particular sample that indicates blacks are 60% of the prison population this would appear to be true.

But what if I said: "The largest demographic in prison is minority, non-whites." Suddenly the % jumps from 60% (black) to 80% (minority). Which is more right? This is the problem with statistics. Context.

Now I can say readily that the largest demographic in prison is actually right-handed people. The % now jumps to 90%.

But wait! There is more! The largest demographic is prison is actually people who prior to arrest were below the poverty line which jumps to 99% of the population. Again, all of the above are accurate based on a sample but which is MORE correct? Linear Algebra is coming into play here quickly....

When that kind of issue comes into play, it is the classic "Correlation != Causation" confusion. The majority of people in prison are in there because of "Being black? Being a minority? being right handed? or being poor?" None of the above. The majority of them are in there because they were convicted of a crime and sentenced. That is the causation of their imprisonment, the rest is correlation which may have a direct causation on the conviction or sentencing, but no direct causation on being in prison. (e.g. You cannot be thrown into prison for being poor, black, minority, right handed)

Same with medical research, politics, economics, etc. The price of oil rising 10% and a subsequent 5% drop in shipping orders. Measuring the significance of regessors is important but oddly never reported most of the time. Many factors get masked or shadowed by higher level regressors (e.g. being a minority masks a variety of other social and economic factors. In addition it can distort statistical work by being too broad. Asians have a variety of different economic and social factors as north american blacks versus even african immigrants.)

Back to the orignal subject:

We can take 100 prisoners and 100 non-prisoners and figure out rather quickly if being black is statistically significant in prison population. Non-prison population blacks would account for 25%-45% of the population (Depending on location). We can see that 60% of prisoners are black. There is a 20+% deviation from the norm. We can test to see the significance of that. Same with minorities. Now we find something quickly that right handed is insignificant because it doesn't deviate from the norm. We can test left-handed and right-handed populations and rule out the handed-ness of a convict being significant.
We can find the economic status is considerable MORE significant then minority or black as a status. We can determine that the reason minorities or blacks are disporotinally more prevelant in prison is that blacks and minorities have higher rates of poverty. We can extract and determine the statistical weight of POVERTY in regards to imprisonment (Since we find a high % of white in prison that are poor compared to the normal population.) Once we figure that out we can remove that and continue an investigation and figure out what weight minority and black has once we have removed POVERTY from the model (Residual analysis).

The problem in reporting is without providing the whole, comprehensive analysis you can miss important things. For instance to correct the injustice in sentencing, without reporting the weight POVERTY has in contrast to BLACK or MINORITY you may lose sight that you may have better success addressing POVERTY to normalize sentencing rather then MINORITY or BLACK (or not).

The same happens in medical reasearch. Given a cocktail of drugs wirthout having the whole analysis you may end up providing more of Medicine A versus B but lose sight that A & B are limited by the dosage of Medicine C.

Satistics are not bullshit, rather mearly observations with no intrinsic agenda or even implication of truth. Purely amoral, like a hand gun.. useful to both the good and evil.

Statistics don't lie, nor do they tell the truth. They simple show the relationship of the data as it stands. The Truth or Thruthiness of it is subjective and vulnerable to context.

--
-=[ Who Is John Galt? ]=-

My family is full by TheOutLiar · 2010-03-18 02:41 · Score: 1

of doctors and researchers who deal with statistics on a regular basis. My aunt and uncle are both oncologists. My grandfather is an orthopedist. Last year, my grandfather discussed this very issue with me: for the majority of his career, he did not understand statistics well enough to truly gain anything from scientific journals. He could understand things like means, standard deviation, median, etc. But when the literature begins to lean toward more esoteric statistics, he can no longer discern the meaning. He then handed me a book titled The Lady Tasting Tea, which he claims made a great difference in his understanding of statistics and their meanings. I graduated with a BS in computer science, and have taken enough statistics courses that the idea of reading one more word about chi square tests would melt my brain. But I digress. The point is that there is accessible literature out there for people who are not versed in statistics.

Oh, yeah... by russotto · 2010-03-18 02:54 · Score: 1

Lots of statistical problems seem to be ignored. Papers which blithely present meta-analyses as if they had the power of a single large study. Far too much significance attributed to case-control studies (which magnify small effects and can't, by nature, show causation). And statistical tests which simply don't have the power to show what they purport to be showing.

One example: A study purporting to demonstrate the effect of an event E on a particular variable X. The study took the average of the variable 12 months prior to E (high), and 12 months following E (much lower), and determined that event E reduced variable X. Only problem is that variable X had been declining, and about the time event E happened, that decline reversed and X started going up, though more slowly than it had been declining.

Now there's a surprise.... by ibm1130 · 2010-03-18 02:58 · Score: 1

Yeah, the mathematical statistics courses were just chock full of what we called "meds keeners" or "hoovers" ie those seeking admission to med school. Even those majoring in alleged sciences like biology were often shockingly ignorant of hard sciences and tended to fulfill only the minimum requirements in things like chemistry.

Re:PhD Candidate in Biostatistics Here by dorpus · 2010-03-18 03:11 · Score: 1

I did not dispute the properties of isotopes themselves. When studied in the laboratory, isotopes appear to have predictable properties of exponential decay. (Though, I would add that the more stable isotopes which last for millions of years can only be assumed on faith to have the same properties of exponential decay as more short-lived isotopes. Could there be emergent properties that make them deviate from predictions? We know from the world of survival analysis that longer-lived entities often do deviate; the "proportional hazards" model is inappropriate.)

What I question, as you have, are the untestable postulates involving mixtures of isotopes in the crust. The only way to test this is to make hundreds of planets and wait millions of years. We don't have the technology to do so, so we only have postulates that appear logical. As we know, plenty of ideas sound logically elegant, but fail to work in the real world.

Re:No surprise here by clintonmonk · 2010-03-18 03:20 · Score: 1

he couldn't calculate tip to save his life, but I don't certainly hold that against him.

That is, unless he's like my old roommate and stuck you with the tip each time.

Re:PhD Candidate in Biostatistics Here by dorpus · 2010-03-18 03:21 · Score: 1

I embrace Christianity as a moral system. I am a Christian in this sense. I'm not here to promote intelligent design or to oppose the theory of evolution as a whole, as the slashdot crowd may wish to label me. My position is that any number of theories can be cooked up about evolution, cosmology, etc., and one can find any amount of data to support their theory. Every theory I've come across relies on a faith in untestable or non-falsifiable postulates. Our physical reality is defined by what we look for; there are any number of legitimate theories based on data that isn't available yet, because we didn't look in the right places, or ask the right questions.

What my statistics professor taught me by ElmoGonzo · 2010-03-18 03:38 · Score: 1

Thou shalt not worship the .05 level. Correlation does not imply causation -- you need to have some idea of HOW the values are correlated. Linear regression is only valid when the relationship is in fact linear. The more variables added to a multivariate statistical model, the greater the likelihood that there will be a spurious correlation. SPSS will always find something when you tell it to look hard enough.

Re:No surprise here by coolsnowmen · 2010-03-18 04:11 · Score: 1

Definitely a cute trick,
While I thank you for your explanation, I have very little trouble doing mental math, estimations, and basically anything from addition through calculus. The point is that there are those who can do math, but can't do it in their head, even though they are otherwise intelligent, an quick-witted.

The problem you don't see is that while shifting decimals by */10 is easy, as soon as numerals have carries from the *2 or *3 the mental math becomes harder.

Re:Long winded troll by SlashBugs · 2010-03-18 04:29 · Score: 1

A lot don't. I work in biological science, and even with my mediocre maths education (I had four lectures on stats during my undergrad, plus one afternoon at the start of my PhD; everything else I've had to teach myself) I see a lot of people talking about statistical tests that they clearly don't understand.

It's sad but true that a lof of people end up in biology because they love science but can't handle the maths required by physics or even advanced chemistry. While there are plenty of exceptions, there's a very strong tendency to treat statistical tests as black-box tools: plug in the numbers, get an answer and don't worry too much about whether it's an appropriate test or what the answer actually means. The article's example of people misunderstanding the meaning of a p value from Student's T-test is actually distressingly common. Other things -- like designing and drawing conclusions from experiments without ever considering power calculations -- crop up a lot too.

The best area I've encountered so far is bioinformatics, which tends to be the realm of programmers and statisticians who've become interested in biology, rather than the other way around. I'm not in a position to give an informed assessment of their work, but the sheer pain on their faces when advising maths-impaired biologists on study design is a pretty solid sign that they're used to a much higher standard :).

Re:No surprise here by goose-incarnated · 2010-03-18 04:43 · Score: 1

Definitely a cute trick, While I thank you for your explanation, I have very little trouble doing mental math, estimations, and basically anything from addition through calculus. The point is that there are those who can do math, but can't do it in their head, even though they are otherwise intelligent, an quick-witted.

Well, that's the reason for the "trick" - it enables mental slow-motion-actors like myself to calculate in a reasonable amount of time. I taught this (and a few other basic rules) to a friend of mine who thought my approximations were an intrinsic quality, and he did just fine as soon as he learned the methods.

My only contention is that the mental magic done on numbers is not magic, and can be taught, not that some people can do it and some people cannot (which, feel free to correct me, is your point - that some people can and some cannot (and we should not judge their intelligence on that point)). I figure that *most* can do it if given a few simple rules. Regardless, I believe we both agree that people who cannot do this are not inferior in any intelligence/intellectual way.

The problem you don't see is that while shifting decimals by */10 is easy, as soon as numerals have carries from the *2 or *3 the mental math becomes harder.

Umm ... perhaps more tricks can be employed[1]? After all, if you're not after precision, then you can get pretty close to the answer very quickly.

[1] I prefer to work with fractions than decimals, as then my normal bag of "tricks" can be used :-)

PS - forgive typo in previous post - typing fast with no coffee

--
I'm a minority race. Save your vitriol for white people.

Re:Not Scientists by dorpus · 2010-03-18 05:03 · Score: 1

What you describe is a matched case-control study. There are better methods such as double-blind randomized clinical trials. An intro to epidemiology course will teach you all of this. If the high and mighty physicist can think of an even better method than clinical trials, go ahead and state them.

As for MRI machines, they produce a lot of data, but they are just statistical associations. Just because a depressed patient's brain looks different on an MRI machine from a normal person's brain does not prove any causal relationship. Again, if the high-and-mighty physicist thinks they have a better answer, they are welcome to state them.

Re:Maths anxiety by Frequency+Domain · 2010-03-18 07:14 · Score: 1

A cow is actually a rather topologically interesting beast.

Nah, it's just a funny torus.

You're udderly correct. Now if you had said it was just a funny taurus, on the other hand...

Re:PhD Candidate in Biostatistics Here by PineHall · 2010-03-18 07:57 · Score: 1

Every theory I've come across relies on a faith in untestable or non-falsifiable postulates. Our physical reality is defined by what we look for; there are any number of legitimate theories based on data that isn't available yet, because we didn't look in the right places, or ask the right questions.

Yes, we don't know Truth/Reality in the full. We only know bits and pieces and we have some concepts that are just wrong. I agree that all of our theories are incomplete and based "untestable or non-falsifiable postulates". And Kurt Godel showed that to be the case even with math with his incompleteness theorems. And yet I believe we can successfully strive to better understand reality, not the many different realities we perceive, but the one true Reality we live in.

Though science has given us a better understanding of reality, it is good to recognize the limitations of science and to question the assumptions, presuppositions and axioms that make up the theories and our beliefs. Ask yourself how well does this theory/belief match reality? If it does a poor job, if possible replace it with one that does a better job. And recognize that because humans can be biased and blind to reality, there are and will continue to be theories promoted that fall far short of reality and/or are based on bad assumptions. Don't let that get you down. Strive to better understand reality. That is the journey I am on and I believe the Christian worldview gives me the best framework to understand reality.

What are you measuring? by Colin+Smith · 2010-03-18 08:25 · Score: 1

People who deal with raw physical measurements (radar engineers, astronomers, the guy who makes airspeed sensor of the B2--er,um...) have had this problem figured out for a while.

It sounds easy to you cos you have an easy job. You only have a single, easy to measure parameter.

In other fields there can be dozens, hundreds or thousands of parameters, each with it's own signal. Determining which of the signals (if any) are meaningful is a lot harder than what you're doing. What I'm saying is, you're an engineer, not a scientist.

--
Deleted

Re:What are you measuring? by RightwingNutjob · 2010-03-18 12:27 · Score: 2, Insightful

That's exactly the point. If obtaining a degree of certainty in one measurement takes a bookload of theory to do 'properly', and is 'hard', obtaining a the same degree of certainty in a space with N channels should be 'hard'^N. The OP's point was that people assume that it should be just as easy, and don't go to the trouble of learning what it takes to do it right.

Re:Maths anxiety by Nefarious+Wheel · 2010-03-18 18:32 · Score: 1

Response to my sig
Brought gales of laughter to me

I must change it now.

--
Do not mock my vision of impractical footwear

Re:No surprise here by Antique+Geekmeister · 2010-03-18 23:20 · Score: 1

This was very profoundly not my experience when helping a corporate partner price and sell equipment. Prices for hardware are advertised _without_ VAT. Slipping in and out portions of VAT depending on the upstream vendor's behavior was insane. And oh, dear lord, if you had a sale, the VAT numbers got even more insane.

Re:No surprise here by Marcika · 2010-03-19 00:22 · Score: 1

This was very profoundly not my experience when helping a corporate partner price and sell equipment. Prices for hardware are advertised _without_ VAT. Slipping in and out portions of VAT depending on the upstream vendor's behavior was insane. And oh, dear lord, if you had a sale, the VAT numbers got even more insane.

Yes, but you were doing B2B/wholesale sales - only a very small percentage of the population will ever be in that situation. And if you are in a business, you usually know the VAT rate and have a dedicated accountant to deal with any problem that a pocket calculator can't solve...

Re:Long winded troll by khallow · 2010-03-19 01:16 · Score: 1

It's a troll because it implies scientists don't know about those things.

The implication has to be incorrect for almost all scientists for that to be a problem. As the other replier noted, there is widespread ignorance among scientists of this sort of thing.

Re:Long winded troll by crmarvin42 · 2010-03-20 01:36 · Score: 2, Informative

That people are trying to use peer-review as a method to detect fraud, does not make it a good method for doing so. I've mentioned this before on /., although not in this thread, but I have no way of telling if the numbers in a table were generated by the experiment described, some other experiment, a random number generator, or the PR department at the company who's product is being evaluated. As long as the numbers are internally consistent, I have to "trust" that what they describe, happened. I can catch obvious errors, such as the SEM not supporting claims of statistical significance made by the authors. However, if during the review process, they claim that the SEM was a typo (numbers were actually SD and not SEM for example) and change it, I have no way of verifying that their explanation was true.

Also, in your quote you highlighted 2 different lines. The first has to do with the soundness of the conclusions. This is most definitely a role of peer review, but not related to accuracy. It doesn't mean that they verify that your conclusions are correct. Conclusions are not objective. The data gives you objective facts from which to draw subjective conclusions. This line indicates that your discussion will be evaluated for how well the data (yours and previous literature) supports your conclusions. If you extrapolate, or ignore important results then your paper will be rejected.

The second bolded section just indicates that if serious errors are found (using insufficiently large sample size, extrapolating results, etc.) then the paper will be rejected. That's totally understandable to reject, but obviously serious errors of this sort are uncommon. Most errors are much harder to detect, and are not picked up by the peer review process in my experience.

--
Bureaucracy expands to meet the needs of the expanding bureaucracy.-Oscar Wilde

Slashdot Mirror

Science and the Shortcomings of Statistics

338 of 429 comments (clear)