Reanalysis of Clinical Trials Finds Misleading Results

← Back to Stories (view on slashdot.org)

Reanalysis of Clinical Trials Finds Misleading Results

Posted by Soulskill on Wednesday September 10, 2014 @01:37AM from the side-effects-may-include-pretty-much-anything dept.

sciencehabit writes: Clinical trials rarely get a second look — and when they do, their findings are not always what the authors originally reported. That's the conclusion of a new study (abstract), which compared how 37 studies that had been reanalyzed measured up to the original. In 13 cases, the reanalysis came to a different outcome — a finding that suggests many clinical trials may not be accurately reporting the effect of a new drug or intervention. Moreover, only five of the reanalyses were by an entirely different set of authors, which means they did not get a neutral relook.

In one of the trials, which examined the efficacy of the drug methotrexate in treating systemic sclerosis—an autoimmune disease that causes scarring of the skin and internal organs—the original researchers found the drug to be not much more effective than the placebo, as they reported in a 2001 paper. However, in a 2009 reanalysis of the same trial, another group of researchers including one of the original authors used Bayesian analysis, a statistical technique to overcome the shortcomings of small data sets that plague clinical trials of rare diseases such as sclerosis. The reanalysis found that the drug was, as it turned out, more effective than the placebo and had a good chance of benefiting sclerosis patients.

41 of 74 comments (clear)

Min score:

Reason:

Sort:

Decline Effect by Pino+Grigio · 2014-09-10 01:46 · Score: 5, Informative

Isn't this generally know as The Decline Effect? It's not just clinical trials, it applies to almost everything (to varying degrees). It's also been interpreted as The Half-Life of Knowledge.
1. Re:Decline Effect by tomhath · 2014-09-10 02:34 · Score: 2
  
  That article is a very long winded way of saying, yes - a small sample size can give results that are far from the mean. Flip a coin 5 times and you might get heads five times; does that mean the coin will always come up heads if you try the experiment again? No.
2. Re:Decline Effect by Pino+Grigio · 2014-09-10 02:52 · Score: 1
  
  I don't think that's what it means, no. It means there are difficult to discover sources of bias in the design, implementation and interpretation of studies. Over time they change and so (remarkably) do the results.
3. Re:Decline Effect by TheInternetGuy · 2014-09-10 12:55 · Score: 1
  
  Exactly, like with the theory of evolution and intelligent design. If the same amount of time was put towards the proof of intelligent design as with the theory of evolution, the atheists wouldn't have a leg to stand on.
  Well the Intelligent design people had a couple of thousands of years working on their story. The scientists only ~160 years since Darwin wrote his 'On the Origin of the Species'. So I'd say we should give the scientists another 1900 years before we make any conclusions.
  
  --
  If my comment didn't sound as good in your head as it did in mine, then I guess we all know who's to blame
4. Re:Decline Effect by RespekMyAthorati · 2014-09-10 18:48 · Score: 1
  
  Except you can't "prove" a fantasy.
Not the usual way science is done by Archtech · 2014-09-10 01:50 · Score: 4, Insightful

Now that is an interesting observation! Mostly, in science, when someone does an experiment that supposedly proves a theory, the next step is to document and publish every detailed step. Only when a number of peers have replicated the results can they be accepted with any confidence.
Yet in clinical trials of new drugs, it seems, only a single trial is ever done. How did that ever get accepted as proper scientific evidence?

--
I am sure that there are many other solipsists out there.
1. Re:Not the usual way science is done by Archtech · 2014-09-10 01:56 · Score: 4, Insightful
  
  Whoops, I misunderstood the article for a moment there. If it's a matter of incorrect or misleading statistical analysis, that seems to be rife in studies of nutrition at least. Part of the problem may be that the same people develop a theory, conduct studies to test it, and do the statistical analysis on their numbers. Naturally, the numbers usually turn out to support their theory!
  It might be safer if the three different activities were done by separate teams, with a "blind" system so no team knows who the other teams are. Thus the theory is developed by Team A, then studies/experiments to test it are created by Team B, and the number are analyzed by Team C. Thus Team C would have no idea what theory they were analyzing, or what might be the meaning of any correlations they found.
  
  --
  I am sure that there are many other solipsists out there.
2. Re:Not the usual way science is done by oh_my_080980980 · 2014-09-10 02:05 · Score: 3, Insightful
  
  Jesus christ people read the article not the title. Hell even the abstract pointed out the problem: SMALL DATA SETS! There was nothing wrong with the original reporting. Based on the sample size that was the proper conclusion to reach. I would not jump to the conclusion that the Bayesian analysis overturned the original conclusion. What the Bayesian analysis points to is that a new trial should be conducted with a larger sample size.
  
  FYI there's already a blind system. Again - read people.
3. Re:Not the usual way science is done by _anomaly_ · 2014-09-10 02:32 · Score: 1
  
  Drugs go through four phases of clinical trials, as required by the FDA.
  That being said, the whole clinical trial process to get a drug approved by the FDA is pretty messed up. From how pharmaceutical companies are involved, how patients are (are not) qualified to participate, how adverse events aren't necessarily properly documented, the list goes on.
  
  --
  "I have no special gift, I am only passionately curious." - Albert Einstein
4. Re:Not the usual way science is done by Shimbo · 2014-09-10 02:51 · Score: 1
  
  Yet in clinical trials of new drugs, it seems, only a single trial is ever done.
  That's not true at all. Generally, multiple trials are done and the most favourable results published. http://www.alltrials.net/
5. Re:Not the usual way science is done by Rich0 · 2014-09-10 04:51 · Score: 2
  
  There are lots of things that are messed-up about clinical trials, and the main reason for this is that they are VERY expensive to run. The problem is that you can't give somebody a drug unless their doctor is involved. Doctors make a lot of money, and clinical trials take a lot of time for them to participate in. So, if you want somebody who makes $500k/yr to spend 10 hours per week on a clinical trial, what is the only way to get them to cooperate? You have to pay them a LOT of money. Multiply this by thousands of patients, and then factor in that there are 14 cancer trials going on at the same time, so the doctor is going to pick the trial that pays the best to recommend to their patients, and now you have a bidding war on top of it.
  Nobody wants to spend the money needed to either sponsor trials run by a disinterested party, or replicate trials.
  (And yes, I do realize that some doctors are among the salt of the earth, and they will participate in trials because it is the right thing for their patients even if they were completely uncompensated. However, for every one of these there are probably 5 others that make it onto the FDA's debarrment list for getting patients to take experimental drugs that they shouldn't be taking just to collect extra fees. The list is posted online - just read it for yourself.)
6. Re:Not the usual way science is done by Anonymous Coward · 2014-09-10 06:19 · Score: 1
  
  I think there's an underlying assumption to your comment that is incorrect, which is about the purpose of clinical trials. That's ok though, most people do not understand the purpose of the FDA and clinical trials.
  The FDA's purpose is not to validate your science. That is not their mission, that mission is covered via journals/publishing/peer review processes. The FDA's job is in fact a marketing clearance organization.
  The FDA was created because it was recognized that health is very complex that many people do not understand, but if consumers are tricked into believing something about a product they buy the consequences could be very real and dire. The purpose of the FDA is all about labeling and marketing: what are you communicating to the public that your product can do? Based on what you're communicating, can your product really do what you say it can do?
  As such, clinical trials actually starts with what you plan to claim in your marketing material, be it public statements, websites, and product labels, what your product is able to do. The FDA then determines what the risk to the patient is based on what you claim it can do, and classifies your product. Based on the classification, you then have a series of things you need to perform under clinical trials to verify that your claims are correct, and they will review your claims and comment etc. Eventually passing clinical trials is actually not called "acceptance" but "clearance", as you are given "clearance" by the FDA to publish marketing material about your product. Note you are allowed to sell your product without doing any clinical trials, you're simply not allowed to say it can do anything (claiming your product is capable of something without it being cleared by the FDA is in fact a criminal offense, although it usually starts with cease and desist notifications; see 23andMe).
  So what makes this interesting is let's say you make dental floss. You claim that via it's intended use, it can clean 50 to 60% of plaque from teeth. The risk to using it is not high, and you're not claiming 100% effectivity (you still need regular cleanings), therefore it's a Class I medical device, in which case clinical trials are not necessary, only non-clinical tests and notification to the FDA. But if you take the exact same product but claim that it can be implanted and stop a heart attack under specific scenario, you are not suddenly a Class 3 device (like most implantables) and will need to conduct patient trials and probably animal trials to validate that what you claim in your marketing is actually the case.
  However, at no point is the FDA there to validate your science; you can claim your dental floss stops heart attacks all you want in a published paper as long as you're not selling it and your peers review your findings and find them acceptable.
7. Re:Not the usual way science is done by Altrag · 2014-09-10 07:56 · Score: 1
  
  It didn't. Clinical trials are performed for exactly one reason: To get FDA approval. Not for scientific integrity, not for human safety. Purely for political purposes.
  Its a crappy system. You hear all kinds of stories about companies just discarding the results of any trial that didn't prove their drug "worked" (because they're only required to provide a certain number of trial results. They're not required to provide the results of any trials they may have performed, or even note that they happened, above and beyond that certain number. So you just pretend those others never happened and keep rerunning the trial until you get the minimum required successes.)
  At the very least, the trials should be performed by an independent third party. Of course that would open up the question of who would setup and fund such a third party. The FDA wouldn't likely have the money to do that (and they have their own political agenda that could get in the way anyway) and the drug companies certainly wouldn't care to fund an organization that's basically working against them..
Re:Wooah! by Anonymous Coward · 2014-09-10 02:00 · Score: 5, Interesting

Almost had me there article! Until you said the most evil words known to man... "statistical technique". AKA "bullshit"
Bayesian statistics is far from bullshit.
I suggest you read up on it.
You can do some really cool stuff with it.
Testing if a coin flip is fair.
Correct images.
Filter spam
Re:Wooah! by oh_my_080980980 · 2014-09-10 02:07 · Score: 2, Insightful

The problem is the Bayesian analysis is far from conclusive. What it does point to is that the clinical trial needs a larger sample size. Sample sizes that are too small are useless.
Re:Wooah! by Anonymous Coward · 2014-09-10 02:14 · Score: 2, Informative

The problem is the Bayesian analysis is far from conclusive.
100% Wrong
What it tells you is the probability that your hypothesis is correct given your evidence and your prior knowledge.
Selection bias? by Chris+Mattern · 2014-09-10 02:15 · Score: 5, Insightful

They looked at reanalyses that had already been done for other reasons, rather than doing their own reanalyses on randomly selected trials. It occurs to me that these trials may have been subjected to reanalysis precisely *because* there were doubts about the initial analysis.
1. Re:Selection bias? by Xylantiel · 2014-09-10 03:07 · Score: 1
  
  So does that mean a re-analysis of the article on re-analysis leads to different conclusions than the original article?! HA!
  But I have the sneaking suspicion that this re-analysis won't be published, which is a whole nother kind of selection bias can of worms.
Re:Wooah! by Pino+Grigio · 2014-09-10 02:19 · Score: 1

Bayesians need priors don't they? Where do they come from and what affect does the choice of prior have on the final outcome? I don't think Bayesian analysis is any more of a silver bullet than any other technique.
Re:Wooah! by Anonymous Coward · 2014-09-10 02:21 · Score: 5, Insightful

No, the GP is right. While BA gives you a probability distribution for the effectiveness, unless the effect is really strong (or you bad a really bad choice of priors), that distribution is going to be quite wide for a small data set. Such results are not proving that what you were testing was effective, but that there is a decent probability it might be effective given the knowledge you gain from the test, and that you should pursue a larger test. I've found it to be quite rare to have a BA result that strongly excludes a null hypothesis in a small scale test without having already been flagged as effective by simpler tests (i.e. the effects were so obvious, didn't require trying that hard to see).
AllTrials by beatle42 · 2014-09-10 02:27 · Score: 1

This seems to highlight the reasons behind the All Trials movement: http://www.alltrials.net/
Re:Wooah! by Anonymous Coward · 2014-09-10 02:28 · Score: 1

Since people interpret confidence intervals as credible intervals (and they are usually close to same for a uniform prior), any problem with "uninformed" bayesian analysis is shared by those using frequentist techniques in practice.
Risks of Re-analysis by Rich0 · 2014-09-10 02:35 · Score: 3, Insightful

Anytime you re-analyze data you run into this.
Think about it. There are a million ways you can analyze any dataset. There are millions of datasets out there to analyze. There are millions of people who can independently decide to go back and do a re-analysis.
So, the issue is that if somebody goes back and does a re-analysis and the results are boring, nobody publishes. However, if the results are controversial, it gets published. Since there are so many permutations, you're guaranteed to find something exciting.
This is why you're supposed to establish your methods BEFORE you collect the data, and then stick to the methods you established to analyze the data. Otherwise your 95% confidence turns into a more realistic 1% confidence.
In practice, though, I'm sure the initial analyses are just as prone to this kind of problem. It just gets REALLY bad when you look backwards.
Re:Wooah! by Anonymous Coward · 2014-09-10 02:48 · Score: 2, Funny

Frequentists vs. Bayesians
statistics discussion by silfen · 2014-09-10 02:51 · Score: 3, Insightful

There are many things wrong with clinical trials, but this isn't one of them. Both the original article and the reanalysis use valid statistical procedures and do not contradict each other. The original analysis didn't prove absence of an effect, it merely failed to show the existence of an effect. The new analysis shows that the drug is, in fact, more effective under some (weak, reasonable) a priori assumptions.
Whether to use statistical hypothesis testing (frequentist methods) or Bayesian analysis is a long-running debate in statistics and medicine. Both techniques are mathematically valid. Statistical hypothesis testing makes fewer a priori assumptions, which is why people have traditionally trusted it more and why it is widely taught and used in science. But over the years that people have come to realize that pessimistic assumptions can be harmful, such as when you continue clinical trials too long or reject the use of life saving drugs. Although I personally think Bayesian methods are a better way of analyzing the data, I think the debate over which methods to use is the way scientific debate and change should happen: slowly and with careful re-analysis and re-examination of data and experimental results.
Shedding light by Anonymous Coward · 2014-09-10 02:54 · Score: 1

This is why the meta-research regarding the safety and efficacy of GMO food organisms is so important. We are constantly told by the industry and their online astroturf army that there are "thousands" of studies showing the safety of GMOs, but it turns out they're basically the same shallow study 2000 times. The most disturbing findings have come from the kind of reanalysis that this story describes. But now, those studies get shouted down because supposedly it's "settled science".
Whenever there's money involved, you have to be extra skeptical of terms like "settled science". Even if there's no obvious corruption involved, there's just too much energy and money pushing for a certain result (or perception) and it creates a kind of momentum. Yes, that includes climate change, though so far, the climate science has stood up to the copious additional scrutiny.
Good scientists aren't only skeptical about nature, they're also skeptical about themselves. We should follow their lead, not the way the Randi-style "pop-skeptics" have set up a tyranny of conventional wisdom, but by taking second and third looks.
Re:Wooah! by Pino+Grigio · 2014-09-10 02:58 · Score: 1

Constructing rational biases, which is what a prior effectively is ("objective prior") isn't all that easy though, is it. There's no universal method for constructing a prior. It's a big source of potential error.
Not so much with clinical trials by EdwardFurlong · 2014-09-10 03:10 · Score: 1

but with other studies they say researchers do have much incentive to redo trials/experiments. You don't make a name for yourself by just confirming someone else's work.
Drug companies and profit motive by Dimwit · 2014-09-10 03:18 · Score: 3, Interesting

Let's compare two companies that depend on science - IBM and GlaxoSmithKline.
Let's say IBM discovers a new method of lithography for building microchips. They publish their results, and their results are replicated. More importantly, IBM gets a new, presumably better way of making microchips.
GlaxoSmithKline makes a new drug that treats a psychological illness. To some degree, because there are no objective physical tests for most psychological illnesses, the determination of effectiveness is made subjectively.
Both companies want the science to turn out right, because it makes them money. One of them has a much easier time massaging the results of any studies.

--
...but it's being eaten...by some...Linux or something...
1. Re:Drug companies and profit motive by Rich0 · 2014-09-10 04:58 · Score: 1
  
  Well, in your analogy there is another big difference between IBM and Glaxo. In your example, if IBM does a faulty study that shows that a new way of making microchips is better, then the result will be that they spend a billion dollars on a new fab and it produces faulty microchips that nobody buys. In the other case, it is others who are relying on the results of the study to make a decision about whether to buy Glaxo's products. This is why it is much more important that drug trials be regulated. If it were up to me I'd have them all publicly conducted, but nobody wants to spend tax dollars on clinical trials.
MDs do not grok statistics! by gweihir · 2014-09-10 03:24 · Score: 1

A real shocker. As basically no profession has a good grip on statistics except specialized mathematicians, it is no surprise so many are wrong or misleading.

--
Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
1. Re:MDs do not grok statistics! by plcurechax · 2014-09-10 04:38 · Score: 1
  
  ... As basically no profession has a good grip on statistics except [statisticians], ...
  There I fixed it for you.
2. Re:MDs do not grok statistics! by gweihir · 2014-09-10 16:10 · Score: 1
  
  You did fix exactly nothing. A statistician can also work in applied statistics and then the understanding is not required. So you basically BROKE the statement. Pathetic.
  
  --
  Most ACs are not even worth the keystrokes to insult them. Be generically insulted by this and ignored otherwise.
Re:Wooah! by jensend · 2014-09-10 03:45 · Score: 1

But frequentist analyses aren't any more "objective," they just hide biases from view and include inductive biases that aren't even rationally compatible with any consistent state of belief.
With Bayesian analysis your starting point is out in the open and must be justifiable and defensible; analysts are accountable for their priors.
You can also, of course, examine what would follow from several different priors. This is much more straightforward than trying to shake the hidden biases in a frequentist model.
Re:Wooah! by Anonymous Coward · 2014-09-10 03:54 · Score: 1

Try implementing that in practice on actual analysis of data.
I do it every day. Works perfectly.
An infinitely wide distribution function literary mean 1. F(x) = 1.
Do you need an example.
I have some data
x y 0.188334 2.08939 0.400133 2.26874 0.723409 2.31389 0.172104 2.00783 0.430118 2.28716 0.245059 2.03828 0.0494027 2.04421 0.221342 2.15249 0.911822 2.4686 0.461583 2.24511
I have a hypothesis that this data is linear. y = m*x + b. I want to know that probability that the parameters of m and b explain my data. I know nothing about m and b so I will set P(m)=1 and P(b)=1. Any value of m from -Inifinity to Infinity is equally probable. Any value of b from -Inifinity to Infinity is equally probable.
I want to find the probability that a line with a value of m and value of b fits my data. I need a probability distribution function that my data is explained by my evidence.
P(x_i,y_i|m,b) = Exp(-(y_i - (m*x_i + b))^2/0.01^2) A Gaussian distribution with sigma for ~10% cause my data has noise in it.
Now my
P(m,b|x_1,y_1,x_2,y_2,...x_n,y_n)=Product[Exp(-(y_i - (m*x_i + b))^2/0.01^2)]*1*1=Product[Exp(-(y_i - (m*x_i + b))^2/0.01^2)]
Look at that my uniform prior worked perfectly. At the point where P(m,b|x_1,y_1,x_2,y_2,...x_n,y_n) is maximum, a line with corresponding values of m and b best describes my data.
Re:Wooah! by ColdWetDog · 2014-09-10 04:10 · Score: 1

There's no universal method for constructing a prior. It's a big source of potential error.
Don't they just have to go into an monastery or something like that?

--
Faster! Faster! Faster would be better!
Re: Wooah! by Stickasylum · 2014-09-10 05:21 · Score: 2

Uniform probability in what scale, though? Performing a Bayesian analysis with a uniform prior will generally give different results than, say, using a log scale on the dependent variable(s) and choosing a uniform prior on *that* scale. The Jeffreys prior provides a method of computing a non-informative prior that is invariant under re-parameterization, but is generally difficult to work with, and is never a uniform prior. So yeah, the concept of an uninformative prior is more complicated than "just use a uniform distribution", and analysts need to be especially careful with priors used with small sample sizes!
Re:Wooah! by Anonymous Coward · 2014-09-10 06:06 · Score: 1

The problem is the Bayesian analysis is far from conclusive. What it does point to is that the clinical trial needs a larger sample size. Sample sizes that are too small are useless.
Conversely if the sample size is too large, the cost of trials, typically a few to a few tens of millions of dollars, will go up significantly. The reflex answer to that is "so what, Big Pharma can pay for it!", except that many new therapies are developed by small companies, especially drugs for conditions with a small patient sample. For example, the condition used as an example, systemic sclerosis, has an incidence of about 1 in 100,000 in the US; it's estimated about 50,000 people in the US suffer from it. I've seen clinical trials with a 46 patient sample size cost upwards of $10M, because the condition is small in incidence so even just finding patients willing to participate is very tricky. If you upped that number to 450 patients, the cost for the trials would not justify developing a drug helping these people; the developers would lose money and these people are left to suffer. So there's a bit of a balancing act that is not just statistical significance, but also cost/benefit.
Re:Wooah! by NotSanguine · 2014-09-10 06:24 · Score: 1

Almost had me there article! Until you said the most evil words known to man... "statistical technique". AKA "bullshit"

Anyone who cannot cope with mathematics is not fully human. At best he is a tolerable subhuman who has learned to wear shoes, bathe, and not make messes in the house.
--RAH

--
No, no, you're not thinking; you're just being logical. --Niels Bohr
Statistics by manu0601 · 2014-09-10 14:35 · Score: 1

Please push a serious statistics course through the throat of medicine students, that would benefit everyone health!
Ultimate Argument for Reproducibility by fygment · 2014-09-11 00:10 · Score: 1

There have been recent cries for reproducible results in science.
The scope is too limited.
There should be a cry for reproducible results in any research prior to its publication.
Long and short of it for researchers: if only you can get the results and conclusions, then the results and conclusions are not publishable.

--
"Consensus" in science is _always_ a political construct.