Dozens of Recent Clinical Trials May Contain Wrong or Falsified Data, Claims Study (theguardian.com)

← Back to Stories (view on slashdot.org)

Dozens of Recent Clinical Trials May Contain Wrong or Falsified Data, Claims Study (theguardian.com)

Posted by BeauHD on Monday June 5, 2017 @03:30PM from the suspicious-reporting dept.

John Carlisle, a consultant anesthetist at Torbay Hospital, used statistical tools to conduct a review of thousands of papers published in leading medical journals. While a vast majority of the clinical trials he reviewed were accurate, 90 of the 5,067 published trials had underlying patterns that were unlikely to appear by chance in a credible dataset. The Guardian reports: The tool works by comparing the baseline data, such as the height, sex, weight and blood pressure of trial participants, to known distributions of these variables in a random sample of the populations. If the baseline data differs significantly from expectation, this could be a sign of errors or data tampering on the part of the researcher, since if datasets have been fabricated they are unlikely to have the right pattern of random variation. In the case of Japanese scientist, Yoshitaka Fuji, the detection of such anomalies triggered an investigation that concluded more than 100 of his papers had been entirely fabricated. The latest study identified 90 trials that had skewed baseline statistics, 43 of which with measurements that had about a one in a quadrillion probability of occurring by chance. The review includes a full list of the trials in question, allowing Carlisle's methods to be checked but also potentially exposing the authors to criticism. Previous large scale studies of erroneous results have avoided singling out authors. Relevant journal editors were informed last month, and the editors of the six anesthesiology journals named in the study said they plan to approach the authors of the trials in question, and raised the prospect of triggering in-depth investigations in cases that could not be explained.

21 of 66 comments (clear)

Min score:

Reason:

Sort:

Thanks for that! by ls671 · 2017-06-05 15:55 · Score: 4, Insightful

Thanks for that! Now I can use that tool to generate data for my upcoming fabricated studies.

--
Everything I write is lies, read between the lines.
"90 of the 5,067" by Nutria · 2017-06-05 15:56 · Score: 4, Insightful

That's... less than 2%. Naturally, we want it to be 0%, but 1.8% is nothing to generate scare headlines over.

--
"I don't know, therefore Aliens" Wafflebox1
1. Re:"90 of the 5,067" by hey! · 2017-06-05 16:08 · Score: 3, Funny
  
  You stole the words right from my mouth: 90/5067? That's significant at the p < 0.02 level!
  
  --
  Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
2. Re:"90 of the 5,067" by ShanghaiBill · 2017-06-05 16:22 · Score: 5, Insightful
  
  That's... less than 2%. Naturally, we want it to be 0%, but 1.8% is nothing to generate scare headlines over.
  They only caught the dumb ones. It would have been easy to generate fake data that fits a known distribution. For instance, in Python, just use numpy.random.normal instead of numpy.random.uniform.
  The 2% is just the floor. The actual fraud and/or incompetence rate is likely higher.
3. Re:"90 of the 5,067" by SharpFang · 2017-06-05 19:06 · Score: 5, Informative
  
  Seems like artifact of randomness - Prosecutor's Fallacy.
  Yes, some will be genuine falsifications. But some WILL be genuine results.
  You write a paper on a list of 1000 tosses of a coin, noting each result. The chance for the coin to land on edge in one toss is around 1 in 100,000.
  Then your paper is reviewed along with 100,000 others. If you have the coin land on edge more than once in your dataset, it's flagged as a falsified dataset.
  Roughly 10 papers in the 100,000 tested flagged as falsified will be false positives.
  ------
  Statistical results are subject to the same randomness as single tests contributing to these results. The scale of the randomness is reduced by a factor related to the number of tests, but still exist. And take enough correctly obtained statistical results, and you WILL find outliers.
  
  --
  45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
4. Re:"90 of the 5,067" by joneil · 2017-06-06 00:31 · Score: 2
  
  One other thing to consider.
  I am a funeral director. I see deaths first hand from medical mistakes and malpractice. In fact, I only see that "2%".
  You know that old joke "an undertaker is somebody who cleans up the doctor's mistakes"? Well, more truth to it than you might want to know.
  So, for all you people who say "it's only 2%", you come, sit with me when I deal with a family who has had a death because they were part of that "2%". You look them in the eyes, and say" hey, science is still good, it is less than 2%".
  Or better yet - when one you who thinks 2% is nothing, when one of your family members has a serious issue or dies, you look yourself in the mirror and say "hey, it's okay, it was still below 2%."
  Maybe OT, but it's like my uncle the aircraft mechanic used to say "5% or 2% doesn't sound like much, until you are 10,000 in the air and you suddenly find out your service tech on the engine of the airplane only got 90% of the rebuild done correctly."
5. Re:"90 of the 5,067" by Nutria · 2017-06-06 01:18 · Score: 2
  
  See, or... statistically estimate?
  
  --
  "I don't know, therefore Aliens" Wafflebox1
6. Re:"90 of the 5,067" by TechyImmigrant · 2017-06-06 05:20 · Score: 2
  
  >You can't fake the expected results if you use a RNG
  Yes you can.
  >Also, RNGs don't generate Normal results.
  They most certainly can. Look up the Box Muller method or the Ziggurat algorithm.
  >They generate random results.
  If the designer knew what he or she was doing. Usually they don't.
  >Most real-world "random" events are not "random", but are "normal".
  That depends on how you measure it. Poisson distributions are an example of something you can force by choosing your measurement method.
  >So a computer-generated RNG would fail to make reasonable results.
  A computer properly programmed is quite capable of generating any of the usual distributions you desire and the result will be statistically indistinguishable from other data conforming to the same distribution.
  >You need to overlay a normal distribution on your RNG.
  See above. Box Muller and Ziggurat.
  > It's easier to just guess results, using your brain as both the RNG and normal distribution.
  No it isn't. Humans are awful RNGs.
  
  --
  I should use this sig to advertise my book ISBN-13 : 978-1501515132.
7. Re: "90 of the 5,067" by alvinrod · 2017-06-06 07:55 · Score: 2
  
  Not quite, because knowing the odds of winning with cheating by itself says nothing. It's like having over one thousand people play, but always concluding that the winner cheated because the odds that they could win (.1%) are so small that they must have cheated, without considering that when you have that many people playing, even though the odds of winning are individually low, there are enough people to make it likely that at least one person wins.
  
  If you do a binomial calculation with .1% chance of success it only takes 693 people playing before there's a 50% chance of seeing at least one winner. The fallacy is mistakenly believing that it's 99.9% likely that a person cheated to win instead of something much, much smaller if we use your 1:100 odds of winning with cheating above. Even if cheating guarantees a win, then it's still only 50% likely at most (remember with 693 people all playing fair, we expect at least one winner 50% of the time) so you're basically convicting on a coin flip. Now if on the other hand there's a lot of other evidence to suggest that a person cheated, then it can valid to use the low likelihood of success in support of that other evidence. But if there's absolutely nothing else to support a claim of cheating, then it isn't reasonable to assume a person cheated based on low likelihood of individual success when there are a large number of players.
Outlier data by Vitriol+Angst · 2017-06-05 17:12 · Score: 3, Interesting

So 90 of the 5,067 were outlier-like data and this is concluding results on these outliers.
I knew that publish or perish was ruining science, but this is actually the most heartening news I've heard of its credibility.
I've learned that less than 1.8% of these studies used non-extreme crazy data. My faith in science is restored!

--
>>"ad space available -- low rates!!!"
Unbiased followups needed by Tablizer · 2017-06-05 17:35 · Score: 2

A study should be repeated by an org that has no skin in the game per results. They should be paid to test and get the same amount of compensation regardless of outcome. A random lottery should decide the head managers/researchers for any given repeat.

--
Table-ized A.I.
Re:Only in Clinical studies ..... by religionofpeas · 2017-06-05 18:14 · Score: 3, Interesting

Luckily we can trust those implicitly, especially the model based ones.
Trust is not required. Full source code and input data is available for your inspection and verification.
Re:Only in Clinical studies ..... by quantaman · 2017-06-05 19:02 · Score: 2

It is fortunate that fraud (or incompetence) like this never occurs in other areas. For example think of the implications of this happening in Climate Science papers and studies. Luckily we can trust those implicitly, especially the model based ones.
Because when confronted with the evidence that 90/5,067 studies in one field (likely) contain fabricated data the obvious implication is that an entire field is fabricated?
Your conspiracy theory seems to be missing a few steps.

--
I stole this Sig
Re:Is anyone surprised? by Dunbal · 2017-06-05 23:40 · Score: 3, Insightful

Those are only the ones that are easy to prove fake. There has been a lot of research over the years whose results simply cannot be reproduced even in an identical experiment. Big Pharma has been caught red handed several times now - at one point even publishing their studies in their own "peer review" magazine.

--
Seven puppies were harmed during the making of this post.
Re: Only in Clinical studies ..... by KGIII · 2017-06-06 01:29 · Score: 3, Insightful

Hmm...
I am not a climate scientist. I am a retired scientist. What did I do? I modeled traffic. As strange as it might sound, there is a lot of similarity between the two. I will try to give some history, as it may help this make more sense. Sorry for the lack of brevity.
In my case, I helped bring traffic modeling to the age of computers. In this process, it was learned that you could improve the model results, significantly, by increasing the amount of data available. Even seemingly trivial things can impact throughput. Simple things, such as signage fonts, can impact throughput. Even the frequency of lane markings, reflectivity of lane markings, and coloration all have an impact on throughput.
To try to put this in perspective, I was working with data sets in the full TB size, before the turn of the century. We did distributed computing, before it even really had a name.
Why is that important?
Well, traffic is a bit like climate. It is a chaotic system. To be clear, a chaotic system is not a system that is random. It appears random but, with more data, you can tease out patterns and make deterministic predictions based on a variety of variables, with some levels of consistency and success.
I am not suggesting, for the record, that the climate science models are 100% accurate. In fact, they have confidence ratings. That goes underreported, but they will tell you how confident they are in the results.
Anyhow, that's besides the point. I just want to make it clear and avoid confusion.
What is important is that you have to massage the data. You have to make corrections to the data. You have to remove outliers.
See, we'd collect data and then run it against the models. We'd compare the model output with what was really happening. Sometimes, the results are pretty close. This means you can have greater confidence in the results. Sometimes, it isn't even remotely close.
At that point, you usually start by poking at the model itself. However, you will also poke at the data. You will throw some of that data right into the trash. You will normalize the numbers, and adjust the impact factor. You will also probably swear, like a lot. You will invent whole new languages, just to swear in them.
Either way, you will massage that data until you get the results that most closely match reality. You take existing data and run your models to see how well they match reality. When you get it to the point where you're confident, you use those methods to make predictions about the future, given new variables. This will have varied confidence levels, and pinpoint accuracy isn't expected by anyone versed in the science.
The truth is, you can model all you want but some drunk guy is still going to drive, in reverse, the wrong direction down a one way street. So, you only have so much confidence in the predictions.
The whole point is, you have to massage the data. If you don't, you get horrible results that don't match reality. The expected outcome isn't certainties. The expected outcome is predictions for which you can assign a confidence level.
I suspect part of the problem is poor communication and bad journalism. I've taken some time to examine the models, methods, and reasons. I am not a climate scientist, but I have taken a reasonable amount of time to study it in a scholastic manner. You can download their data AND their models, for free, and run them yourself. You can massage that data any way you want, too. You can apply all the adjustments you want and run the models yourself - for just the cost of hardware you already own and electricity.
Anyhow, I hope this clears a few things up. Correcting and massaging data is pretty normal. It's pretty much required, if you want meaningful results. I am pretty sure the uncorrected data sets are also available. You can get so many data sets, for free. They'll even give you the models. Hell, they'll even give you the source code for the models.
I do want to make it clear, the goal isn't a perfect pre

--
"So long and thanks for all the fish."
This won't be a problem fir much longer by Nyckname · 2017-06-06 02:00 · Score: 2

Scott Gottlieb, the current head of the FDA, wants to end drug trials. "The free market will put the bad actors out of business."
Re:Most modern 'research' is fraud by ceoyoyo · 2017-06-06 02:01 · Score: 2

That's not fraud. Most of those studies are primary biology or animal studies, non-blinded. They tend to have sample sizes of around 10, and use sketchy stats. It's not particularly surprising they can't be replicated.
The stats should be improved, and they need to be more cautious in their conclusions (as does anyone reading them), but the scientific literature is supposed to be more about "hey guys, look at this, what do you think?" and less about "this is the truth!".
Re: Study May Contain Wrong or Falsified Data, by fustakrakich · 2017-06-06 02:56 · Score: 2

How the hell does he even have that many publications?!?
Probably doing a study on the effects of amphetamines.

--
“He’s not deformed, he’s just drunk!”
Marcia Angell & Skepticism on Mainstream Scien by Paul+Fernhout · 2017-06-06 03:12 · Score: 3, Informative

http://pdfernhout.net/to-james... "The problems I've discussed are not limited to psychiatry, although they reach their most florid form there. Similar conflicts of interest and biases exist in virtually every field of medicine, particularly those that rely heavily on drugs or devices. It is simply no longer possible to believe much of the clinical research that is published, or to rely on the judgment of trusted physicians or authoritative medical guidelines. I take no pleasure in this conclusion, which I reached slowly and reluctantly over my two decades as an editor of The New England Journal of Medicine. (Marcia Angell)"

--
A 21st century issue: the irony of technologies of abundance in the hands of those still thinking in terms of scarcity.
Re: Only in Clinical studies ..... by ventsyv · 2017-06-06 06:58 · Score: 2

Not OP but I'll give you a better example. Let's say you are processing the temperature data of all of the US over 20 years, tens of thousands of weather stations with multiple readings per day. It's likely that some of those malfunction every now and then. How do you know which readings are correct and which are not? Well you can eliminate the outliers. You take an area and you look at the temperature readings - if they are all around 75F plus or minus 3 degrees, but you have one that's showing 90F, there is pretty good change that one station is busted. That's exactly how they deal with the urban island effect. Temperature readings in cities are higher due to high area of pavement/concrete, so instead they use the readings from a rural area nearby. In the end you don't care about the ups and downs, you smooth those over and you look at direction of the average. I
Re: Only in Clinical studies ..... by KGIII · 2017-06-06 09:16 · Score: 2

I am not a climate scientist - I feel this needs to be made clear. I have made a quasi-scholastic study of climate science, largely to see for myself what the fuss was about. I've read a whole lot of papers, a whole lot of research, and watched a whole lot of talks. Oddly, I never watched the Gore movie. I prefer to listen to the scientists.
Which leads me to...
The theory behind it is pretty sound. It can be reduced to some pretty simple physics and math. If you put more energy into a system, it's going to transfer some of that energy in the form of heat. The effects of greenhouse gases are, fundamentally, well understood.
And then it gets complicated as all hell.
I am unqualified to opine on the affects from human output. However... The research that I've read makes a pretty good case for it and I didn't see (once I dug deep enough) anything that reeked of bad science. Note: Some does exist but, for the most part, I found that coming from proselytizers and journalists, as well as some who appear to have moved from academia to publicist.
There's a pretty big difference between what I read for research and what is parroted by those who have decided men in lab coats are the new priests. The research is more nuanced and actually lists things like confidence ratings. Where the research may say 2, 4, or even 6 standard deviations, the journalists will take even the weakest confidence conclusions and present them as factual certainties. And, of course, this gets parroted and treated like the kid's game of Telephone.
What baffles me is the amount of alarmists who claim to support science while parroting bad science - or just plain inaccurate science.
Not that long ago, I had someone telling me that the oceans were going to rise 27' within the next fifty - and that this was a fact. So, I handed down citations and demonstrated that there's actually no research supporting such absurdities. They immediately decided I was anti-science and a Trump voter. I am still dumbfounded.
I, who seems to actually understand the science to some extent, don't deny that the climate is changing. In fact, it's pretty evident that it is changing. Hell, I can even understand the science that suggests it is caused by releasing CO2 into the atmosphere. Yet, because I didn't toe the line - and agree with their kinda strange assertion, I was lumped in with the deniers and, even more strangely, assumed to be a Trump voter.
I don't know? I just don't know, at that point. I will say, I can sure as hell understand why people would be skeptical. I can sure as fuck understand why people would negative in their responses. The oceans aren't gonna rise 27' in the next fifty years. Not even the most dire of predictions suggests that.
Again, I'm not a "denier." I've done a lot of reading and researching. I'm not a client scientist, but I am a scientist. I didn't see anything wrong with their conclusions - and there are many, many studies. Some of those studies have data that hasn't been manipulated, to any great degree (as far as I saw). There's a whole lot of research on this.
If I were qualified to opine, I'd suggest that they're on the right track and that we are releasing enough CO2 to have an impact. How much? What will the results of our impact be? Is there a tipping point? Those are things I can't really say. I can say that it does look like the science is fundamentally good.
Why yes, yes I am being careful with my verbiage. ;-)

--
"So long and thanks for all the fish."