Positive Bias Could Erode Public Trust In Science
ananyo writes "Evidence is mounting that research is riddled with positive bias. Left unchecked, the problem could erode public trust, argues Dan Sarewitz, a science policy expert, in a comment piece in Nature. The piece cites a number of findings, including a 2005 paper by John Ioannidis that was one of the first to bring the problem to light ('Why Most Published Research Findings Are False'). More recently, researchers at Amgen were able to confirm the results of only six of 53 'landmark studies' in preclinical cancer research (interesting comments on publishing methodology). While the problem has been most evident in biomedical research, Sarewitz argues that systematic error is now prevalent in 'any field that seeks to predict the behavior of complex systems — economics, ecology, environmental science, epidemiology and so on.' 'Nothing will corrode public trust more than a creeping awareness that scientists are unable to live up to the standards that they have set for themselves,' he adds. Do Slashdot readers perceive positive bias to be a problem? And if so, what practical steps can be taken to put things right?"
Right? isn't that what American schools and TV have been teaching for the last 30 years? Nerds aren't cool - facts are open to interpretation - everyone is special - you can eat more than you grow... When you have a society rewarding irrationality, what do you expect? Rigorous science?
I want to delete my account but Slashdot doesn't allow it.
Positive Bias is another word for Group Think. I guess it could also mean deception
There are "studies", and then there is observation, modelling, prediction, model testing which is this thing called science. "Studies" are bullshit. Scientific research functions as it should. I believe the OP's article is just a chunck of sensationalist BS, or utterly ignorant of what science is (and is not).
'Nothing will corrode public trust more than a creeping awareness that scientists are unable to live up to the standards that they have set for themselves,' he adds.
No, the corrosion of public trust is the incessant idiocy coming from Fox and other Murdoch properties exclaiming "oh those silly scientists got it wrong again!" when the story is about a refinement of a model or something.
Scientists are losing the credibility war because scientists are not PR flacks and are unable to counteract the "we don't have to report actual news, we got a court order saying we don't" assholes at Fox.
There is a concerted effort to discredit scientific research no matter what it is.
--
BMO
I received my PhD in physics, and the thesis was measuring a number, in which I measured zero within the error bar. Not particularly interesting, but valid science. My wife was in a PhD program in Biology, she also did valid science, novel measurement technique, came up with an uninteresting result, therefore was not able to publish, therefore was unable to graduate. It would have been extremely simple to fudge the result to a 2-3 sigma result 'hinting' at an interesting answer, which would have gotten published. I think certain sciences have gotten to a point where they have forgotten that if you do valid work in a novel way, then that is science and you should not be punished for the conclusion of the measurement. Most measurements you do of the natural world should probably end up being unsurprising, and thus uninteresting, but you don't graduate or get tenure with those kinds of results. I think this is the mechanism for the positive bias. That is why I do not take results from certain branches of science at face value.
The solution lies in a reformation of research finance that is not focussed on how many papers X published compared to Y, but also takes into account whether they are consequential or not and if they actually comply with at least basic scientific attributes such as repeatibility, verifiablity, falsifiability, accessibility of all data and all conducted research, as well as actually conducted verification of research by independent third parties.
There should also be an outright condemnation of data mining, where data bases are checked only for the existence of attributes and correlations that happen to affirm the researchers opinion and leave all others untouched.
Fields like economics, medicine and climate have long since deteriorated to mere cargo cults due to those failings.
Some sort of redirection towards findings that can be verified by independent labs would seem to be an improvement on the current system. But that would require a focus non science as a system and less of the "great researcher" emphasis we see today.
They're overstating precision. Which rather then a forgivable error is an elementary mistake no trained scientist should ever make.
A VERY basic concept they teach at the lowest level of science education is the distinction between accuracy and precision. This is science 101.
Accuracy is whether or not a given conclusion is correct.
Precision is to the degree of specificity.
Typically you run into problems on complex subjects because they overstate the precision of their data or their ability analyze the data.
This can boil down to simple thinks like significant digits.
For example, I'm measuring volume to two significant digits in a giant data set with thousands of measurements. When and if I average those numbers the final average can't have more then two significant digits. That sounds elementary but you see this error made on some big studies. You'll have a situation where something is being measured in a crude sense by many sources and then in the analysis a much higher degree of specificity is implied.
Often that degree of specificity is required to make certain conclusions which is why they break the rule. This is lazy and a breach of scientific ethics. What they need to do is collect the data all over again this time to the level of specificity they need.
Simply saying its too hard to collect the data properly so they're going to make assumptions is not reasonable or ethical. I suppose you could do it so long as you kept an asterisk next to the data and the findings to make it very clear throughout that the conclusion is a guess and not in any way empirical science since at some point people were guesstimating results.
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
When you have a hammer, everything looks like a nail.
Anything can become a religion, as a result. We're less critical of our data when that happens, and we "nudge" it into place.
The problem is not "science" per se but our social approach to it.
Science is a method dingbat, anyone who puts faith in a scientist however is practicing demagoguery.
Practice science, not demagoguery.
Like just about everything else in this world, science is about money. And how do you get money in science? By finding and/or hyping the next leap forward. Being successful in science is all about getting grants. You don't get tenure without bringing in grant money, you don't get grant money without publishing in the best journals, you don't publish in the best journals without finding the next leap. Your typical PhD finishes school in their late 20's, probably with significant school loan debt. He or she then gets a postdoc where they can barely afford to live in the city with the prestigious school that they think they need to further their career. At the same, it's probably time to think about starting a family (especially if you are a woman). And as a postdoc, the pressure to publish is even greater in a more compressed time frame. There is so much pressure there financially, emotionally and mentally, that is it is no wonder that some people cave and take shortcuts and fudge results. And then, if you do make it to a tenure track position, you don't do much science any more. Instead you spend all your time writing grants and churning through postdocs, who may or may not be fudging their results to get a recommendation to get a better position...
In engineering research, there is definitely a positive bias; in fact, negative results are rarely published at all. This is both because negative results have less sex appeal than positive results and because peer reviewers are trained to outright reject publications without positive results. Although there is huge pressure to publish positive results, I'm not aware of systemic fraud in the literature. What does happen, however, is roughly this: 1) researcher gets great idea. 2) researcher tries idea. 3) idea fails to produce state-of-the-art results. 4) researcher adds hacks and kludges to marginally improve performance. 5) repeat steps 2-5. So, what you get in the end are journals filled with "positive results" that mean nothing and a bunch of "scientists" who make a living doing things that do not really resemble science at all.
I've not seen it put more concisely than that yet - well done.
In relation to your second line: Practice of science is more engineering that science -
In theory there is no difference between theory and practice, but in practice there is.
blog.sam.liddicott.com
And ignores the fact that once published, there's no reliable corrective mechanism to propagate those results down beyond a standard literature search. I'm posting as AC because quite a few years ago I published results that I believed at the time to be correct, but were shown to be wrong in a subsequent paper. Despite this, I'm *still* being cited in new papers while the paper that refuted mine is seldom cited. Science isn't some infallible field. We make mistakes; Sometimes those mistakes are accidental, sometimes they're sloppy, and yes, sometimes they're even intentional. That doesn't reduce the validity of science, but it requires us to be more vigilant.
The case of the green jelly beans.. http://xkcd.com/882/
Atlas stands on the earth and carries the celestial sphere on his shoulders.
Crap. I only thought I was posting as AC. Doh! :)
LOL.
"First they ignore you, then they ridicule you, then they fight you, then you win." -- Mahatma Gandhi
I remember a few days ago someone submitted a story about piracy for "The Avengers" being low compared to potential profits from them. A few high-ranked comments were like "This is yet another proof that [insert common /. parlance here]". I saw very few comments that stated the most plausible reason: a camcorded action film, with crappy audio and a shaking image, can't compete against the real thing. I thought the same thing: confirmation bias.
People do it all the time. If something can somehow support their views (specially if they don't RTFA) they'll use it as yet more confirmation. "I still don't get why this piece of evidence is discarded by everyone else! They must be delusional or have bad intentions". For example, I imagine this article will be used as evidence for: lack of funding, falling standards in the US, the demise of education, lack of scientific reasoning (maybe they'll even extend it to scientists themselves), and other common /. utterances. I wonder how many of them will actually say what I found out after RTFA...
So, everyone is playing the same game, and scientists are no exception. But hey, that study has numbers on it. At least you can try to replicate the findings, if only the entry barrier wasn't so high: these tests are *hugely* expensive. More collaboration may be a good idea. Shared laurels are better than none, right?
P.S., a nice article on confirmation bias (and other goodies) here.
I rarely respond to comments. Also, don't ask for clarifications: a brain and Google are faster, believe me!
Actually, science is stll working; the real trouble comes with the publicity of the science.
You should never believe the results of any single study. Every scientist knows this; or should know this. Science comes when results are confirmed, not when somebody publishes the first paper. The real work of science just starts when somebody publishes a study saying "we show that x has the effect y." That initial paper really is no more than "here's a place to start looking." However, newspapers want to publish news, and they need to publish whatever's hot and interesting and being done today, not "well, scientist z had his team take a look at the xy phenomenon to see if there was anything interesting there, and they couldn't really find anything there, although maybe some other research lab might have different results."
And, I suppose that somebody should post a link to the obligatory xkcd: http://xkcd.com/882/
http://www.geoffreylandis.com
Saying 'science is a method' is like saying 'Christ was a Jew'. True, but it doesn't change what happened.
Science was an idea designed to seek empirical truth. To find things in such a way that those who followed after could find them again. Then people got a hold of it and started using it as a means to control one another.
Christ (even from the atheist point of view, so bear with me) had a simple message of love being service to your fellow man. Then people got a hold of it and we get monstrosities like the Crusades.
That's where the 'HAS become' part of the above phrase kicks in...
It's just you. Corporate R&D produces products. Sometimes incremental engineering improvements. Not science.
Corporations USED to do some decent science, even more or less basic science. But not anymore.
Okay, it may not be exactly a myth. We can't tell. I strongly suspect there is actually a negative publication bias.
What most people think are "negative" results are actually inconclusive. A non-significant p-value is NOT a negative result. That misunderstanding is very widespread, and leads to lots of high level mistakes. Half of the neuroscience papers published in top journals including Nature the last two years that could make a mistake based on that fallacy, did. And neuroscience didn't seem to be particularly worse than most other fields.
A non-significant p-value is just that - not significant. Inconclusive. Getting an actual negative result is considerably more work than getting a positive one. You need to figure out what the minimum effect size you're interested in is (you should do that for positive results too, but almost everyone just uses zero for that) and show that your confidence intervals do not include it. As Ionnadis points out, you really should consider the power of your study as well (also for positive results), and take a stab at estimating the priors too.
If you go and do all that, and also do a quality experiment, in my experience you actually have a pretty good chance of getting published, because a) it's clear to the reviewers you've done a really thorough job (any idiot can run some data through a t-test and get a p-value, negative results are harder) and to show a negative result your study is probably much higher powered than a positive result one, meaning an impressively big p-value.
The problem is not that there's a positive publication bias, it's that most scientists don't know how to show negative results so there are very few negative papers around.
I think that's a classic keyhole problem.
Private companies have a shit ton of marketing and PR.
90% of all projects in large companies fail. 10% succeed. The marketing never mentions the failure.
In the government, 90% of all project are successfully, 10% fail. Because the PR(news media) only focuses on the negative, the success don't get talked about.
This is the same thing.
.
The Kruger Dunning explains most post on
If you have a personal web site or something similar, it may be a good idea on your part to make that clear and link to the refuting paper. It would take some guts on your part, of course, because it could negatively affect your career, but it would probably be the right thing to do.
I'm a scientist. It's a big problem.
Here's how you fix it:
The metric for success for both researchers and their government funding sources is published papers. It's hard to change the cultural view of the scientific community, but it's easy to change government metrics. Paper publishing as a metric is easy to track between programs, but has had a terrible impact on scientific culture. It's also led to a large bias in how the government decides what areas to fund. If your metric is paper publishing and you're looking at energy issues, do you fund a sub-field with a historic high paper publication rate, a moderate paper publication rate or a low publication rate? It's fine for us here to say we'd fund the best research, but a government program manager may lose his job for picking a field with the lower publishing rate.
Other metrics such as how many other researchers use some results or whether a practical implementation of some new technique is developed will be harder to judge and take a longer time to evaluate, but would at least give us an honest assessment of the quality of government funded research. Tie future funding to what our broader society is looking for out of science, and eventually the scientific culture will follow.
There's nothing wrong with science itself. But there are many barriers against doing science correctly. I can't say for sure if the barriers have gotten worse in recent decades. But I think yes, it has become harder to do good science. Why?
The Space Race and Bobby Fischer's day in the sun are long gone. They were good contests, and well-liked, with very positive outcomes, especially when compared to the alternative of nuclear war. In a way, it makes up for science having discovered The Bomb. It was possible that no one would make it to the moon. We were not going to suffer big losses if that had been the outcome. As for Fischer, he looked more and more unstable as the years passed.
But one thing about The Bomb: it demonstrated the power of science, even if very darkly. After that, the Space Race demonstrated the power of science again.
And now? It's really hard to top The Bomb and The Moon. But, by some measures we have. We have the Internet, which has been, among other things, a virtual nuke to the traditional business model of Big Media. Perhaps that's too subtle for the more cretinous among us.
That's what I think is a big factor in the current state of science. We're still in the letdown following the big high of the moon landing, and the awesomeness that is the Internet has not been appreciated enough.
Some of the stretching and outright fraud is a symptom of that letdown. We're straining to meet impossible expectations. It's as if we have to top the Moon to impress the public. One of the more crudely obvious ways to do that is to visit Mars. Send a human to Mars. Except it can't be a mere visit, it would have to be a prelude to colonization. Can we do that? Not presently.
Is there any other reason to suppose that fraud in science is any worse than in the past? Maybe. I think also that science has become more accessible, which on the whole is good. But a bad consequence is that it's easier for quacks, cranks, and cons to fake it.
Finally, there is motivation for the increased fraud: bias against bad news. Right now, one of the loudest message scientists are giving is AGW. The frauds have seized on public dislike of that negative message. They've been riding high ever since Big Tobacco showed them the way with "Doubt is our product", and they've corrupted and captured the social conservatives. The Republican Party has fallen a long way from the sober fiscal pragmatism of the 1950s. In those days, they were the sane, careful counterbalance to the misty and foolish idealism of the Democrats. The book "The Republican War on Science" would have been laughable in the 1960s. Most "squares" were Republicans, while many Democrats were dirty, drug addled, brain rotted, uneducated hippies. No one would have asked if a Republican president was our worst ever, as Rolling Stone did. Now, the Republicans are the crazies in denial.
Intellectual Property is a monopolistic, selfish, and defective concept. It is "tyranny over the mind of man"
I work in optics and computer vision, and investigate research papers on a regular basis. In the majority of the papers I read the methods proposed either only work for the specific example in the paper (some don't even do that), or only work under such "ideal" circumstances that they barely work under lab conditions, much less real world cases. There are a few gems, but you can tell that much of what is published is just written so someone gets that degree or that quota to reach tenure. Prior to working at this job, my trust in published research was much higher. Now my tendency is to treat it all as garbage until it proves itself otherwise.
Sorry, but you are asking for something that can be called the "everyone should be at the top" problem. Yes, it would be nice if everyone could understand science, scientific methods and what constitutes the difference between one result and a confirmed, peer-reviewed body of research.
It isn't going to happen that way.
What we need is to stop publishing unconfirmed, preliminary results in popular media. That by itself solves 90% of the problem. So who gets to decide when something is ready for 60 Minutes, Fox and Friends in the Morning or the Today Show? I don't know, but I know it isn't the news media and it should not be the original team reporting some unconfirmed results. Just a rule that says nothing gets copied out of scientific journals until it is marked as having been confirmed by at least one independent team would go a long, long way.
Actually, he said "science is a method dingbat," whatever that means.
The importance of commas....
The cesspool just got a check and balance.
I think the real problem is the "all or nothing" approach the public tends to take.
"I found a study on climate change that is mistaken; therefor, every study on climate change is mistaken!" "I found a minor tenant of Darwinian Theory that has been modified; therefor, every theory of evolution must be a lie!"
Science has always been contentious. It's really hard to admit we're wrong.
To me it seems a perfect example of how NON SCIENTISTS have made the expectations that scientists are not able to live up to.
The report was basically "Hey, guys, this looks odd, but we've corrected for everything we could think of. You may want to see if you can replicate this".
Nobody else managed to replicate.
MEANWHILE the original posters were being told how they had it wrong. They changed their process to accord for this and retained some extra difference, despite this. Another attempt by another team showed no effect, so the original scientists CONTINUED to see what could be the cause if not FTL neutrinos.
They then found that the 50ns difference can be accounted for if the connectors were not tight.
NOT, at that time, that this was the cause of the discrepancy, but that
a) the couplings were loose NOW, but they didn't check THEN, could have loosened since then
b) an effect of this is a slight delay, enough to remove the difference they'd seen
All absolutely fine and no "positive bias" AT ALL.
But what is the public saying about it? What were the press saying about it?
THAT debacle (and how it's ignored by the parties involved in it) is a perfect example of the problems science has with people who aren't scientists.
Let's all try to be careful, here.
Medical research was a little late the science party, and they still have serious issues to work out. Some of these problems can't be helped (low sample sizes lead to higher p values, but higher sample sizes place more human beings at risk), others can (not being skeptical enough of research conducted by organizations with a financial stake in the outcomes).
Most of these problems are peculiar to medical research and should not be conflated with all of science. While we should hold the medical research community's feet to the fire over this, the reason we are hearing so much about this is that they themselves are talking about it and they themselves are conducting the meta-research that is producing all of these articles. Other branches of science have looked down on medical research for a long time, but at least they are now getting serious about addressing the problems.
Something that most scientists know, but that is not widely appreciated by the public, is that "landmark" studies are particularly subject to positive bias. For a study to be acclaimed as a "landmark," it is the first report of a novel phenomenon, which by definition means that it has not yet been confirmed by other investigators, and moreover that it is in some sense unexpected--which means that there is not even much "indirect" evidence from other sources that it is correct. It is also likely to be submitted to one of the high-profile "newsy" journals, like Science or Nature, where submissions are not evaluated solely on the basis of whether the science is high quality, but are first screened (and usually by an editor, before it even gets specialist peer reviewers) on the basis of whether it is of "broad interest." Because these journals are widely read and cited , they have high impact factors, so even getting published in one of these journals is a feather in a researcher's cap. In contrast, a publication in a middle-rank journal does not give a major immediate boost to a scientist's career, but may have a long-term benefit--but only if it turns out to be correct. Being "scooped" (by having somebody else publish research leading to the same conclusion) will not prevent you from being published in a middle-rank journal, but it will knock you out of the newsy journals. So the alluring prospect of publication in a newsy journal may lead a scientist to be a bit less cautious than normal, and the risk of being "scooped" means that he/she may not take usual precautions (like seeing if somebody else in the lab can reproduce the result independently). High profile journals are stringently reviewed, but peer reviewers even in these highly-regarded journals necessarily take the authors at their word, unless there are blatant errors--nobody visits the authors' labs to look over the shoulders of the researchers or to verify that negative results have not been discarded. On the other hand, if you are planning to publish a result in a middle-rank journal, you are more likely to take the time to make sure that your conclusions will hold up over the long haul.
Working scientists know this, so they take these "landmark" studies with a grain of salt until they are independently confirmed. But the public can easily confuse "high profile" with "most reliable."
And Feynman said as much in his famous "Cargo Cult Science" speech, which I encourage everyone to go read.
The problem is a lot of scientists DON'T do science that way these days. Bad science, "positive bias" as this article calls it, was rampant in the behavioural sciences when I was studying them. I basically never read a paper where they falsified their theory, or where they said things were inconclusive. They always found a way to hammer the results in to support of their theory, and none of them were ever willing to give me raw data (admittedly I only asked a few, but still).
So while journalists are part of the problem, some scientists are another. They can't ever have their theory be wrong, so they'll do whatever they can to "prove" it. Rather than trying to be the ultimate skeptic and work to find evidence to falsify it, they discard any of that and try to show how it might be true.
This is not a flaw in the system. This is a virtue of the system. Nobody wants to read 100s of papers about how bananas do not cause cancer. No scientist (at least no scientist that I know) believes that a paper proves anything. The paper is a finding worth sharing which must then be confirmed repeatedly by other scientists to be accepted as a fact.