Results Are In From Psychology's Largest Reproducibility Test: 39/100 Reproduced

39/100 is the new passing grade. by geekmux · 2015-05-01 00:19 · Score: 4, Insightful

Is there a valid reason we accept studies that have not been reproduced at least one more time to truly vet them before the community?

Logistics, resources, patents, or a need to just plain bullshit people. I'm sure there's plenty of excuses as to why we don't, but doesn't sound like we have a whole lot of good reasons why not.

And those that are labeling a score of 39/100 "not bad at all" should have their head checked. Enjoy your legal fun from that ball of lies.

Re:39/100 is the new passing grade. by queazocotal · 2015-05-01 00:30 · Score: 4, Insightful

Funding.
Assuming for the moment that the reproducers were not particularly more skilled than the original scientists, you can't go from '60% not reproduced' to '60% wrong'.
Assuming there is some actual effect being investigated, one reproduction will not get you to 'good' levels of surety about the effect. To hit '95%' - you're going to need likely over ten reproductions.
Re:39/100 is the new passing grade. by Anonymous Coward · 2015-05-01 00:35 · Score: 1

The way it's supposed to work is that your peer-reviewed paper get's published (peer-review having established that it is at least basically methodologically sound), and then people can read your paper, and if it's interesting and relevant enough then they might decide to try to replicate your results. If you can't publish first, then how are you going to find people to replicate your work?
Re:39/100 is the new passing grade. by Anonymous Coward · 2015-05-01 00:35 · Score: 5, Insightful

Please remember that this applies to Psychology, a field that is rife with lots of historical issues (and it is getting better).
In the past, they've practiced thinly veiled religion (Freud, if you don't believe me it's because my unverifiable model explains it, and you are in denial)
They've overstated their findings, to the tune of "and because of this experiment, we can extrapolate that EVERYONE is just the same!" (Stanford experiment, one of the ones with reproduciblity problems too, coincidentally).
They've put up roadblocks to proper scientific evidence gathering (and so this experiment was done before we adopted an ethical code that made verification of its outstanding results, that is reproduciblity, possible, but we are going to believe the conclusions anyway).
It was always called a "soft" Science because that way they could dodge the bullet coming from the real Sciences. However, when you read some of their works (especially their older works) you begin to see a pattern. They come from a history that probably poised them to have a long and hard road to understanding much of what we really do. In their defense they were attempting to build a model of the "mind" which is something that they assumed existed in a particular way, but didn't really test (it took a long time for Turing to come around). Finally, they are burdened with a lot of thinking that doesn't meet Philosophical rigor, because they shore it up with testing that doesn't meet Scientific rigor.
I'm glad to see the new wave of Psychology coming through. The now base a lot of their findings on biochemical analysis and stronger testing (including better attention to controls and double blind testing, which to their credit, they invented). It's just disheartening that the field lacks respect in other ways because in every intro to Psychology class they keep pushing the sensational "Dogma" experiments as facts when in reality they often fail to reproduce the results.
Re:39/100 is the new passing grade. by lurking_giant · 2015-05-01 01:05 · Score: 3, Funny

Should have had the teachers in Atlanta grade the results... http://www.nytimes.com/2015/05...
Re:39/100 is the new passing grade. by Richard_at_work · 2015-05-01 01:17 · Score: 3, Interesting

Psychology, sociology and other social sciences have always been given special treatment precisely because its difficult in some cases to get two independent groups together to rerun an experiment in the first place - and if you try and reproduce an experiment done in the 1950s today, are the results due to poor scientific method in the original experiment, or because the evidence gathered was misinterpreted, or because society has changed which means the results have changed?
Re:39/100 is the new passing grade. by geminidomino · 2015-05-01 01:30 · Score: 3, Funny

And those that are labeling a score of 39/100 "not bad at all" should have their head checked.

They did, but only 39/100 of them found anything out of whack.
Re:39/100 is the new passing grade. by plopez · 2015-05-01 01:38 · Score: 1

Because the methods seem reasonable and no one else has looked. It is the best information available.

--
putting the 'B' in LGBTQ+
Re:39/100 is the new passing grade. by ceoyoyo · 2015-05-01 01:43 · Score: 1

What do you mean by "accept?"
When you publish in a journal what you're really saying is "hey, look what I did! What do you guys think?" The point of publishing your work is to tell people what you found so they can evaluate it and try to reproduce it themselves.
If you mean that people, both scientists and the layman, shouldn't believe something is true until it's reproduced? Absolutely that would be a good idea. Even better, wait until a good meta-analysis is performed.
Incidentally, the FDA generally requires at least two large, independent trials, both of which are significant, before a drug is approved.
Re:39/100 is the new passing grade. by Anonymous Coward · 2015-05-01 01:57 · Score: 1

IOW, psychology had some 'maturing' to do. Does that mean that the social sciences will be under the same scrutiny? Like all those gender studies that are rife with: "because my unverifiable model (patriarchy) and you are in denial and a misogynist.
Re:39/100 is the new passing grade. by ckatko · 2015-05-01 02:51 · Score: 1

>Is there a valid reason we accept studies that have not been reproduced at least one more time to truly vet them before the community?

Are you talking about "Accept" as in accept for publication in a peer-reviewed journal? Because that's the entire point for a PRJ, getting other qualified scientists to read and possibly call bullshit on your findings if they can't replicate it. Engineering papers (from China!) get debunked all the time. The difference is, very few of the many stupid news papers don't create massive reporting hysteria from engineering papers. The want clickbait and "new polymer chain rotated 13 degrees" is hard to warp into "Men Vs Women" or "End of the World / This will kill you" and "New attack on women's equality."
Re:39/100 is the new passing grade. by umafuckit · 2015-05-01 03:07 · Score: 1

Is there a valid reason we accept studies that have not been reproduced at least one more time to truly vet them before the community?
I don't think it really works like this. In practice, most studies are totally uninteresting and their only purpose in life is either to not ever be read or to seed more uninteresting studies. Nothing of value is lost if they're wrong (and probably they often are). The studies that do matter are replicated because they're interesting enough that other people try to use them. So if a study discovers an interesting new effect or develops an interesting new tool then other (good) researchers jump on to the badwagon. If the original study was wrong then it'll be obvious pretty quickly. In addition, within every field it's an open secret which high profile papers are actually bollocks. It's usually obvious by just reading them. These papers are generally not contested by others for political reasons, but they are ignored because the field knows them to be crap.

--
soylentnews.org
Re:39/100 is the new passing grade. by hey! · 2015-05-01 03:20 · Score: 1

Is there a valid reason we accept studies that have not been reproduced at least one more time to truly vet them before the community?
Well, you shouldn't. It's usually the press that blows a single study way out of proportion, because they have no understanding of how science works. Science *always* generates contradictory results early on as it gets the kinks worked out of a hypothesis. This is not some kind of failure of science, it's the way science is supposed to work. A critical follow-up attempt to check on some study's results is *of course* much less likely to reach significance, because of researcher bias either way.
The gold standard for judging the state of science isn't a study, it's a review paper. This is a peer-reviewed paper, written by someone working in the field, summarizing the state of published evidence on some question in that field. These are supposed to be both comprehensive and extremely conservative in their findings.
Science is continually producing a streams of contradictory evidence. You should either pay very little attention to some new scientific idea, or be prepared to follow along in great detail over several years. But even forty years ago, when my local newspaper used to publish a whole section of science news one day a week (!!!) you could pretty much count on most of the media ridiculously overreacting to a juicy sounding bit of scientific controversy. What can you expect of today's emasculated and dumbed down reporting?
Imagine the media response if there were a study that came out that purported to show that smoking e-cigarettes was beneficial to health. It would be a media circus, but only the start of a long process in the scientific community. So rather than lighting up your e-cigarette, you should wait for the critiques and counter-studies to pile on, and then for a few review papers to come out after the dust settles. Most new ideas in science, like most new businesses, fail after a year or two.

--
Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.
Re:39/100 is the new passing grade. by bondsbw · 2015-05-01 03:31 · Score: 3, Insightful

Many sciences, not just the ones you listed, have at least some problem with reproducibility. Verification isn't nearly as sexy as coming up with a new idea.
During my academic days, all the focus was on new work and literature reviews, but only one professor seemed to (defeatedly) care about verifying the results of other researchers. That doesn't get the funding.

--
All my liberal friends think I'm a conservative, all my conservative friends think I'm a liberal.
Re:39/100 is the new passing grade. by Capt.Albatross · 2015-05-01 03:33 · Score: 1

And those that are labeling a score of 39/100 "not bad at all" should have their head checked.
At least they should have their studies replicated.
The comment that several of the failed replications were "broadly similar" but failed to reach statistical significance leads me to wonder if there has been any data cherry-picking in some of the original studies.
Re:39/100 is the new passing grade. by rockmuelle · 2015-05-01 04:08 · Score: 4, Insightful

Gah. I have mod points but want to add to this conversation.
The point of publishing is to share results of an experiment or study. Basically, a scientific publication tells the audience what the scientist was studying, how they did the experiment, what they found, and what they learned from it. The point of peer review is to review the work to make sure appropriate methods were followed and that the general results agree with the data. Peer review is not meant to verify or reproduce the results, but rather just make sure that the methods were sound.
Scientific papers are _incremental_ and meant to add to the body of knowledge. It's important to know that papers are never the last word on a subject and the results may not be reproducible. It's up to the community to determine which results are important enough to warrant reproduction. It's also up to the community to read papers in the context of newly acquired knowledge. An active researcher in any field can quickly scan old papers and know which ones are likely no-longer relevant.
That said, there is a popular belief that once something is published, it is irrefutable truth. That's a problem with how society interacts with science. No practicing scientist believes any individual paper is the gospel truth on a topic.
The main problem in science that this study highlights is not that papers are difficult to reproduce (that's expected by how science works), but that some (most?) fields currently allow large areas of research to move forward fairly unchecked. In the rush to publish novel results and cover a broad area, no one goes back to make sure the previous results hold up. Thus, we end up with situations where there are a lot of topics that should be explored more deeply but aren't due to the pursuit of novelty.
If journals encouraged more follow-up and incremental papers, this problem would resolve itself. Once a paper is published, there's almost always follow-up work to see how well the results really hold up. But, publishing that work is more difficult and doesn't help advance a career, especially if the original paper was not yours, so the follow-up work rarely gets done.
tl;dr: for the general public, it's important to understand that the point of publishing is to share work, peer review just makes sure the work was done properly and makes no claims on correctness, and science is fluid. For scientists, yeah, there are some issues with the constant quest for novel publications vs. incremental work.
-Chris
Re:39/100 is the new passing grade. by Kjella · 2015-05-01 04:20 · Score: 4, Interesting

Assuming there is some actual effect being investigated, one reproduction will not get you to 'good' levels of surety about the effect. To hit '95%' - you're going to need likely over ten reproductions.
One study != one sample. Each study should have enough cases to make it statistically significant. The problem is related to issues with the sample population or systematic flaws in what you're measuring. To bring it into the realm of physics, if we do a high school gravity experiment and ignore air resistance we can make as many tests as we like, check for measurement uncertainty in our clocks and whatnot and put up some confidence intervals that are still horribly wrong. It's very hard to isolate and experiment with one tiny aspect of the human psyche and most of the problem is the result is nothing but either a statistical fluke or quirk with the people tested that doesn't generalize to the general population.

--
Live today, because you never know what tomorrow brings
Re:39/100 is the new passing grade. by geniice · 2015-05-01 04:26 · Score: 1

Will the first issue is that about the only thing we would have got out of the Large Hadron Collider is confirmation of the Top quark.
Various space probes would also be rather limited. For example MESSANGER would be limited to stuff that had previously been found from earth or Mariner 11.
In fields like chemistry the overwhelming majority of studies produce results that are pretty much what you would expect so there is little reason not to accept them (other than the yield always assume the yield is inflated unless it is a process optimisation paper).
In a field like archaeology not only are most findings pretty uncontroversial (we found some surrey whiteware at Stafford Castle) but repeating an excavation is only going to work so well.
Re:39/100 is the new passing grade. by cusco · 2015-05-01 04:36 · Score: 1

Sounds an awful lot like that other "soft science", Economics. Some researchers are finally getting away from the 'revealed wisdom' of Friedman and company, and hopefully Chicago School Economics will soon end up in the dust bin of history with phrenology and vital humors.

--
"Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin
Re:39/100 is the new passing grade. by bill_mcgonigle · 2015-05-01 04:40 · Score: 1

Is there a valid reason we accept studies that have not been reproduced at least one more time to truly vet them before the community?
The point of papers [in real science] is to say, "we did this, here's what we found". It's not to announce a beacon of new Revealed Truth. That's largely the fault of science "reporters" looking to sell advertising space.
The papers are themselves the invitations to replicate.
The problem is the government science-funding model is largely based on fame and popularity, and doing replication studies is felt to be beneath most researchers except for the most extraordinary of claims, or those that threaten the Orthodoxy.
None of these problems will go away until the incentives of the funding model change. To assume anything else would be economically ignorant.

--
My God, it's Full of Source!
OUTSIDE_IP=$(dig +short my.ip @outsideip.net)
Re:39/100 is the new passing grade. by iMadeGhostzilla · 2015-05-01 04:40 · Score: 1

There is. Not to *accept* the study but to be on the lookout whether the pattern that the study claims to exist really exists, if it useful for you, so you can validate it first hand.
Example (not a great one but will do) -- suppose a new study that seems reasonably well done claims that drivers of black cars are significantly more prone to road rage. You drive a car, so if this pattern holds, it's relevant to you. Then from time to time when you see a black car on the road you give it a little extra attention to see if there's something that indicates the rage thing might be true. After a few "experiments" you decide for yourself.
IMO the patterns they discover would almost certainly be less than they generalize it -- perhaps the black car drivers thing is true only in the U.S. Or, it's only true in in bad economic times. Or, it's only true during the 2008 recession. And, only in suburban areas. And so on. Compared to hard sciences, the individual and group behaviors are much much less constant -- the systems are more complex, there are far more interactions, so the patterns are much less stable, a behavior hold for a while and then people move on and it never comes back. (Life sciences are somewhere in between.)
Come to think of it -- given the fickleness of those patterns, you should pay attention to a new psychological/sociological study only if it could be *very* useful to you if true.
Re:39/100 is the new passing grade. by iMadeGhostzilla · 2015-05-01 04:55 · Score: 1

I'd argue the opposite -- it's the "Physics Envy" i.e. basing psychology on the classic scientific model of objective measurement that makes it worse, and now it gets 39/100 score. And we don't even know how important those 39% that passed are -- it may be some stupid stuff few people care about while the big ones failed.
The reason is in psychology you simply can't measure reliably -- often times your measurement is asking people what they think or feel, or you observe some behaviors that depend on a thousand other factors. And we do have a first hand access to our minds, why pretend that the mind is the black box? But most importantly, the utility of psychology is supposedly to make people feel better and more meaningful, and that escapes any objectification.
I have read some Jungian literature and while it's in no way what we'd call proper science it seems very useful. (It was for me, that's the only thing I can claim.)
Btw not saying that all of the objective psychological approach is not useful, IMO it's useful for small/mechanical we do stuff (that can still be important).
Re:39/100 is the new passing grade. by HornWumpus · 2015-05-01 04:55 · Score: 1

Keep hoping. Also remember there are no Keynesians. Keynes would run a surplus in good times. 'Keynesians' are just 'print and spenders' with 'intellectual' cover circle jerking.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:39/100 is the new passing grade. by jythie · 2015-05-01 05:36 · Score: 1

"Physics Envy" is a good way of putting it. "hard' sciences are almost trivial by comparison, but tend to get more respect since they can deliver nice simple answers. Must suck to be working on really complex problems and be treated like you are not 'real scientists' by people doing the easy stuff.
Re:39/100 is the new passing grade. by amicusNYCL · 2015-05-01 05:53 · Score: 4, Insightful

I think that there is a bit of a disconnect between scientific methods and things like psychology. I think it stems from the fact that it's easy to set up a thermometer, or an EKG, or whatever else and get those discrete data points, but it's difficult to measure things like "I feel better" or "I don't think about that the same way." I have a few friends who would swear up and down that EMDR therapy helped them out tremendously, but I don't know if there's a single way to gather data that would actually quantify what happened to them. I see the results not so much as psychology being flawed, but more about the difficulty of simply gathering the type of quantified data that a scientific study would require.

--
"Our two-party system is like a bowl of shit looking at itself in a mirror." - Lewis Black
Re:39/100 is the new passing grade. by jfengel · 2015-05-01 06:14 · Score: 1

That's a really good summary of the situation.
I do think that there's one more important factor. The flip side of reproducibility is utility. The whole reason that we care about reproducibility is that it means that we can put things to use. We demand falsifiability because if it can't be put to the test, then it's not so much "wrong" as "worthless", i.e. Not Even Wrong. If it can be reproduced but never is, what did it matter in the first place?
That's not the same kind of epistemological issue that falsifiability is, but it's a bit more immediate. If this research isn't being put to use, why are we bothering doing it at all? Wouldn't our time and money be better spent on other things?
It seems as if a lot of these studies weren't worth having done. Not just because they couldn't be reproduced, but because nobody wanted to. It's the sociology of science, the dynamics of funding and defining a new field. It's a field full of questions that we want answers to, but the questions themselves are ill-posed because we don't have a solid theory in which to ask them. We're going to have to grope towards an answer, and that's going to mean a lot of missteps.
I wish we had better answers, but it does seem to me as if this hints at a need for the field to clean itself up. Rather than performing so many disconnected studies, maybe we need to stop pushing for papers that nobody apparently has an use for even if they were valid. I know that's easier said than done; it hints at completely revamping the way funding is done. But the money is being spent, and it appears that much of it is not being spent well.
Re:39/100 is the new passing grade. by ultranova · 2015-05-01 06:22 · Score: 1

Sounds an awful lot like that other "soft science", Economics.

The key difference between psychology and economics is that there's money in getting psychology correct (because it'll help you sell crap), while there's money in "proving" whatever economical hypothesis benefits the current elite.
Liars study psychology and practice economics.

Some researchers are finally getting away from the 'revealed wisdom' of Friedman and company, and hopefully Chicago School Economics will soon end up in the dust bin of history with phrenology and vital humors.

It doesn't really matter. Some other excuse for why the rich getting richer and the poor poorer is really in the best interests of those poor will take its place. And people will eat it right up, since it also excuses their dreams of getting rich themselves and lording it over the rest, who don't deserve it. And it doesn't help that Cold War mixed economics with nationalism to the point where people actually developed religious attachments to particular economical positions.
After all, phrenology might be in the dustbin of history, but the cause it served - dividing people into winners and losers so the former might be kept too busy oppressing the latter to start thinking for themselves rather than just doing what they're told - is still alive, as it has been for most of recorded history. It's like a memetic parasite living on noosphere and feeding on humanity generation after generation. Our future depends on killing it before advancing technology makes even losers able to destroy the world.

--
Forget magic. Any technology distinguishable from divine power is insufficiently advanced.
Re:39/100 is the new passing grade. by Quirkz · 2015-05-01 06:46 · Score: 1

This always surprises me. I know how contentious the physical sciences are, and from what I understand the soft sciences are even worse. I'd expect lots of people with differing opinions would be out to refute studies contradicting their assumptions. Maybe the flow of income makes this less possible?

--
The Quirkz Handbook of Self-Improvement for People Who Are Already Pretty Okay
Re:39/100 is the new passing grade. by pipingguy · 2015-05-01 07:15 · Score: 1

"(Freud, if you don't believe me it's because my unverifiable model explains it, and you are in denial)"

That sounds like a Kafkatrap.
Re:39/100 is the new passing grade. by HornWumpus · 2015-05-01 07:19 · Score: 1

Must suck for the few that are actually trying to do science. Not so much for the 99% pushing a agenda and faking it.
It's hard to do social science right and it has a danger of upsetting your world view. So nobody does.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:39/100 is the new passing grade. by HornWumpus · 2015-05-01 08:23 · Score: 1

Lets show who ran the house on the same graph!
There are no Keynesians!

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:39/100 is the new passing grade. by HornWumpus · 2015-05-01 08:26 · Score: 1

It's amusing when two social scientist, neither of who understand basic statistics, get into a pissing contest.
What's not amusing is they decide who 'won' the argument by polling their coworkers feelings.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:39/100 is the new passing grade. by david_thornley · 2015-05-01 10:24 · Score: 1

Just because a result is statistically significant doesn't mean it's correct, although I can't search for the appropriate xkcd here at work,so google for "xkcd jelly beans" or "xkcd significant" to find it.
Statistical significance means that the results you got would be gotten by random chance by a certain probability, typically 5%. 5% is not some magic number, and there's no theoretical significance, but it seemed like a reasonable value.
Therefore, if you take a batch of statistically significant results, some of them are going to be statistical flukes, and retests will not confirm the original significance. This happens. If you run twenty studies correlating two factors, and they're all unrelated, you will get an average of one publishable result. If you run a study with more than one comparison, you're much more likely to get a publishable result, because psychologists generally don't do really good stats. I read a psych paper once that had eight or ten correlations, and suggested that the one that was only 10% likely to result from chance variance was promising.

--
"When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
Re:39/100 is the new passing grade. by david_thornley · 2015-05-01 10:30 · Score: 1

I had a friend who worked for the FDA in animal feeds. She said that all she wanted to recommend approval was three honest studies significant at the 5% level, and was really annoyed at companies who couldn't understand that. (Control group? What's that?)

--
"When you have eliminated the unacceptable, whatever is left, however improbable, must be the truthiness" - Holmes
Re:39/100 is the new passing grade. by bondsbw · 2015-05-01 12:14 · Score: 1

It's not just about trying to prove someone wrong. But few people want to spend their energy on proving someone else to be correct, although that is arguably the best kind of science we can do.
And suppose you do try to prove them wrong, and fail to do so. What's in it for you? Too few papers are submitted or published where the author's hypothesis is shown to be flawed.

--
All my liberal friends think I'm a conservative, all my conservative friends think I'm a liberal.

false positives by jc42 · 2015-05-01 00:21 · Score: 4, Insightful

'This makes it hard to dismiss that there are still a lot of false positives in the literature.'

An even more widespread problem is that there are a lot of true negatives that aren't in the literature.

Of course, this is a problem in most scientific fields, not just the "soft sciences" like psychology. I'm occasionally impressed by a researcher who publishes descriptions of things studied and found to be not significant, but this doesn't happen very often.

--
Those who do study history are doomed to stand helplessly by while everyone else repeats it.

Re:false positives by wisnoskij · 2015-05-01 00:45 · Score: 1

Except they are not really a problem, just wasted effort.

--
Troll is not a replacement for I disagree.
Re:false positives by quintessencesluglord · 2015-05-01 01:01 · Score: 4, Informative

More to the point, this is a problem of funding in all fields.
No one wants to pay for basic research, even if it yields other useful ideas for further research. Unless it hyped to high heavens, the possibility of getting dollars is nil.
Gent I know was a decent researcher who got demoted to teaching community college. After a year of not being able to produce the "right" (read: able to secure further funding), he was canned, and another researcher who was more accommodating to fudging results got the position.
It's not like the experiments were going to be reproduced anyway. Just fodder for additional grants because you produce "results".
Re:false positives by randomencounter · 2015-05-01 01:21 · Score: 2

False positives are why we have to insist on doing research to verify past studies, and they are inevitable for various reasons.
Accepting that, and funding verification studies, is how a science goes from soft to hard.

--
Forget diamonds, copyright is forever.
Re:false positives by mrchaotica · 2015-05-01 01:35 · Score: 4, Informative

You missed the GP's point: the problem is not that true negatives were found; the problem is that they were not published. Because they were not published, future researchers might waste more effort re-discovering them.

--
"[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz
Re:false positives by ceoyoyo · 2015-05-01 01:47 · Score: 1

You shouldn't be impressed by people who publish insignificant results. Insignificant doesn't mean "not true." It means inconclusive. You should be impressed by people who go the extra mile to turn their not significant results into meaningful limits on parameter estimates (setting limits on how big an effect could be). That's done a lot in physics but only occasionally in other fields.
The only reason for publishing inconclusive results is to allow somebody to incorporate them into a later meta-analysis, or to serve as pilot data for someone doing a power calculation to plan a larger experiment.
Re:false positives by jc42 · 2015-05-01 05:23 · Score: 1

You missed the GP's point: the problem is not that true negatives were found; the problem is that they were not published. Because they were not published, future researchers might waste more effort re-discovering them.
Indeed, though there is often a more insidious effect. Suppose there's a claim that treatment T is effective for medical condition C, but it's actually a "placebo effect". If one study showed a (perhaps small) effect, and other studies showing no effect aren't published, a lot of money can be made selling T to customers. If the true negative results are published, the makers (and prescribers) of T lose that income.
This is one of many reasons for the low level of publishing "negative" results. Another reason is linguistic: In English and most other human languages, it's hard to make a clear distinction between "Our study showed no effect" and "Our study didn't show any effect". Most listeners won't hear a difference between negating the object and negating the verb, and the media frequently reports the former as the latter. Managers of scientific organizations are as prone to this problem as the rest of us are. "We want studies that show results, not studies that don't."
And, of course, there's the general cultural resistance to "negativity". This easily explains why many pseudo-scientific beliefs persist centuries after they've been disproved.

--
Those who do study history are doomed to stand helplessly by while everyone else repeats it.
Re:false positives by Anonymous Coward · 2015-05-01 06:24 · Score: 1

Another reason is linguistic: In English and most other human languages, it's hard to make a clear distinction between "Our study showed no effect" and "Our study didn't show any effect". Most listeners won't hear a difference between negating the object and negating the verb, and the media frequently reports the former as the latter. Managers of scientific organizations are as prone to this problem as the rest of us are. "We want studies that show results, not studies that don't."
As a statistician, I would say this is not a linguistic issue but a fundamental conceptual issue. It's (almost) always possible that there is an effect which is just too small for our study to have identified. This is why we pound into the heads of Stats 101 students that failing to reject the null hypothesis is not the same as accepting the null hypothesis. If they media is reporting it as "the study didn't show any effect" then they're probably getting it right. It's extremely unusual for a study to be able to positively show the lack of an effect.
Re: false positives by MenThal · 2015-05-01 07:16 · Score: 1

Your momma is so fat, her shadow reversed global warming!

Obg. XKCD by Reaper9889 · 2015-05-01 00:31 · Score: 5, Insightful

I think it could have something to do with this XKCD:

https://xkcd.com/882/

Re:Obg. XKCD by wile_e_wonka · 2015-05-01 01:03 · Score: 1

I think this paper is saying it is worse than this. Only about 5 percent of studies with 5% chance of results by random chance should be wrong. But this is saying that 60% weren't replicated. It's almost as if the researchers feel pressured to publish papers and they have a hard time publishing papers without results.
Re:Obg. XKCD by west · 2015-05-01 03:22 · Score: 1

Let's say that *no* hypotheses are correct. 1000 studies are conducted. 50 of them find a significant result!
When we do a replication, 2 still pass! They *must* be true.
I know, I'm butchering the statistics, but the main point stands, because we see only the studies with p 0.05, significance doesn't mean what we like to think it means.
Frankly, I'm surprised they reached 39%.

Has Anyone Reproduced Their Results? by registrations_suck · 2015-05-01 00:32 · Score: 4, Funny

Well, this is interesting news, to be sure. Gives us plenty to think about. I can't help but wonder if anyone has been able to reproduce their results.

Perspective by wisnoskij · 2015-05-01 00:44 · Score: 4, Interesting

You need to put this in perspective. Sure, psychology is wishywashy field filled with pseudo science. But apparently their studies are about as reproducible as a bunch of the hard sciences fields. If there is anything that reproduciblility studies have taught us is that if there is around a 50% chance your result is correct than you are around the norm, in a great many fields. This 39% would make them about on par with what I remember from medical/cancer reproduciblility studies.

--
Troll is not a replacement for I disagree.

Re:Perspective by geekmux · 2015-05-01 01:17 · Score: 1

You need to put this in perspective. Sure, psychology is wishywashy field filled with pseudo science. But apparently their studies are about as reproducible as a bunch of the hard sciences fields. If there is anything that reproduciblility studies have taught us is that if there is around a 50% chance your result is correct than you are around the norm, in a great many fields. This 39% would make them about on par with what I remember from medical/cancer reproduciblility studies.
Based on the evidence you've presented here, I've come to this simple conclusion; The hard sciences are just as wishywashy and filled with pseudo science as psychology is.
And if those in the "real" science community want to cry bullshit over this conclusion, then fucking prove it with reproducible studies.
Until then, you appear to be either incompetent or corrupt, with the latter not being a far stretch when you consider the potential revenue involved in the medical community as a whole.
The scary part is not just the result of 39/100 being somehow acceptable here. The scary part is understanding that 61% of studies could be filled with nothing but dangerous lies, potentially putting many lives at risk.
Re:Perspective by Anonymous Coward · 2015-05-01 03:08 · Score: 1

And how do you propose we determine which studies are reproducible without attempting to reproduce them?
Reproducibly is not the same as replication. For a study to be reproducible the original dataset(s), processing code, and methodology must be available for other people to examine and use.
Re:Perspective by bluefoxlucid · 2015-05-01 03:50 · Score: 1

Welcome to science! In 10 years, a lot of what we believe today will be somehow invalid bullshit!
Say it with me: current theory suggests....
I'm an engineer. When the entire field of cognitive science rolled over on its back for K. Anders Ericsson, I said, alright, the research is funny, but the conclusions are useful. So it's all fucked up, full of bullshit and misunderstood data. Ericsson's hundreds of papers and books all boil down to one thing: experts become experts by a principle he calls "deliberate practice", whereby a person must have goal-oriented, technique-focused practice strategies with constant and immediate feedback. Cognitive scientists now define "practice" as some sort of activity that GENERATES ERRORS, having decided you don't learn if you're not fucking it up.
So maybe they don't understand all this bullshit; but they understand now that the prior theory--10,000 hours of rote mechanical behavior to become skilled in something--was bullshit. As an engineer, I don't care that the new theory is full of holes; all I care about is they definitely know that time doesn't really correlate with expertise, except by the confounding of more types of activities occurring in longer time. If you've done something for 10 years, you'll have made and corrected for more mistakes in your career as if you've done it for 10 minutes. The new theory? You have to pick a technical facet of the skill, practice that directly, and do so in a manner that strains your abilities and forces you to make mistakes you can learn from. I'm on board with that, because it works; at least, it works better.
In 10, 20, 50, 100 years, they'll come out and say, hey, we figured out you're like 5 times more effective if you practice in this way, and that whole "deliberate practice" pseudotheory bullshit was just missing this key common behavior among practitioners of deliberate practice! I'll be like, hey, that's cool, we'll do that then, because it works better.
Current theory suggests playing complex songs you can already play on the piano day after day won't make you any better; playing difficult songs will make you better very slowly; and determining what piano musicmanship skills you're weak in and drilling them directly in a fashion demanding skill beyond yours such that you make mistakes of a nature you are able to identify and correct for *will* advance your skill *very* quickly. Maybe current theory is full of bullshit, but all of these statements are verifiable as true, and so I know how I'm practicing my skills.

--
Support my political activism on Patreon.
Re:Perspective by HornWumpus · 2015-05-01 05:38 · Score: 1

Do all the people that study liver disease have fucked up livers?
If no then they are not the same. Psych majors are _all_ batshit insane.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:Perspective by wisnoskij · 2015-05-01 08:03 · Score: 1

I do not think that all or even most experimental psychology studies use brain scans to gather the results. Instead they ask the participant what they think. A physicist does not ask a rock if it thinks it fell quicker with weights attached to it, he simply measures the fall. Furthermore, not only do we not understand the brain enough to really know exactly what we are looking at when we take a scan of the brain, but the brain is such a mutable object that you cannot really compare one to another the same way that you can with other organs. You can cut away as much as 50% of a live working brain and still have a live working brain, which have switched functions that used to be done in the gone part into entirely different parts of the brain; That is akin to cutting out someones liver and having the heart take over the duties of blood cleaning.

--
Troll is not a replacement for I disagree.

If it's published, it must be true? by KitFox · 2015-05-01 00:44 · Score: 1

It wasn't just published on the internet, it was published in a scientific journal!

Silly researchers. You're not supposed to publish science fiction just because a company paid you to write a story that matches their agenda.

--

@Whee

Re:If it's published, it must be true? by godrik · 2015-05-01 02:02 · Score: 1

I tend to seriously dislike the kind of comment that attributes malicious intent to researchers. I do not think that this is a problem with collusions. It is a problem that making a sound and reproducible experiement is HARD. It is easy to forget to report a phase of your experiment that you did not think about but that turn out to be important. It is also easy to have an implicit biais you did not recognize: an obvious one could be you did your experiment on sunday, so you excluded all the church goers.
Re:If it's published, it must be true? by KitFox · 2015-05-01 11:00 · Score: 1

At the same time, just like in any group, there are "bad" researchers and "good" researchers, and the ones performing due diligence and trying to be upstanding can still have a bad day.
The primary focus was to point out that even a scientific journal is not without fault, and there are even some that have been known to accept and promote complete drivel that was created intentionally to point out how flawed the scientific journal system can be in some places.
The general idea is not to make the researchers seem malicious. The intent is to point out that just because it's published doesn't mean it's true, and in this case it's taking it even further than the prior shows of failure that occurred with "Pay to be published" journals.

--
@Whee

The bar for passing research is low in several fie by uarch · 2015-05-01 00:45 · Score: 1

I'm trying to imagine the results for a similar review of computer science and computer architecture research from most universities.

Since there is nearly zero reproduction of results, limited validation, generally poor test content and few incentives to improve research quality I doubt the results would instill much confidence.

Quick plug for JASNH by AEton · 2015-05-01 01:07 · Score: 5, Informative

Just taking this quick opportunity to post a link to my favorite journal, the Journal of Articles in Support of the Null Hypothesis: http://www.jasnh.com/ .

JASNH is one of the few places where you can submit a paper that says "we tested for X effect on Y and found no evidence that X affects Y". Generally this research is unpublishable and people will tweak parameters to get something career-advancing out of their research; I like JASNH because of the reminder that "falsifiability" can really happen.

--
We recently had heard in the office over one of the Yellow Machine that's made by Anthology Solutions.

Re:Quick plug for JASNH by Anonymous Coward · 2015-05-01 01:46 · Score: 1

Don't forget, those studies can be just as wrong.

Which gives a global p value of... by leehwtsohg · 2015-05-01 01:11 · Score: 2

Great! With p=2E-65, studies in psychology aren't totally random.

A floor sweeper contributes more to society by AndyKron · 2015-05-01 01:23 · Score: 1

Psychology is basically pulling something out of your ass, and making it sound good. A floor sweeper contributes more to society.

Re:A floor sweeper contributes more to society by narcc · 2015-05-01 09:40 · Score: 1

What amazes me is that someone with no scientific background can make such absurd statements about a field, of which they have absolutely no understanding, with such conviction.

--
Required reading for internet skeptics

Baseball and Psychology by Virtucon · 2015-05-01 01:38 · Score: 1

Where batting .390 is considered a good thing.

--
Harrison's Postulate - "For every action there is an equal and opposite criticism"

What? - Question Solved. by Murdoch5 · 2015-05-01 01:41 · Score: 1

Okay, 39/100 is an absolute, total and complete failure in all possible regards. Legitimate scientific fields don't get recognized for being able to backup 39% of there research. This goes to show why Psychology will never be considered a real science, it produces unverifiable results and it produces flaky / questionable answers. I'm glad someone took the time to finally put this issue to bed once and for all, 39/100 verified studies is the same as saying: "You're not science, stop acting like it!"

Re:What? - Question Solved. by godrik · 2015-05-01 02:16 · Score: 1

Actually 39% is not bad at all. I am sure it is not better in computer science. As a reviewer I typically need to fight with the authors for them to give enough details to be able to even attempt reproduction. Most CS papers lacks basic information on :
-how the code is written (language, major data structure),
-how it is executed (complied, interpreted, which level of optimization),
-where it is executed (which machine, complete spec, operating system, idle load, is parallelisation used),
-what dataset are used (randomly generated does not say anything unless you give distributions),
-what is precisely measured (Did you include or exclude I/O, did you only measure the kernel you are interested in, did you measure teh entire algorithm, did you include startup and closedown of you execution engine)
If these is not mentionned, then you lack information to attempt to reproduce the result. And then if you have them, there is always a possibility the result will differ from what was reported.
No seriously, psychology is probably better at this game than computer science.
Re:What? - Question Solved. by Murdoch5 · 2015-05-01 03:22 · Score: 1

Most of Computer Science also isn't science. The only real science in CS is when a new data structure is designed or when someone makes a radical new way to accomplish a task. Entering syntax into an editor does not constitute science, that's just leg work. For instance, applying an established principal to achieve an outcome doesn't mean your doing science, science has already been done, at that point it's application, which well it needs to fall back on scientific principals, doesn't mean it's science.
Re:What? - Question Solved. by godrik · 2015-05-01 04:17 · Score: 1

bitter much, aren't you?
For sure, there is a bunch a second grade science in there. But there is a lot of science going on in CS. I suggest you read about it instead of passing for an idiot. :)
Re:What? - Question Solved. by Murdoch5 · 2015-05-01 05:50 · Score: 1

Well I'm a qualified Embedded Software Engineer, Web Developer, and DBA, so I'm not talk out of my ass, I'm talking from experience. Most of what I do isn't science and isn't even engineering, it's typing code into an editor.
Re:What? - Question Solved. by godrik · 2015-05-01 07:08 · Score: 1

I don't get the argument. You are saying that becasue most trained computer scientist are engineers then computer science is not a science?
Then I guess medicine is not a science either since most medical doctors are saving lifes and treating illnesses and not researching their root cause.
I also assume that mechanics is not a science since most professionals are build bridges and designing component and not solving Navier-Strokes equation.
There is a science called computer science that gets published in scientific journal and conferences. And there are reproducibility issues in there as well. Maybe even more than in psychology because these guys get beat up with experimental protocols. And in CS we are more relaxed about it.
Re:What? - Question Solved. by godrik · 2015-05-01 07:19 · Score: 1

well...I beg to disagree.
Computer science is not mostly discrete mathematics, it use to be true in 1980, but that was last century. Also we have "reproducibility" issues in proofs as well. Many proofs in the field are not correctly written. And it causes many of them to be incorrect. For instance there was a critical flaw in the proof of TimSort which caused problem recently.
But Computer Science is much more than that nowadays. Algorithm get tested in practice because they are proved on models of computers. You need to investigate runtime, numercial stability, .... It is good to know that this algorithm is in O(n^2) but the Big-Oh notation hides a constant in there (and a rank of initial validity) which are found experimantally. And that is only in algorithm design.
The entire field of networking and performance is essentially built around a modelization, experiments, discrepancy loop.
All data mining/machine learning is also essentially built on hypothetisation that a model fits reality and validation.
Programming languages/middleware is also fairly experimental, you hypothesized that this programming language delivers a better effort-quality tradeoff than that other ones and you verify it experimentally by taking measuring the performance of a human population with different tools.
Human Computer Interaction is also very experimental, after designing a new system/mode of operation, you are hypothesizing that it enables performing a task faster than some other one. Which you will verify by comparing the performance of a set of users.
Re:What? - Question Solved. by Murdoch5 · 2015-05-01 07:24 · Score: 1

No, you missed my point, which may be my fault, so I'll fairly explain again! :-)

Computer Science, just like Medicine has valid aspects which involve hard science, such as developing a new medication or developing a new data storage mechanism like a Tree / Link List. However where it stops being science is when you're just implementing those solutions, after they've been proven to work and have been flushed out.

If I take a Red / Black tree Library and either write it myself and use it or use an existing one, I'm not doing any kind of science, I'm just applying the existing science to my task. Just as if you take an aspirin for a headache you're not participating in science, your just applying the science to the application.

Science is defined as:

the intellectual and practical activity encompassing the systematic study of the structure and behavior of the physical and natural world through observation and experiment.
, Which for the most part is not what programming or development is. I'm very rarely observing new behavior or studying the systematic response of a new system. I'm simply taking what already exists and reapplying it into a new solution.

It's because of this that I feel we should stop using the term science loosely and on a side note the term engineering. If you're not progressing a field of study then you shouldn't apply the term science, feel free to disagree. Just as if you're not designing something new you're engineering. The terms get thrown around in places they don't belong and I think that should stop.
Re:What? - Question Solved. by narcc · 2015-05-01 10:22 · Score: 1

Okay, 39/100 is an absolute, total and complete failure in all possible regards. Legitimate scientific fields don't get recognized for being able to backup 39% of there research.
Yes, that's why we abandoned the pseudo-science of medicine ages ago. Oh, wait...
Given the little data we have, psychology is 'average'. We won't know if they're doing exceptionally well, or exceptionally poorly, until more studies are done not only on reproducibility in psychology, but in other fields as well.
Reproducibility problems aren't often investigated, and very few fields are actively studying the issue. I suspect that we'll find serious problems in virtually all branches of science as these studies continue. Nature has already taken action. I expect this crisis to hit even physics which is certainly not immune to controversy.
There's also the question of fraud, to which no branch of science is immune. It would be difficult to determine, but very helpful, if reproducibility problems could be divided between methodological problems and fraudulent or falsified results. It's difficult enough to stop computer generated articles from slipping through. How much more difficult would it be to find "real" papers with falsified data?
If nothing else, this should stress the importance of replication in all fields. Scientists are humans, after all, not the purely objective machines you imagine them to be. It's a dangerous belief, often held by non-scientist "science fans", which ultimately undermines the whole enterprise in the minds of the public.

--
Required reading for internet skeptics

Social Sciences are hard to constrain by plopez · 2015-05-01 01:44 · Score: 1

Which makes it very very hard to do research on. In addition effects may change over time, e.g. as a person's individual psychology changes as they age or as culture changes people over time. Doing research in a natural system, like the Social Sciences, is very hard. It is much harder than Physics, Chemistry, or other such Sciences. It is much harder than putting together a cutesy mobi app. And longitude studies are even harder to do well.

--
putting the 'B' in LGBTQ+

God bless him... by Karmashock · 2015-05-01 01:46 · Score: 1, Insightful

I had despaired that psychology could ever pull its head out of his own ass. But if they start actually doing real science again then the field might actually be saved.

It had gotten so bad that I just assumed that the neurologists would have to deal with all this stuff from the other side. Answer the psychology questions with neurology science.

Psychology has become something of a joke lately and there is no way to fix it short of subjecting it to cold empirical science.

--
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.

Re:God bless him... by blind+biker · 2015-05-01 04:18 · Score: 1

The shift to various neurosciences (e.g. neuro-economics) is inevitable, as was the shift from philosophy to philosophy of nature and then the various natural sciences. While philosophy still exists, it deals with issues nobody truly gives much of a fuck anymore.

--
"The agriculture ministry is not in charge of Gundam" - Japanese ministry official.
Re:God bless him... by Karmashock · 2015-05-01 04:57 · Score: 1

Psychology can be a real science though. It just has to preform experiments and limit its theorizing to the data.
Furthermore, statistical data is vastly over used and really should be reduced radically in its application. Especially data that is mined from other studies and re-appropriated for other purposes. The mishandling of statistics is something I think we've all seen and I'm sick of it.

--
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
Re:God bless him... by Karmashock · 2015-05-01 07:04 · Score: 1

If you need someone to hold your hand that is one thing. However, that isn't why I go to see a doctor.

--
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
Re:God bless him... by Karmashock · 2015-05-01 07:13 · Score: 1

I had a friend that quit psychology because he couldn't perform experiments. A lot of it is ethics considerations.
Imagine a chemist having to worry about the ethics of the chemicals he was studying? The problem with psychology is that to do a proper experiment, you might need to break someone's mind to learn what is going on. But that is unethical and so you can't really do serious study into the human mind in most cases.
Consider medicine if they were unable to ever harm a human body and if the body couldn't be studied after death?
There's no such thing as a dead mind you can study and take apart to learn how it works. That is how doctors first started. They worked on the dead to understand how they were put together so they could heal the living.
The other place things were learned was in the triage tents of the legions. Men would be brought in with stab wounds, blunt trauma to the skull, etc... and the doctors were asked to save as many soldiers as they could.
For psychology to be as real of a science as medicine... we're going to have to have the gloves taken off. But that won't happen.
Even the largely harmless tests in the 60s and 70s on college students were considered to be too harmful and unethical. So we can't do anything anymore.
Most of the studies I've seen are statistical studies. And frankly that is everything that is wrong with modern science. Nearly all the statistical studies I've seen were bullshit due to poor handling of the methodology and mistaking the belief that a given conclusion was likely on little more than the bias of the researchers than on whether or not the conclusion had actually been proven.
The man just said, there is a problem in psychology. And that is just one. I think if they can start repeating the experiments a bit more they might make the science less full of shit. But ultimately given that they can't really tear someone's mind apart and then put it back together... I don't think they're really going to learn much.

--
I've decided to stop wasting my time responding to AC trolls/sockpuppets... so if you want a response from me... login.
Re:God bless him... by narcc · 2015-05-01 09:44 · Score: 1

While philosophy still exists, it deals with issues nobody truly gives much of a fuck anymore.
Like epistemology. Oh, wait. That's absolutely essential to modern science. My bad. I didn't mean to disturb your delusion.

--
Required reading for internet skeptics
Re:God bless him... by blind+biker · 2015-05-04 00:39 · Score: 1

That was an interesting post.

--
"The agriculture ministry is not in charge of Gundam" - Japanese ministry official.

Societal effects by operagost · 2015-05-01 01:59 · Score: 1

The frightening part is when one of these unreproducible studies is used to formulate government social services policies. Then we have people imprisoned, or their children taken from them, based on bad science.

--

Gamingmuseum.com: Give your 3D accelerator a rest.

Re:Societal effects by Bongo · 2015-05-01 02:16 · Score: 1

I liked Soros' point that a reason for having an "open" society is that you can never be sure if the fantasy you're operating under, is actually going to do more benefit than harm, so one is always trying to remain not too sure of oneself, hence the "openness". For example, don't have a death penalty, because you can never be sure you're not making a big mistake (you might still jail the person, but basically try to be as available to correcting mistakes as possible). Try to be softer, more open, which I guess, can at the same time appear more liberal and more conservative.

Not just soft sciences by argStyopa · 2015-05-01 02:19 · Score: 1

A lot of people claim the soft sciences are not 'really science' due to the intangibility of their results - and this plays directly into that bias.

However, it's very much not just the softer sciences that have this issue. There's a growing realization that it's pervasive across many hard science disciplines:

http://www.wsj.com/articles/SB... : 64% of pharma trials couldn't be reproduced.

http://retractionwatch.com/201... - half of researchers couldn't reproduce published findings.

We're inundated with data that, due to the specificity of the field or detail of the results, has to come from 'experts' and doesn't lend itself to a sort of common-sense vetting that we can use to filter bullshit in the usual course of our lives. Whether it's from ignorance of statistical methods, poor experimental technique, motivated mendacity (for whatever reason), or simply experimental results that represent only an unusual end of a bell-curve, there are many, many reasons that scientific data has to be taken with a serious grain of salt. It can't be assumed to be conclusive until we've reproduced it in whatever context we're trying to apply it.

--
-Styopa

Re:Not just soft sciences by BCGlorfindel · 2015-05-01 03:27 · Score: 1

A lot of people claim the soft sciences are not 'really science' due to the intangibility of their results - and this plays directly into that bias.
However, it's very much not just the softer sciences that have this issue. There's a growing realization that it's pervasive across many hard science disciplines:
http://www.wsj.com/articles/SB... : 64% of pharma trials couldn't be reproduced.
http://retractionwatch.com/201... - half of researchers couldn't reproduce published findings.
We're inundated with data that, due to the specificity of the field or detail of the results, has to come from 'experts' and doesn't lend itself to a sort of common-sense vetting that we can use to filter bullshit in the usual course of our lives. Whether it's from ignorance of statistical methods, poor experimental technique, motivated mendacity (for whatever reason), or simply experimental results that represent only an unusual end of a bell-curve, there are many, many reasons that scientific data has to be taken with a serious grain of salt. It can't be assumed to be conclusive until we've reproduced it in whatever context we're trying to apply it.
With the exception of climate science. It's settled and you're a heretic if you suggest any uncertainty exists in any part of that field...

Re:What They Don't Say by Runaway1956 · 2015-05-01 02:21 · Score: 1

Sounds about right.

I trust the judgement of peers before I trust the judgement of a shrink. I've heard to many things from and about shrinks to believe that they know shit from shinola.

--
"Windows is like the faint smell of piss in a subway: it's there, and there's nothing you can do about it." - Charlie Br

the process by slashmydots · 2015-05-01 02:41 · Score: 1

Here's an experiment that will always reproduce the same results:
1. psychologist thinks they're right
2. psychologist gets mediocre stats that sort of support their claim
3. psychologist messes with the numbers and eliminates "incorrect" data to make their point appear more supported

It's a joke by Anonymous Coward · 2015-05-01 03:09 · Score: 1

When I was a freshman in college, I took Intro to Psych, and like many psych 101 classes in colleges across the country, the students are required to participate in research studies for grad student researchers in order to complete the class. Some of the studies were what you would consider to be more traditional one-on-one experiments with the researcher, but more often than not, the experiment merely consisted of taking a survey. All of the surveys were scheduled so that everybody involved took them at the same time. Well, I remember one time a survey was scheduled for something like 6pm on a Friday. Nobody wanted to be there, so of course, every student was rushing through the survey so they could get out of there and start partying for the weekend. I remember feeling so bad for the researcher, knowing that their dissertation probably depended on getting good data. Unlike the rest of the students, I took my time taking the survey, to the point where I was the last student left. That's when the researcher came up to me and in a sarcastic tone said "Are you done yet?". At that point I just randomly filled in the rest of the bubbles and came to the conclusion that most psychology studies are probably BS.

Ah psychology by rubycodez · 2015-05-01 03:17 · Score: 1

The "soft science" in the sense that a stick of butter is a "soft structural material"

Re:Ah psychology by HornWumpus · 2015-05-01 07:33 · Score: 1

You ether don't know any psychology majors or are one yourself.
Of course the average nut job doesn't go to medical school, then specialize in psychiatry.
Which isn't to say that psychiatrists aren't also loons.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
Re:Ah psychology by HornWumpus · 2015-05-01 07:35 · Score: 1

You also appear to want to group neurologists (scientists) in with psychologists (not scientists).

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'

Re:If 39% is "good science" by ColdWetDog · 2015-05-01 03:23 · Score: 1

There's no grant money available from government for that. It's only available (in huge quantities) for supporting the paradigm. Strange but true.

I rather thought that energy companies had a few extra dollars running around. Exxon's 2014 revenue was over 400 billion. Surely they could fund a few studies all by themselves.

--
Faster! Faster! Faster would be better!

Re: Sure, replicability is the problem... by Fwipp · 2015-05-01 03:23 · Score: 1

So, are all you skeptics paid shills, or just really passionate about your cause?

Re: Sure, replicability is the problem... by mi · 2015-05-01 03:34 · Score: 1

So, are all you skeptics paid shills, or just really passionate about your cause?

I, for one, am rather passionate about some odd numbers not being primes.

That there is an infinite amount of them strengthens my position, but having only a few exceptions is enough to invalidate the theory I cited as an example.

--
In Soviet Washington the swamp drains you.

Beyond non-reproducibility by Subm · 2015-05-01 03:35 · Score: 1

I'd suggest a study going farther than just checking reproducibility.

I bet for many studies you could produce opposite or contradictory outcomes.

That ought to get someone's PhD published.

Be careful with alleged "reproductions" by davidwr · 2015-05-01 03:53 · Score: 1

If you are unethical and try to reproduce a given experiment 100 times and it reproduces 10 times, you can publish a paper saying "I reproduced this experiment 10 times successfully" and destroy the evidence of the other 90 trials. Find 2 or 3 "independent" shills to do the same type of fake "reproduction" over the course of a few months and people will just assume that the experiment is valid and stop trying to disprove it.

It works in reverse too:

If you are unethical and try to reproduce a given experiment 100 times and it reproduces 90 times, you can publish a paper saying "I tried and failed to reproduce this experiment 10 times" and destroy the evidence of the other 90 trials. Have a few "independent" shills repeat the sham "failure to reproduce" a few times and the original experiment will be discredited, probably along with the original research team and its institution.

--
Knowledge is how to play a game, intelligence is how to win, wisdom is knowing what game to play.

Re:What They Don't Say by cusco · 2015-05-01 04:19 · Score: 1

I grew up a few blocks from a large mental institution, and many of my classmates' parents worked there. An awful lot of the people who worked there ended up loonier than their patients.

--
"Think about how stupid the average person is. Now, realise that half of them are dumber than that." - George Carlin

Re:What They Don't Say by Penguinisto · 2015-05-01 04:23 · Score: 3, Funny

Think of it this way... psychology is the 1960-70's equivalent of today's MBA, and have many similarities:

* neither has an objective means of measuring success or failure, in spite of claiming to have a wide array of methods by which to do so.
* neither the psychologist or the MBA is held accountable for incompetence or non-criminal malice.
* sometimes either one can take on the semblance of religion, minus a deity.
* the big 'do-nothing-but-are-promised-great-riches' degree of the 60's-80's was psychology, as hordes of students took that class thinking just that. In the 90's through today it's the MBA program.
* both can stretch logic and credulity in their work to attempt things that would get an engineer either incarcerated or killed.

(...add your own here...)

(Trigger warning for the MBAs and Psych majors: this is what is known as a joke.)

--
Quo usque tandem abutere, Nimbus, patientia nostra?

Re:Yes: religion by HornWumpus · 2015-05-01 05:35 · Score: 1

The bible was last edited by the council of Nicea in about 250AD. It gets re-translated regularly. Look at the various translations of 1st Timothy 2:12 for how politics can affect that. Who decided to remove 'she should be quite'?

By your standard, that makes the bible about 1/10 as reliable as the Hindu books about flying elephants having battles.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'

Social science is mutable by edcheevy · 2015-05-01 06:38 · Score: 2

Unlike the hard sciences, awareness of classic social science findings can loop back to impact the phenomena in question or they can change in response to society's evolution. Take the bystander effect for example. How many thousands or millions of college students have learned about the bystander effect in Psych 101? Hypothetically, now that they're aware of it, the effect should diminish and not be quite as reproducible as it once was. Then you layer on societal changes (oblivious smartphone/iTunes users increase the effect, but ubiquitous phones may decrease barriers to reporting and responding to violent crime, etc) and the ability to reproduce an earlier effect becomes muddled.

When a physicist announces a new particle, nothing changes. All the particles keep behaving how they were behaving before the announcement, and they don't care how society changes. The findings should be reproducible 100 years from now.

Many other comments have correctly pointed out that studies in general often focus on the new and shiny and statistically significant rather than reproducing prior results or reporting null findings, but the issue of settling on "truth" is made that much more difficult in the social sciences due to the existence of moving targets.

The answer is here by rangek · 2015-05-01 07:02 · Score: 1

http://rsos.royalsocietypublis...

If you use p=0.05 to suggest that you have made a discovery, you will be wrong at least 30% of the time. If, as is often the case, experiments are underpowered, you will be wrong most of the time.

And given the low power of most psychology experiments I am not surprised by this result.

Pure sciences vs. not pure sciences? by Kylon99 · 2015-05-01 07:05 · Score: 1

I've always thought it had something to do with this. Yes, another xkcd post:
https://xkcd.com/435/

I can see how messy proving things are in sociology and psychology, and how absolute mathematical proofs are. It's always disturbed me how uncertain we can be with the sciences as we move to the left, though I really don't know at what point we can call something 'pure.'

Re:Pure sciences vs. not pure sciences? by lucien86 · 2015-05-03 07:30 · Score: 1

Yes but if you're not careful physics and maths can also wonder off deep into the realms of pseudoscience.
In physics current theory stops or fails at the edge of the FTL barrier - but if you examine a map of reality 99.9999...% of the universe can only exist as an FTL structure. The FTL region of general relativity predicts either non-existence, or a young universe, or a fixed fate (past and future fixed), and is totally illogical given that 3 dimensions occupy a smaller region than 4. Then there are Super strings, 10 dimensional spaces, super symmetry, the 'holographic' principle (2D universe), circular time, etc, etc... most of them with no proof whatsoever.
In the real world maths is ultimately based on physics and not the other way around - and mathematics when analysed is an evolved system - and that makes the whole concept of Pure mathematics essentially a lie and little more than a delusion.. but higher maths wonders off into many strange realms like infinite dimensional spaces, Laplacian operators, non-Euclidian topologies.. and many others.

--
Below the speed of light Special Relativity is one of the most accurate theories in physics - above the speed of light..

Because it is a SOFT science by blang · 2015-05-01 07:11 · Score: 2

You can measure how many parts per million of some matter is in teh air.
You can measure how many bacteria of a certain type is in your blood stream.

How do you measure if someone is in a good or bad mood?

The tester's bedside (or couch-side) manners can be enough to tilt the result one way or the other.

And if the researcher has an idea of what he is looking to find, he can (even subcounsciously) manipulate the patient into reacting one way or the other, tainting the measurement.

What do we measure, how do we measure it? The subject could be lying. They subject could be be imagining something. The tester has no way to verify.

Reproducibility is NOT the problem.

Even research that was reproduced can be wrong, for same reasons as above.

The NATURE of the field is the problem, not the lack of reproduciblility.

Lack of reproducibility is merely the proof that there are fundamental problems with measurements and conclusions.

But I agree that the conclusion we can draw, is that there are a lot of false positives.

--
-- Another senseless waste of fine bytes.

Sounds quite reasonable by SoftwareArtist · 2015-05-01 07:42 · Score: 1

'A lot of working scientists assume that if it's published, it's right,' he says. 'This makes it hard to dismiss that there are still a lot of false positives in the literature.'

Ummm... they do? Like, who? Not a single one I know.

If a result is published, I assume (as do most other scientists) that means very little until it's been reproduced, and even then I remain quite skeptical until it's stood the test of time. I assume many published results will turn out to be wrong. That's just the nature of science. Every paper is a work in progress, a snapshot of someone's research at one moment. And that's fine.

So 39% were successfully reproduced, and another 24% came close? I'd call that pretty good, especially in psychology where you're studying an incredibly complex system (the human brain) while trying to sort out hundreds of interacting factors.

--
"I'm too busy to research this and form an educated opinion, but I do have time to tell everyone my uninformed opinion."

Psychololgy is psuedoscience by tehlinux · 2015-05-01 07:56 · Score: 1

That is all.

--
Most linux users don't know this, but the man pages were named after Chuck Norris. Chuck Norris fsck'ing hates noobs!

Re:What They Don't Say by PopeRatzo · 2015-05-01 08:07 · Score: 1

I grew up a few blocks from a large mental institution, and many of my classmates' parents worked there.

You mean they said they worked there.

--
You are welcome on my lawn.

Invisible Hand by PopeRatzo · 2015-05-01 08:08 · Score: 2

As dismal as Psychology's record is as a science, it's still way more rigorous and evidence-based than Economics.

--
You are welcome on my lawn.

Probability a paper is correct in a "FIELD" = PPV by jameson.burt2404 · 2015-05-01 10:15 · Score: 1

These results merely define the proportion of true results in the psychology FIELD, presuming the papers did good research.
See PLOS's "most" viewed paper, "Why Most Published Research Findings are False" by John Ionnidis August 30, 2005 at PlosMedicine.org Why Most Published Research Findings are False

Ionnidis paper proves that
"After a research finding has been claimed based on achieving formal statistical significance, the post-study probability that it is true is the Positive Predictive Value,"
PPV = (1 - beta) R / (R - beta * R + alpha)
= 1 / [1 + alpha / (1 - beta) R) ]
where

alpha = .05 usually -- the probability of a Type I error beta is the probability of a Type II error (1 - beta is the power) R is the ratio of true relationships to no [false] relationships in that field

Here, for psychology, with alpha = 0.05,
PPV = 0.39 = 1 / [1 + .05 / (1 - beta) R]
so

R = 0.03 if 1-beta = 1 [the power for a very large sample] 0.06 if 1-beta = 0.5 0.15 if 1-beta = 0.2 [the power for a moderate sample].

That is, these psychology papers operate in a field with around R = 0.15 true/false relationships.
Germany's Pharmaceutical Bayer found only 30 percent (PPV=0.30) of all pharmaceutical papers verifiable, corresponding to an R = 0.11.

You can change the ratio R to
R / (1-R),
the pre-study probability the relationship is true. Call this the "Background Probability" of a true relationship.

In the extreme though not uncommon genetics field, research seeks from 30,000 genes the (at most) 30 genes that influence a genetic disease, for which
R = 30/30000 = 0.001
and at this small R, PPV is then also about 0.001.

Don't lose track. There are three fractions mentioned here,
(1) R (ratio of true relationships to false relationships in the field, before experiment)
(2) Background probability = R / (1-R)
(3) PPV (after an experiment and publication, this is the probability the result as significant)

While the researchers/statisticians can set alpha = 0.05, and can get beta = 0.80, their probability meaning is clouded by their frequentist interpretation. What the statistician can't set, and what is never mentioned -- the Background Probability -- differs and is important in each research field!

When the Background Probability is moderate, a design with moderate power (1 - beta) can get good PPV. But research often works in a field of previously unseen results, or uses data mining software (a good generator of false results and tool of charlatans), where R does equal 0.01 or even 0.001. In these many fields, the Background Probability swamps any statistical design's alpha and beta. "Most research findings are false for most research designs and for most fields... a PPV exceeding 50% is quite difficult to get." Indeed, a look at the PPV formula shows that whatever alpha, even a power of 1 (a little thought reveals why more power hardly helps here) produces mostly false results if the Background Probability itself is less than alpha!

If R must be relatively large in a "field" for published results to represent true relationships, then a large proportion of relationships considered in that field are true (significant). Such a research field should be exceedingly boring. In the other extreme, in a "field" with relatively few true relationships, research produces mostly false conclusions. However, in followup studies from published results (eg, pharmaceuticals check results with further studies), R becomes large (note the conditioning). When you see that the probability published research represents a true relationsh

Re:You're playing a bit of a game by HornWumpus · 2015-05-02 05:58 · Score: 1

It was a full tilt edit. How did the poor old lady giving a single copper story show up in John? It was never there before the edits. There are multiple fragments that cover the verses affected. It wasn't there before Nicea. They just paraphrased it from Mark.

They fully consolidated the bible. Synchronizing 'good' gospels while burning others etc.

--
John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'

Half Science ..Half ? by lucien86 · 2015-05-03 06:51 · Score: 1

Just proves what we already knew - that psychology is half science and half pseudoscience. Good thing that so many base their careers and their children's future and the future of the world on it..

An Astrologer "..and they had the nerve to call us pseudoscientists!"

--
Below the speed of light Special Relativity is one of the most accurate theories in physics - above the speed of light..

Slashdot Mirror

Results Are In From Psychology's Largest Reproducibility Test: 39/100 Reproduced

111 of 174 comments (clear)