Slashdot Mirror


Bloggers Put Scientific Method To the Test

ananyo writes "Scrounging chemicals and equipment in their spare time, a team of chemistry bloggers is trying to replicate published protocols for making molecules. The researchers want to check how easy it is to repeat the recipes that scientists report in papers — and are inviting fellow chemists to join them. Blogger See Arr Oh, chemistry graduate student Matt Katcher from Princeton, New Jersey, and two bloggers called Organometallica and BRSM, have together launched Blog Syn, in which they report their progress online. Among the frustrations that led the team to set up Blog Syn are claims that reactions yield products in greater amounts than seems reasonable, and scanty detail about specific conditions in which to run reactions. In some cases, reactions are reported which seem too good to be true — such as a 2009 paper which was corrected within 24 hours by web-savvy chemists live-blogging the experiment; an episode which partially inspired Blog Syn. According to chemist Peter Scott of the University of Warwick in Coventry, UK, synthetic chemists spend most of their time getting published reactions to work. 'That is the elephant in the room of synthetic chemistry.'"

31 of 154 comments (clear)

  1. Terrible, Terrible, Headline by damn_registrars · · Score: 5, Insightful

    The bloggers are not testing the scientific method, they are testing methods that are scientific. Those are two vastly different concepts. Their work is important, but not epic.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    1. Re:Terrible, Terrible, Headline by girlintraining · · Score: 2, Interesting

      The bloggers are not testing the scientific method, they are testing methods that are scientific

      Putting your ignorance in boldface type is amusing. The most basic promise of the scientific method is that results can be replicated by anyone with the proper equipment repeatedly and reliably. This is accomplished by describing an experiment to the detail level necessary to reproduce the result. If the result cannot be reproduced from a description of the experiment, it has failed this test.

      They are testing the scientific method insofar as asking whether professional and peer-reviewed scientific work actually meets this basic test. And in many cases it doesn't. Science only works if it is built on a firm foundation: Their work isn't just important, it's critical. It may not be fun, but explorations like this prevent us from assembling a body of knowledge and understanding based on flawed experiments... and that has happened many times in the history of science, especially in medicine.

      --
      #fuckbeta #iamslashdot #dicemustdie
    2. Re:Terrible, Terrible, Headline by gman003 · · Score: 5, Informative

      They are testing whether scientific papers meet the scientific method (ie. the results are reproducible). They are not testing the validity of the scientific method itself (myself, I cannot see how one could test the scientific method without using it, thus bringing the results into question).

      That is the point GP was attempting to make.

    3. Re:Terrible, Terrible, Headline by flink · · Score: 5, Insightful

      Right, but they are utilizing the scientific method to test the quality of published papers, not attempting to verify the utility of the scientific method itself.

      The headline should read "Bloggers apply scientific method to validate published findings".

    4. Re:Terrible, Terrible, Headline by gandhi_2 · · Score: 2

      This speaks to the failings of the participants implementation of the scientific method, not to the failing of The Scientific Method.

      We have discussed here before the problem of too much content being generated and not enough people to peer review it all. Still not a failing of The Scientific Method.

      And I prefer my ignorance in courier font.

    5. Re:Terrible, Terrible, Headline by phantomfive · · Score: 4, Insightful

      "Bloggers apply scientific method to validate published findings".

      A much better headline.

      --
      "First they came for the slanderers and i said nothing."
    6. Re:Terrible, Terrible, Headline by c0lo · · Score: 5, Funny

      myself, I cannot see how one could test the scientific method without using it, thus bringing the results into question

      So little faith you have...

      (large grin)

      --
      Questions raise, answers kill. Raise questions to stay alive.
    7. Re:Terrible, Terrible, Headline by damn_registrars · · Score: 3, Informative

      The bloggers are not testing the scientific method, they are testing methods that are scientific

      Putting your ignorance in boldface type is amusing. The most basic promise of the scientific method is that results can be replicated by anyone with the proper equipment repeatedly and reliably

      And I am sorry that you struggle so greatly to understand what I have written.

      They are testing the scientific method insofar as asking whether professional and peer-reviewed scientific work actually meets this basic test.

      Do you not understand the purpose of peer-review? If results that were peer-reviewed are not reproducible, that is not a failing of the scientific method itself, nor is it a failing of peer review. Peer review does not exist to validate methods as that would be quite nearly an impossible task for the majority of all scientific papers that are published currently unless the journal sent an editor to the lab that submitted said paper to rerun the work themselves - which would be so absurdly expensive that nobody would ever pay to publish. Peer review is intended to make sure that work published is scientifically rigorous and well written.

      Hell, if you go back and actually read my comment - I would say re-read but it does not appear you read it successfully for a first time yet - you will find that I did say this work is important. I also said that it is not testing the scientific method itself, which is correct.

      --
      Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
    8. Re:Terrible, Terrible, Headline by RatherBeAnonymous · · Score: 2

      A scientist should also run experiments multiple times to see if the results are repeatable before publishing those results. If you can't repeat your results you can't possibly give others instructions on how they can repeat them. Not knowing that the HVAC failed for a couple hours during one run out of a dozen should result in outlier results that can be investigated or discarded.

    9. Re:Terrible, Terrible, Headline by fluffy99 · · Score: 2

      Do you not understand the purpose of peer-review? If results that were peer-reviewed are not reproducible, that is not a failing of the scientific method itself, nor is it a failing of peer review. Peer review does not exist to validate methods as that would be quite nearly an impossible task for the majority of all scientific papers that are published currently unless the journal sent an editor to the lab that submitted said paper to rerun the work themselves - which would be so absurdly expensive that nobody would ever pay to publish. Peer review is intended to make sure that work published is scientifically rigorous and well written.
       

      Many of the published results and methods being verified are ones that have questionable results - such as producing too much output chemicals or reactions don't appear they should work at all. Those are papers for which peer-review has failed to provide adequate review. If the paper was truly read in-depth by other equally qualified scientists these issues would have been noticed and the paper (published or not) would have been called into question.

      The caveat to this would be papers that are published with the sole purpose of seeking peer review and inviting other to validate the results, for example many of the cold fusion papers and the experiment which implied neutrinos were traveling faster than light.

      I also recognize that peer review happens both before and after publication, and in fact the bloggers are part of the peer-review process.

    10. Re:Terrible, Terrible, Headline by __aaltlg1547 · · Score: 3, Insightful

      No, it means that the original experimenters didn't describe their experiment correctly. Or worse, may have never done it at all...

    11. Re:Terrible, Terrible, Headline by mysidia · · Score: 2

      We have discussed here before the problem of too much content being generated and not enough people to peer review it all. Still not a failing of The Scientific Method.

      The article's headline text is a demonstration of that issue of too much content and not enough peer review :)

    12. Re:Terrible, Terrible, Headline by RespekMyAthorati · · Score: 2

      Then the scientific method is effectively shot, not because it's invalid, but because it's not being followed, and it becomes no longer reasonable to believe researchers are following it

      That is identical to:

      People who don't do up their seatbelt buckles die.
      Therefore, seat belts fail to protect people.

      The fact that people are failing to apply the scientific method does not mean that the scientific method has failed, only that some people who call themselves "scientists" shouldn't.

    13. Re:Terrible, Terrible, Headline by vlm · · Score: 5, Insightful

      A scientist should also run experiments multiple times to see if the results are repeatable before publishing those results.

      Won't help. I studied to be a chemist (admittedly a long time ago) and by far the biggest non-ethical problem out there is contamination.

      So it turns out that your peculiar reaction you're studying is iron catalyzed, in fact its incredibly sensitive to iron, but no one in the world knows that yet. And your reagents are contaminated. Or your glassware, which you thought was brand new and/or well cleaned, is contaminated. Or your lab is downwind of a hematite ore processor and the room dust is contaminated.

      Sure, you say, test everything. Well there isn't time/money for that, but for the sake of argument we'll assume there is. What if dust from the hematite ore processor is far larger than the filter paper pores in filtration stage of your overall process? Test the reagents and product all you want but you'll never find iron anywhere except room dust (which you already knew about) and the debris in the filter paper (which you assume was contaminated by room dust AFTER removal from the apparatus)

      The most important thing is this is the norm in chemistry, not an outlier. Chemistry is not math or CS, sometimes stuff just doesn't work or just works for no apparent reason. Unlike some technologies, detailed modeling of "why" "how" often doesn't happen for years, decades, centuries after the ChemEng team has been selling product / papers have been written.

      A very important lesson is analysis paralysis. So you live downwind of a hematite ore crusher. And you know it. And periodic tests of your lab show iron enriched house dust. But you can't go around testing everything, because you're surrounded by millions of things to test for. You're a gardener, god only knows whats on your hands. Skin oil of certain blood types is a contaminant? Your breath has a tinge of ethanol in it from last night? Maybe its your perfume / cologne / antiperspirant / nail polish? The point of discovery is its literally unknown... maybe wearing nitrile gloves instead of old fashioned latex "does something" good or bad to the reaction.

      I think the main thing "slashdotter IT people" need to understand is most chemistry and most chemical engineering runs somewhat less than 6 sig figs. This is incomprehensible to IT people... if your T1 or whatever LAN had a BER worse than 1e5 you'd call it in for repair... If you got one thousand read errors when you read a 1 gig DVD, you'd throw out that DVD. If your processor runs at 1500 mips then at a six sig fig error rate it would crash about 1500 times per second. The bad news is six sig figs is actually pretty good work for a chemistry lab. Certainly undergrads could never aspire it that level, both skill and experience and specialized equipment....

      Please spare me the details of one peculiar quantitative analysis technique that in one weird anecdote measured once in tiny fractions of a ppt. The overall system cannot be "cleaner" than the filthiest link in the entire system. Is the hand soap in your lab spectrographically pure? The unused toilet paper in the bathroom? All of your hoods and benches and storage cabinets are in a verified and tested cleanroom environment? Seriously? The drinking cooler and lab fridge also only hold spectrographically pure substances? Please no anecdotes.

      --
      "Science flies us to the moon. Religion flies us into buildings." - Victor Stenger
    14. Re:Terrible, Terrible, Headline by Electricity+Likes+Me · · Score: 2

      Do you not understand the purpose of peer-review? If results that were peer-reviewed are not reproducible, that is not a failing of the scientific method itself, nor is it a failing of peer review. Peer review does not exist to validate methods as that would be quite nearly an impossible task for the majority of all scientific papers that are published currently unless the journal sent an editor to the lab that submitted said paper to rerun the work themselves - which would be so absurdly expensive that nobody would ever pay to publish. Peer review is intended to make sure that work published is scientifically rigorous and well written.

       

      Many of the published results and methods being verified are ones that have questionable results - such as producing too much output chemicals or reactions don't appear they should work at all. Those are papers for which peer-review has failed to provide adequate review. If the paper was truly read in-depth by other equally qualified scientists these issues would have been noticed and the paper (published or not) would have been called into question.

      The caveat to this would be papers that are published with the sole purpose of seeking peer review and inviting other to validate the results, for example many of the cold fusion papers and the experiment which implied neutrinos were traveling faster than light.

      I also recognize that peer review happens both before and after publication, and in fact the bloggers are part of the peer-review process.

      Also frankly, there's a ton of papers out there which do work but which omit crucial details of how they work. Queue 2 years of my life discovering that someone's "simple robust synthesis" can't possibly have worked the way they said it did, and gradually reverse engineering that they were doing something they didn't report in any papers which had an important effect (they didn't quite seal up the reaction vial, but didn't leave it open, so coupled with water and condensation they were setting up a highly unreliable oxygen headspace control system - rig up a reliable way to fix the mol % and hey, anyone can do it!).

      I've very much become convinced that the terse writing style which gets encouraged in a lot of chemistry papers is just not useful at all when it comes to replicating them (and protocol papers are worse - people write still write what they were trying to do, not necessarily what actually gets done more often then not).

  2. Not just synth chemistry by Anonymous Coward · · Score: 3, Informative

    If you try to repeat an experiment and fail then it is almost impossible to get published. Failed experiments, though critical for advancing science, aren't sexy and editors prefer their journals to be full of positives. So scientists don't even bother trying anymore. This is a problem in medicine and probably all sciences. There is a movement in medicine to report all trials so they can be found by researchers doing meta-studies.

    1. Re:Not just synth chemistry by WrongMonkey · · Score: 3, Interesting

      As a research chemist, I've published a couple of papers that were motivated because I didn't believe a paper's results to be true. The trick to get it past reviewers is to not only prove that they are wrong, but to come up with an alternative that is demonstrably superior.

    2. Re:Not just synth chemistry by phantomfive · · Score: 2

      It's not a problem with the scientific method, it's a problem of communicating results. Clearly it isn't working optimally if this article is correct. If people can only tell each other when something works, but can't discuss when things don't work, then the communication channels are severely broken.

      --
      "First they came for the slanderers and i said nothing."
  3. Good luck with that by WrongMonkey · · Score: 4, Insightful

    It's not a secret that about half of published synthesis methods are garbage and yield values are wildly creative. Reviewers don't have the means to verify these, so anything that seems plausible gets published. Then researchers are left to sort out the best methods based on which ones get the most citations.

    1. Re:Good luck with that by TrekkieGod · · Score: 2

      It's not a secret that about half of published synthesis methods are garbage and yield values are wildly creative. Reviewers don't have the means to verify these, so anything that seems plausible gets published. Then researchers are left to sort out the best methods based on which ones get the most citations.

      It's not just synthesis methods. I remember taking a graduate control theory class in which the final project was for the class to replicate the results of a paper with a particular control algorithm. It just...wouldn't...work. Not a single person managed to replicate the results, which simply led to the inevitable conclusion they were fudged.

      I'm not and would never defend anyone who publishes any data that has been tempered with, but I still find it annoying that we've set up a system where there's an incentive to do so. There's tremendous pressure to publish in academia, starting in grad school. Combine that with the difficulty in publishing papers that have negative results and a lack of interest in replicating any experiments that are not groundbreaking and you end up with the quality of papers we have here. I'd love for it to be standard for grad students' first papers, as they're learning to write them, to be just replicating results from other papers. And have journals actually recognize the importance of such work, and publish the results often. I think this would cut down the number of crappy papers, because first, you wouldn't want to publish something that's going to be shown to be bullshit in short order, and second, you'd be able to satisfy your publishing requirements by doing the important task of verifying other people's work.

      --

      Warning: Opinions known to be heavily biased.

  4. Which scientific method are they testing? by EmagGeek · · Score: 2

    Are they testing the tried and true scientific method that *real* scientists used for centuries to arrive at the cumulative knowledge of mankind, or are they testing the modern scientific method that involves drawing a conclusion and then trying to find data that fits your model, discarding any data that doesn't?

  5. It might be epic by Okian+Warrior · · Score: 5, Interesting

    The bloggers are not testing the scientific method, they are testing methods that are scientific. Those are two vastly different concepts. Their work is important, but not epic.

    I'm not so sure about that.

    We believe in a scientific method founded on observation and reproducible results, but for a great number of papers the results are not reproduced.

    Taking soft sciences into consideration (psychology, social sciences, medical), most papers hinge on a 95% confidence level. This means that 1 out of every 20 results arise from chance, and no one bothers to check.

    Recent reports tell us depression meds are no better than chance and scientists can only replicate 11% of cancer studies, so perhaps the ratio is higher than 1 in 20. And no one bothers to check.

    I've read many follow-on studies in behavioral psychology where the researchers didn't bother to check the original results, and it all seems 'kinda fishy to me. Perhaps wide swaths of behavioral psychology have no foundation; or not, we can't really tell because the studies haven't been reproduced.

    And finally, each of us has an "ontology" (ie - a representation of knowledge) which is used to convey information. If I tell you a recipe, I'm actually calling out bits of your ontology by name: add 3 cups of flour, mix, bake at 400 degrees, &c.

    This assumes that your ontology is the same as mine, or similar enough that the differences are not relevant. If I say "mix", I assume that your mental image of "mix" is the same as mine. ...but people screw up recipes, don't understand assembly instructions, and are confused by small nuanced differences in documentation.

    Does this happen in chemistry?

    (Ignoring the view that reactions can depend on aspects that the researchers were unaware of, or didn't think were relevant. One researcher told me that one of her assistants could always make the reaction work but no one else could. Turns out that the assistant didn't rinse the glassware very well after washing, leaving behind a tiny bit of soap.)

    It's good that people are reproducing studies. Undergrads and post-grads should reproduce results as part of their training, and successful attempts should be published - if only as a footnote to the original paper ("this result was reproduced by the following 5 teams..."). It's good practice for them, it will hold the original research to a higher standard, and eliminate the 1 out of 20 irreproducible results.

    Also, reproducing the results might add insight into descriptive weaknesses, and might inform better descriptions. Perhaps results should be kept "Wikipedia" style, where people can annotate and comment on the descriptions for better clarity.

    But then again, that's a lot of work. What was the goal, again?

    1. Re:It might be epic by Areyoukiddingme · · Score: 2

      I have mod points, but stupid Slashdot isn't showing me the moderation option on posts. So I will snark instead.

      But then again, that's a lot of work. What was the goal, again?

      Uhm. Publish or perish, I think it was...... Full speed ahead and damn the torpedoes. Verification studies don't count on your CV.

      Good idea to harness the slave labor though. That's what grad students are for.

      And how many grad students will actually be willing to do this verification work? None, who can think politically enough to stay in academia. What are you asking them to do? Verify published papers. Who published those papers? The people who will be sitting on their doctoral thesis board. The people who will be peer reviewing their papers. The people who are on the tenure committee. The people who are on the funding committee. I could go on.

      The scientific method is a marvelous thing, but the way the system is rigged, we're left to amateurs to verify anything. All hail the internet and bored people, I guess. I also guess this is a measure of our wealth as a society, that there are people with the time and money to donate to verifying synthetic chemistry results. Expect a lot of shouting and angst when they can't verify results though.

    2. Re:It might be epic by Skippy_kangaroo · · Score: 3, Interesting

      Many commenters seem to be mistaking some idealised thing called the Scientific Method with what is actually practiced in the real world when they claim that the scientific method is not being tested. Damn straight this is testing the scientific method - warts and all. If the scientific method as practiced in the real world does not deliver papers that provide sufficient detail to be reproduced then it is not working properly and is broken. In fact, I'm sure that most people who actually publish papers will acknowledge that the peer review process is broken and does not approach this idealised notion of what peer review is in many people's minds. If peer review is broken then a critical element of the scientific method is also broken.

      Reproducing results is hard every time I have done it - and that includes times when I have had access to the exact data used by the authors. (And sometimes even the exact code used by the authors - because I was using a later version of the statistical package and results were not consistent between versions.)

      So, if people want to claim that the Scientific Method is perfect and this is not a test of it - it would be interesting if they could point to anywhere this idealised Scientific Method is actually practiced (as opposed the the flawed implementation that seems to be practiced in pretty much any field I have ever become acquainted with).

    3. Re:It might be epic by MozeeToby · · Score: 2

      It's good that people are reproducing studies. Undergrads and post-grads should reproduce results as part of their training, and successful attempts should be published

      Dang, you were doing so well too. How about we publish both the successes and the failures, rather than just the successes. Just publishing the successes is how we get stuff like the 11% reproducibility rates you talked about earlier. So a footnote in the original paper: "this procedure was replicated by 23 teams, those results supported the results of this paper in only 3 cases."

  6. Re:Seriously? IDIOTS by paiute · · Score: 3, Insightful

    Frankly, Chemistry is among the easiest of the physical sciences. I say this as the physicist who was tasked by the chemists to fix their gear when it broke down.

    Organic chemistry is quite difficult. The purpose of synthesis is not as you suppose, just mix A and B, see what happens and publish. Most organic chemists are trying to make specific transformations on certain parts of molecules in high conversion and trying to control the variables of time, temperature, concentration, reagent reactivity with substrate functional groups, etc.

    Physics is just a block on an inclined plane and variations.

    --
    If Slashdot were chemistry it would look like this:Cadaverine
  7. Re:Yeah, because thats exactly who I trust. by docmordin · · Score: 2

    Like how in the 70s scientists all over the world proved and knew nuclear energy was causing an ice age but of course we all know how good their factual evidence turned out to be.

    Or like how you're wrong about that, since there was no scientific consensus in the 1970s that Earth was headed into an ice age:

    T. C. Peterson, W. M. Connolley, and J. Feck, "The myth of the 1970s global cooling scientific consensus", Bulletin of the American Meteorological Society, 89: 1325-1337, 2008.

  8. The work of Harold Garfinkel by aussersterne · · Score: 2

    is instructive (though still not very readable) here.

    The founding ethnomethodologist, Garfinkel argued that much of science depends on practical assumptions and habits of which researchers are only vaguely aware, leading to the "loss" of the phenomenon.

    This is both good and bad. On the one hand, it means that a phenomenon is real, with real implications (useless theory and tautology are marked by the difficulty of losing the phenomenon), but on the other hand it means that what is said about the phenomenon is often missing the most critical bits of information, unbeknownst to the PIs themselves because they are unaware of practical (embedded in the practice of) assumptions and habits, something that makes it seem likely that many scientific truths are either solidified far later than they might otherwise have been or incorrectly lost to falsification rather than pursued.

    --
    STOP . AMERICA . NOW
  9. A very old elephant in the room by decora · · Score: 2

    These many many decades ago I went to a "top 5 STEM school" and enrolled in Chemistry. While all of the students had Very Expensive Machines to play with, they also had No Time To Do Careful Work.

    "Flubbing" of results was considered kind of, you know, ordinary, and if you didn't get what was expected, well, you were expected to just kind of ignore it.

    People like to harsh on the 'soft' subjects like history ... but I can assure you that many a history teacher would chew you up one side and down the other for saying things that are patently false and blatantly contradict the evidence - while many a Chemistry teacher would simply tell you "well, it should have worked. and you understand the basic ideas. so lets move on."

    I'm not sure what the issue is - if it's just too expensive to do experiments, or maybe the point is not to learn 'experimenting' but rather to learn 'theories... with some hands on experimenting to give you a flavor of it"????

  10. computer science too by Anonymous Coward · · Score: 3, Interesting

    The sad truth is that this happens everywhere, also in computer science. I once implemented a software pipelining algorithm based on a scientific paper, but when implemented, the algorithm appeared to be broken and had to be fixed first. In this case, it probably wasn't malice either. But the requirements to get something published are different from getting it to actually run. Brevity and readability may get in the way of correctness, which may easily go unnoted.

  11. Not reproducible by jotajota1968 · · Score: 3, Insightful

    I am a physicist. I do believe that a large percentage (less say 50%) of scientific publications do not meet basic quality standards, let alone the scientific method. It depends also on the level of the journal. But even in the best journals you can find articles and results that can not be reproduced. The pressure to publish is too strong. Anyway, 70% or more of the articles are only academic exercises, or do not have robust statistics or do not receive more than one or two citations (including a couple of my own). Only the best of the articles (should) prevail with time (Darwinism).