Slashdot Mirror


Open v. Closed Source-Climate Change Research

theidocles writes "The ongoing debate over the 'hockey stick' climate graph has an interesting side note. McKitrick & McIntyre (M&M), the critics, have published their complete source code and it's written using the well-known R statistics package (covered by the GPL). Mann, Bradley & Hughes, the defenders, described their algorithm but have only released part of their source code, and refuse to divulge the rest, which really makes it look like they have some errors/omissions to hide (they did publish the data they used). There's an issue of open source vs closed source as well as how much publicly-funded researchers should be required to disclose - should they be allowed to generate 'closed-source' solutions at the taxpayers' expense?"

18 of 443 comments (clear)

  1. How much is enough? by mfh · · Score: 5, Insightful

    how much publicly-funded researchers should be required to disclose

    All of it, baby. We're paying for it -- we should have the right to:

    a) Know what you're spending our money on
    b) Have the right to make it better ourselves
    c) Learn of security flaws early so we can correct them

    Especially when there is some doubt about the nature of the results in the closed source model from Mann et al.

    --
    The dangers of knowledge trigger emotional distress in human beings.
    1. Re:How much is enough? by R.Caley · · Score: 4, Interesting
      We're paying for it

      Of course, often you will only be paying for part of it. It is common for research to come out of a combination of `projects' funded from different sources.

      What should happen if, for instance, a drug company funds a project into developing statistical theory and signal analysis and so on to improve analysis of early candidate drug screening data, and then the researchers use the prototype implementation in a publicly funded project they are involved in on climate data and find something significant?

      Or what happens for part-state, part-commercially funded projects?

      I think one thing which could be done is to give companies a (bigger) tax break on money put into research (internal or when they give grants) if they sign up to give out not only all the data, but things like source of programs and detailed design of prototypes and experimental setups.

      Another thing would be to set up some kind of peer review process and then treat published source as a publication for the researcher. If your peers sign off to say that you have produced and documented the code to the level where it is a useful resource for other researchers, then it should count towards departmental and personal evaluations just as a journal article would. The formalised review process is important -- the average bit of lashed-together-to-get-the-data research code is more equivalent to a scribbled note on a whiteboard than to a journal article.

      Perhaps all that is needed is an online journal set up and run as a properly organised accademic journal, but specialising in publishing code. Imagine an infrastructure not a million miles away from sourceforge, but with a peer review process to decide what gets counted as a release.

      --
      _O_
      .|<
      The named which can be named is not the true named
  2. It should be called argh! by Anonymous Coward · · Score: 5, Funny

    I hate projects with names like R. I used R a while back, and it's a great program, but try searching for "R" plugins on Google. Not fun.

  3. No brainer... by wileynet · · Score: 5, Insightful

    Science, like government, should be transparent. The public should be able to see and evaluate every part. Any science, or government, that hides it's implementation is inherently suspect to corruption.
    Closed science is half a step from religion. You are expected to have faith in the researcher's methodologies, analysis, assumptions, and motives. Sorry, but good science does not rely on faith.

  4. Re:Short answer, no. by FuzzieNorn · · Score: 4, Insightful

    No. By that logic, the taxpayer should be able to see information about how the selection of doctors is made, and which prescriptions should be prescribed, and generally how Medicare money is spent.

  5. Arguments for & against open-source by G4from128k · · Score: 5, Interesting
    Arguments for open-source science:
    1. Replication: Science thrives on replicable results. If researchers don't publish everything, others cannot replicate the results and the original findings become suspect.
    2. Knowledge Sharing: If the point of publicly funded academic research is to advance human understand of the world, then open access to methods (code) is a key part of that.
    3. Reduce/Eliminate Stealth Patents: Releasing knowledge into the public domain will help nip patents in the bud.
    4. Preserve Fair Use: University's trends toward turning research into money is threatening the basis of fair use for researchers. How long will it take intellectual property owners to get regulations on "fair use" because academic research has turned into another big, for-profit, corporate enterprise.

    Arguments against open-source science:
    1. Replication: Science thrives on replicable results. But the key is independent verification. If one scientist simply reuses another scientist's code, there is a chance (high, some would argue) that faults in that original code would corrupt both scientists' results. Closed-source forces independent verification.
    2. Commercialization: If the point of publicly funded academic research is to create widely-used products and services, then the system needs some scheme for protecting the value of intellectual assets. Where the cost of bringing the product to market is very high (e.g., pharma), the company/investors needs some assurance that another company can't just copy the results when the product comes out.

    I'm sure there are arguments on both sides.
    --
    Two wrongs don't make a right, but three lefts do.
    1. Re:Arguments for & against open-source by goodmanj · · Score: 5, Insightful
      To frame replication of scientific results as an "open source" debate is both a no-brainer and misleading. A no-brainer because if an investigator does not provide enough information to allow their colleagues to replicate their work, they are not doing science -- in that sense, all science is "open source". Misleading because scientific ethics do not require totally open sharing of source code: it is sufficient to verbally describe the algorithms and data used in enough detail that someone else can repeat the experiment. In practice, journal article page limits often require that this description happens on a person-to-person level, rather than in published literature.

      Most of the "arguments against open-source science" mentioned here are not about science at all. The secrecy surrounding commericial and national-security "science" is good only in a financial or political sense: they do not help science, per se, at all. And personality conflicts are a factor as well: I suspect that Mann et al's reluctance to release source stems from an extreme personal frustration at McKitrick et al's persistent and (in my view) not always well-supported attacks.

  6. Voodoo, not science by climb_no_fear · · Score: 5, Insightful

    I'm of the opinion that anything that gets published should be published in its entirety, at least at some point. For example, people who publish protein structures can put the coordinates "on hold" for up to 18 months.

    And to say because the research is done with "taxpayer's money" is missing the point: If you can't reproduce every step, it's voodoo, not science. And we make policy decisions based on science, not voodoo (I hope).

  7. real government research goals by MyRuger · · Score: 5, Informative

    Obviously you have never written an SBIR or BAA. You when you do research "At the tax payers expense", you need to show your plans to commercialize the results of the research. The government wants you to create a IP twoards a commerial project which will spur the economy, not to contribute to the scientific community as a whole. Take it as you will, but I think that most research would not get funded if your commertilization plan was to release it on sourceforge.

  8. taxpayers vs boffins by dos_dude · · Score: 4, Informative

    This is an extremely difficult issue, although it sounds pretty trivial.

    For one thing, the taxpayer is rarely participating in discussions like this one. Moreover, the success of scientific institutions is often measured in terms of number of patents, successfully launched businesses by former students/researchers, etc. So not only is there little or no opposition to closed-source software (or scientific articles!), there are also good reasons for researchers to go the closed-source road.

    Some researchers have a tendency towards secrecy. Some even seem a little paranoid when it comes to their data and methods. You could compare this to the tendency of the OSS zealot to suspect bugs, glitches, and omissions in any piece of closed-source software.

    And as a German side-note: There are laws over here that require you to have the patentability of any piece of software you develop checked by university lawyers. GPLing something is technically illegal for a researcher. I have no idea how this is regulated in other countries.

  9. Re:But the Hockey Stick is True! by BenBenBen · · Score: 5, Insightful

    That Global Warming is a manmade, real phenomona is accepted by 99.9% of scientists in the fields involved. To trot out the "only a theory", "some experts dispute" etc routine is like getting the Flat Earth Society involved every time someone talks about circumnavigation. "Heads in the sand" is going to be on our culture's gravestone when the next lot of intelligent life evolves here and starts wondering why parts of Nevada are 10,000 times the normal radiation level.

    --
    The Slashdot Paradox: "100% Overrated"
  10. Re:Short answer, no. by Glock27 · · Score: 4, Insightful
    Should taxpayers be able to see information on secret military projects as well?

    Not when the secret is current, and espionage is a concern. This is of course the current state of affairs.

    Once the secret is no longer of military importance, all information that can be released should be released. In general that's what happens - note the wealth of information available on the WWII atomic weapon program for instance.

    --
    Galileo: "The Earth revolves around the Sun!"
    Score: -1 100% Flamebait
  11. with open source, everyone can see you're dumb by Nick+Barnes · · Score: 5, Informative
    See this debunking of McKitrick's work, showing, among other things, how he:
    • denies that average temperature is meaningful,
    • confuses degrees with radians,
    • invents a whole new temperature scale,
    • replaces missing data with zeroes
  12. Re:Short answer, no. by arkanes · · Score: 4, Insightful

    The problem of course is who decides how secret something is and when it shouldn't be a secret anymore. The government (the current administration moreso than most, but not *much* more) has a definite "make it secret" reflex. It's shocking to me how little outcry various abuses of that have been (like a redacted FOIA on an internal audit, where the redacted sections were exposed, and were revealed to be the parts of the audit where the agency failed). Sadly, although unsuprisingly, both the populace and the government have forgetten who's supposed to be the servant.

  13. Covert Perpetuation by Doc+Ruby · · Score: 5, Insightful

    Where's that wealth of information about the secret US wars in Central America in the 1980s? Or in Angola in the 1970s? Or in Chile in the 1970s? Or in Cambodia and Laos in the 1960s? Iran in the 1950s? These secret wars are secret largely for *political* purposes - the military secrecy benefits evaporate within months. But the political purposes - covering liability for abuse, war crimes, and just plain lying about the causes, effects, and benefits of the war - those last forever.

    --

    --
    make install -not war

  14. Obviously YES by radtea · · Score: 4, Insightful

    How many times have you asked someone, "What does your code do to solve this problem?" and got a description of an algorithm which, when you finally get to see the source, does not match the code?

    In my case, the answer to that question is, "Lots." I have had it happen in pure science (neutrino physcis), applied science (medical physics) and software development (database programming, data analysis, etc.)

    I am painfully aware that my own published descriptions of algorithms have often left out minor details that may be critical in some applications, but that page limits in peer-reviewed journals necessitate. It is not uncommon to get a call from someone doing similar work asking for details about what you've done, how you've done it, and in some cases, asking to look at source code.

    In contentious areas of science such requests are not always met with full disclosure, which is a sign that the people involved are no longer doing science. They are doing politics. This happens a lot, and it brings the scientific process to a halt on the question at issue.

    In the case at hand, the original authors have done a very poor job of describing what they have done, and an extremely poor job of defending their work. Their refusal to publish their source code for their analysis gives credibility to their critics.

    There are certainly legitimate cases where code ought not be published. If a lab has spent many, many years developing a framework for solving a certain type of problem and wants to get the most advantage out of that framework before releasing it, they may reasonably want to limit it's disemination for a while. But those sorts of reason don't apply in this case, and the source should be made available to anyone who wants to reproduce their actual results. That would just be good science.

    --Tom

    --
    Blasphemy is a human right. Blasphemophobia kills.
  15. Science Fails by tjstork · · Score: 4, Insightful

    One's interests in keeping clients does not entitle you to make a scientific claim that cannot be peer reviewed. If a paper such as Mann is now regarded as fact, and indeed, makes policy, despite the obvious sloppiness regarding its data management process, then, what is the point of science anyway?

    Science is supposed to be about peer review, rigor, that every assumption behind every assertion can be challenged. If, all we have is someone with a Phd can claim that they have a fact as our science, then, what is the point of even trusting them?

    Without independent verification and an open process, there's nothing to separate scientists from creationists, and the people are going to pick whoever makes the most attractive sales pitch.

    --
    This is my sig.
  16. Re:If you were wondering what real scientists thin by jnaujok · · Score: 4, Informative

    You do know that Mann writes this website, right? You do realize that the source of your argument (http://www.realclimate.org/) is a shill for Mann and his cronies?

    Second of all, there was a flaw in the original algorithm that was pointed out by McIntyre and McKitrick before they even got to the bad data being put into the equation.

    And, to top it off, Mann's equation always produces hockey-stick graphs, even with randomly distributed data.

    Don't point at Mann's own site as a defense of Mann.

    --
    Life, the Universe, and Everything... in my image.