Slashdot Mirror


UK University Researchers Must Make Data Available

Sara Chan writes "In a landmark ruling, the UK's Information Commissioner's Office has decided that researchers at a university must make all their data available to the public. The decision follows from a three-year battle by mathematician Douglas J. Keenan, who wants the data to do his own analysis on it. The university researchers have had the data for many years, and have published several papers using the data, but had refused to make the data available. The data in this case pertains to global warming, but the decision is believed to apply to any field: scientists at universities, which are all public in the UK, can now not claim data from publicly-funded research as their private property." There's more at the BBC, at Nature Climate Feedback, and at Keenan's site.

352 comments

  1. Sudden Outbreak of Common Sense by nacturation · · Score: 5, Insightful

    The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

    --
    Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    1. Re:Sudden Outbreak of Common Sense by eviloverlordx · · Score: 0, Troll

      Glenn Beck, is that you?

      --
      'Loose' is when your pants are three sizes too big. 'Lose' is when you misuse 'loose'.
    2. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 2, Informative

      Now if only the same rules were applied to the fraudsters who promote evolutionism...

      Responding to a troll, I know... but if you really want the data on evolution (as opposed to foaming at the mouth and making up words to make yourself feel better about the mythology you chose that tells you that faith is when you blindly believe while being unable to show any data [Hebrews 11:1, bitches]): http://talkorigins.org/

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    3. Re:Sudden Outbreak of Common Sense by eviloverlordx · · Score: 1, Insightful

      So responding to an AC is trolling? Someone needs to learn to moderate.

      --
      'Loose' is when your pants are three sizes too big. 'Lose' is when you misuse 'loose'.
    4. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      I'm sure people with a dogmatic axe to grind will prove an annoying if minor fault. Creationists regularly mangle papers, taking quotes out of context and all. I can't imagine them being pacified by the messy data.

      Oil companies and people who are dead set against thinking we -might- be changing the atmosphere will undoubtedly cherry pick out from the data, take things out of context from studies supporting climate change as a theory, and those people whose support of climate change is based more off of religion than science will do the same to studies that reach opposite conclusions. It will be extremely annoying to those of us who aren't convinced one way or the other, and rather than focus on good science, the media will focus on the new controversies this will spawn, making even fewer people care one way or the other.

      I take a dim view of the public's ability to do anything useful with raw data, but I recognize I'm a bit of an elitist jerk. I think this might help a few cases of where one researcher gets opposite conclusions based on their own data, and this will allow direct comparison of the two data sets, and that's really all the good this will accomplish.

      I'm not in the UK, nor am I a climatologist.

    5. Re:Sudden Outbreak of Common Sense by blackraven14250 · · Score: 0, Flamebait

      I imagine you propose the fraudsters taking your charitable church contributions and the fraudsters promoting the Republican party are exempt?

    6. Re:Sudden Outbreak of Common Sense by c++0xFF · · Score: 2, Insightful

      Opening the data will encourage further research. The data will be available for others to use, instead of forcing constant duplication.

      "Standing on the shoulders of giants" means to build on what has been done before. Hiding the source data shows just how "little" you are.

    7. Re:Sudden Outbreak of Common Sense by Plazmid · · Score: 1

      So the question is, is it possible to request data on ANY publicly funded research going on in the UK? What about research on SILEX(http://en.wikipedia.org/wiki/Silex_Process), a laser based uranium enrichment process that is much more efficient than other enrichment processes(currently very very classified) or military research?

    8. Re:Sudden Outbreak of Common Sense by NeutronCowboy · · Score: 4, Insightful

      Creationists regularly mangle papers, taking quotes out of context and all.

      Get ready for an onslaught of mangled data analysis, with data being taken out of context, the results published to some blog, and people making policy decision based on those blog postings.

      the media will focus on the new controversies this will spawn

      That's a guarantee. While in theory, I welcome this development, I suspect that in practice it will lead to more chaos than before. Not because the data is shoddy, but because some meteorologist will think that running a data set through an excel curve fitting algorithm is science.

      --
      Those who can, do. Those who can't, sue.
    9. Re:Sudden Outbreak of Common Sense by smoothnorman · · Score: 2

      There is no question that having the data released eventually should be the rule. It shouldn't even be considered proven science until it can be thoroughly recreated. However, the tricky bit is mandating exactly by when it must be released. If a lab has spent a long time, let's say 10 years, accumulating some hard fought data, they should be allowed the benefit of a few publications before releasing all the data so that better (likely privately) funded labs do to the easy rapid analysis and 24/7 postdoc tag-team writing abuse and thus steal all the reward. Give them say... the sqrt(years_to_collect_the_data) out of encouragement to continue to do the heavy lifting. (my experience of this situation comes from protein crystallography and deposition of the hard won data there)

    10. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 1

      So responding to an AC is trolling?

      By responding you give them what they want. I'm probably needing to take my own advice, but there's always a chance you actually were confused.

    11. Re:Sudden Outbreak of Common Sense by Karganeth · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      No, it isn't. The fault is that the data may contain sensitive information. The Army collects data about enemies, should that be free access for the public? Nope. (I'm not arguing against making university data public, but your logic is flawed)

    12. Re:Sudden Outbreak of Common Sense by jacksonj04 · · Score: 4, Insightful

      You know the multi-billion dollar LHC? Guess what they did their first physics on. Not finding new exotic particles, but proving that what we think we know so far still stands up. Duplicating data is exactly how things get proven and disproven. If Group A and Group B use exactly the same source data there's no possibility of Group B proving Group A's research wrong.

      --
      How many people can read hex if only you and dead people can read hex?
    13. Re:Sudden Outbreak of Common Sense by zeropointburn · · Score: 1

      No. Classified data is still protected by law, even if funded publicly and researched at a public university. Courts are extremely unlikely to ever decide that classified data should be blindly released for any reason, and the public nature of the funding behind it would not be grounds for release.

      --
      -1 raving lunatic; +6 subGenius... Things even out...
    14. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 5, Insightful

      some meteorologist will think that running a data set through an excel curve fitting algorithm is science.

      Nope -- it's only science if you adjust and filter the data first to make it match your truth. Resist releasing your data though, others may adjust and filter it other ways to make it match their truth. All science in the world of research driven by political agendas and egotistical arrogance.

      Disclose, when in doubt disclose more. Anything less in scientific arenas where others can't repeat your experiments is just a symptom of fear, insecurity, and lack of confidence that your conclusions will stand up to the view and study of many brains (some better than yours, some worse).

      Same argument for why FOSS is better - many eyes reviewing (in theory) and rapid fixes.

    15. Re:Sudden Outbreak of Common Sense by maxume · · Score: 3, Informative

      Even worse, some hack might shove the data through some perl code:

      http://www.timesonline.co.uk/tol/news/uk/article7028418.ece

      --
      Nerd rage is the funniest rage.
    16. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 3, Insightful

      some meteorologist will think that running a data set through an excel curve fitting algorithm is science.

      Nope -- it's only science if you adjust and filter the data first to make it match your truth.

      I don't think that's what he was saying. He's saying this will lend itself to overly simplistic interpretations. Which is a good prediction in climatology, considering what people got out of "climategate."

    17. Re:Sudden Outbreak of Common Sense by thepike · · Score: 5, Insightful

      I totally agree. If people just start looking at each others data instead of verifying it, a lot of mistakes (or fraudulent data) will never be caught.

      Also, I have to wonder what the timeline for releasing data is. My research is funded with government money (NIH and NSF) but it can take years to get enough data to make a worthwhile paper. If I have to release my data before then it will hurt my ability to publish papers without getting scooped. You could end up with a whole closet industry of people just data mining the data others have had to disclose. And, here's the main catch, if you don't have to release results you haven't yet reported on, the problem isn't solved at all because I could just choose to "not yet publish" any results that don't agree with what I want to say. Nothing says I ever have to publish results I get, so why wouldn't I just sit on them?

      Not that sitting on data just because it doesn't agree is a good thing, but it happens. And plenty of good data goes unpublished (experiments fail, uninteresting results happen, journals don't publish negative results very often etc) so what about that data? Overall this law isn't going to help anything, and will just cause issues.

    18. Re:Sudden Outbreak of Common Sense by Jason+Pollock · · Score: 1

      Except the public who paid for the data isn't the same as the public who are paying the researchers.

      Large amounts of the data under discussion are from _foreign_ governments. Additionally, researchers frequently have to sign confidentiality agreements in order to gain access to health records and other data. If that needs to then be public, they won't have access to it.

      Stupid judge, stupid finding.

    19. Re:Sudden Outbreak of Common Sense by eviloverlordx · · Score: 1

      Actually, my initial comment was intended to be humorous, as the AC IMHO did a good job replicating the style of Mr. Beck. That Mr. Beck's way of speaking resembles The Almighty Shatner is a topic for another discussion :).

      --
      'Loose' is when your pants are three sizes too big. 'Lose' is when you misuse 'loose'.
    20. Re:Sudden Outbreak of Common Sense by Obfuscant · · Score: 4, Insightful
      If Group A and Group B use exactly the same source data there's no possibility of Group B proving Group A's research wrong.

      Wrong. If Group B cannot duplicate Group A's analysis of the data, that proves that Group A did something wrong and probably came to the wrong conclusion.

      If Group B cannot duplicate the experiment and get the same data (and knowing that means being able to compare both sets) that calls the experiment as a whole into question.

      There is more to science than simply applying equation A to data B and getting number C.

      This hubbub all came about because of the difficulty in prying the source data out of the hands of the guy who produced the "hockey stick" figures. It's covered in the book "Broken Consensus" I think it's called. The "hockey stick" is not the "source data", the source data is all of the individual readings from all the instruments, prior to corrections for sampling errors or known issues. One cannot verify the quality of the "hockey stick" result without having the source data and being able to verify the processing steps that were done to it.

      The downside to free and open access to all data is that research groups get grants to collect AND process the data to come up with results. Opening the data up for free access means that other groups, who have more interest in scooping than being right, have more ability to do that scooping. That leaves the people who did the work in the cold. There is good reason to delay opening the data until the group being paid to collect it has a chance to use it.

    21. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 2, Interesting

      Creationists regularly mangle papers, taking quotes out of context and all.

      Would you *really* expect anything else? They do the same thing to the Bible, and they LIKE the Bible.

      Personally, I'm not of the opinion that anyone who seriously believes the planet was instantly created 6000 years ago should be permitted to SPEAK in the debate on climate change. How can you argue about the interpretation of 20k year old ice core data with someone who believes that core was put there by THE DEVIL to confuse people?

    22. Re:Sudden Outbreak of Common Sense by SETIGuy · · Score: 2, Insightful

      Sure, I'll give you the data. But I wasn't funded to put the data in a format that's easy to understand. I've also got a job, and I don't get paid to support a competitor's data analysis attempts. Good luck.

    23. Re:Sudden Outbreak of Common Sense by Jah-Wren+Ryel · · Score: 1

      I don't think that's what he was saying. He's saying this will lend itself to overly simplistic interpretations. Which is a good prediction in climatology, considering what people got out of "climategate."

      100% agree. But nothing in life is free. The hoi polloi will continue the mutual masturbation fest, but actual scientists with the right backgrounds to do something useful with the data will now have a much greater opportunity to do something useful with the data.

      --
      When information is power, privacy is freedom.
    24. Re:Sudden Outbreak of Common Sense by PopeRatzo · · Score: 1

      but I recognize I'm a bit of an elitist jerk.

      These days, from what I can tell, all it takes to be an "elitist jerk" is going past the 12th grade.

      So you are in very good company, my friend.

      --
      You are welcome on my lawn.
    25. Re:Sudden Outbreak of Common Sense by michaelwv · · Score: 5, Insightful

      Absolutely. The public should have access to the data. Public grants then also need to pay for curating the data. Libraries aren't free, archives aren't free, package data in an actually useful form takes precious time, which is scientists most precious resource. Having data in a form that is useful to the 25 people in your research group is very different than providing data that can be used by thousands of people. It's analogous to the difference between the quick bash script you have that backs up your movies to your external hard drive, and having something that you're willing to distribute to 1000 people and provide support.

    26. Re:Sudden Outbreak of Common Sense by oldhack · · Score: 1

      Not very good analogy. Particle physics generate data by experiments. You can't "generate" data in observational sciences.

      --
      Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
    27. Re:Sudden Outbreak of Common Sense by brettper · · Score: 1

      Yes, but Group B may find a mistake in Group A's analysis of the data. This is still useful

    28. Re:Sudden Outbreak of Common Sense by finarfinjge · · Score: 4, Insightful

      I'll probably get flamebait or troll for this too, but this has always been the danger of the over-advocacy of climate change. Climate science is not even close to "settled". Nor is evolution, nor is physics. Well established and able to make verifiable predictions yes. Settled. No. The direct result of making the absurd claim that some cutting edge field of science is settled is this. Some complete moron then says "see, global warming wasn't settled, so evolution is bunk too" I've seen similar idiotic comments about plate tectonics as well. A number of years ago (far enough back it hasn't been cached), I wrote here that as scientists, we had better be right about climate change. Now we reap what we have sown. If it annoys you that idiots make claims like "global warming wasn't settled, so how can you be sure about evolution", look to the strident supporters of the cause. They (I'm talking about realclimate etc., here) are as responsible as Beck. By hammering any and all dissent without any concern as to the validity of the claims, they have made this type of comment inevitable. We will be seeing much more of it and we have only ourselves to blame.

    29. Re:Sudden Outbreak of Common Sense by jd · · Score: 1

      Examining old data has one value and one value alone - verifying that the claim made for the data matches up with the data. It does not verify that the data is ok, or that the experiment is valid, but only that the conclusions match up with the data. This is something I'd totally approve of for refereeing papers, rather than the standard routine skim-through. Referees aren't glorified spell-checkers and should be given sufficient information to confirm that the paper is self-consistent.

      Access to historic data (data in excess of, say, 25-30 years old) might be interesting - less from the perspective of the scientific discipline and more from the perspective of historians and those studying the way science is done.

      Access to raw data for any other reason is pointless.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    30. Re:Sudden Outbreak of Common Sense by martin-boundary · · Score: 3, Insightful
      Who cares? Are you arguing for science, or for little confidentiality fiefdoms?

      There is literally no point in doing Science (with a capital S) if the data isn't available for scrutiny by everyone. Without scrutiny, it's all he said/she said, rumours and bullshit.

      As to signing confidentiality agreements etc, there comes a time when a researcher has to decide: does he want to contribute to human knowledge (=> don't sign) or does he just want to wank around with secret data (=> sign it)?

      It sucks to be unable to use purportedly available data, just because it can't be divulged, but it's better that way in the long run.

      Unsupported data is worse than useless, it's a cancer that grows every time someone else quotes the unsupported result, until it gets to the level of unchallenged folk wisdom within the community.

    31. Re:Sudden Outbreak of Common Sense by Rogerborg · · Score: 5, Interesting

      If a lab has spent a long time, let's say 10 years, accumulating some hard fought data

      If a lab has been spending my tax money for 10 years, I want my employees to give me my data right Goddamn now.

      The "reward" for doing publicly funded research is that you keep getting funded. I don't care one whit what you think you're entitled to: if you're taking my money, you work for me.

      --
      If you were blocking sigs, you wouldn't have to read this.
    32. Re:Sudden Outbreak of Common Sense by the+gnat · · Score: 4, Insightful

      my experience of this situation comes from protein crystallography and deposition of the hard won data there

      Ah, a fellow crystallographer. Welcome, brother!

      I was about to post a similar comment. However, I only agree with you up to a point. Once you publish a paper reporting the structure, all of the raw data should be made publicly available (including diffraction images - although deposition of those isn't quite feasible yet). I would apply the same standard to any other field: you shouldn't publish until you are comfortable releasing the underlying data. I don't care if you're still working on some super-secret follow-up paper, as far as I'm concerned your publication is useless if I can't go to the PDB and download the coordinates. And if you're using public resources to solve your structure (like NIH funding, or one of the DOE's synchrotrons), your results are public property.

      There was once intense resistance to even mandating coordinate deposition (long before I got started in the field), which just sounds insane now. Some of the people doing the most complaining were in fact some of the best funded. A decade later, the field went through the same bullshit whining with regard to reflection data. Now most journals require both coordinates and reflections, and not only has the field not suffered in the slightest, many more studies are now possible and the majority of structures can be solved without experimental phasing. If we'd left things the way the naysayers wanted it, every group attempting to study, say, ribosome structure would have to either plead with more senior groups for coordinates in order to solve their structures (and, almost certainly, further bloat the author lists and potentially cede some control over their project - which, I imagine, would have suited the senior faculty just fine), or waste half a decade making heavy metal derivatives. It is difficult to convey to non-crystallographers how huge a waste of time and money - most of it coming from tax dollars - this scenario would be.

      Now, where it gets messy is situations where you have to release data ASAP, instead of waiting until publication. American structural genomics groups do this (it may be a requirement of the NIH), but PDB deposition is more of an endpoint in itself for them, and no one is going to bother trying to scoop them on most of those proteins. Genomics centers also do this. A grad school classmate of mine worked on a sequencing project where much of the gruntwork was performed by the DOE, and they had extremely strict release rules. She complained that other groups (of bioinformaticists) could start analyzing the data before she'd had a chance to complete her own studies, because the outsiders didn't have to spend a lot of time thoroughly annotating the genome before publishing. (I don't think it held her back in the end - she graduated with several papers in Science.) In many situations like this, to obtain the data you need to agree to an embargo on publications, to prevent that sort of underhanded behavior. I saw an article retraction recently where the scientific content was undisputed, but the investigators had (unintentionally, it appeared) broken an embargo by submitting the paper when they did.

      In general, I think the scientific community - especially the part funded by the public - should err on the side of maximum disclosure of data, and I don't have much sympathy for the researchers in this story (and I'm not particularly sympathetic towards "climate skeptics" either). I do worry that rules will be used to harass researchers in supposedly controversial fields (Richard Lenski's adventures with Conservapedia are a particularly nauseating example), but as a scientist, I also think the benefits of making massive amounts of data available to anyone are far too important to let these risks bother us, and the drawbacks of keeping such data private are much worse than having to fight off the occasional knuckle-dragging lunatic.

    33. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      "on floppy disks and would take too long to transfer to a digital version"

      umm....what?

    34. Re:Sudden Outbreak of Common Sense by martin-boundary · · Score: 1
      That's not necessary. For example, license the data with a BSD style advertising clause, then the lab could point to all the papers that have used their data, and get credit for them by the granting bodies.

      The exact details would be debatable. One extreme is to just have a standard acknowledgement in the references. At the other extreme, the data might be licensed in such a way that the lab gets to be an automatic author of any paper that uses the data. That way, the other researchers don't have to wait, and the lab gets publications for free.

      There's risk in both these extremes: a simple reference is easy to overlook, and an automatic authorship might not be desirable for low quality papers, etc. Still, this could be better than hoarding data for 10 or more years.

    35. Re:Sudden Outbreak of Common Sense by the+gnat · · Score: 2, Interesting

      If a lab has been spending my tax money for 10 years, I want my employees to give me my data right Goddamn now.

      Okay, but does that mean you should get to see the data before they're done analyzing it, before they can write a paper on their results? If we instituted such a rule, there would be nothing to stop scientists from bombarding their competitors with FOIA requests, and using the released data to scoop them. At the very least we'd need embargo rules, but even that won't entirely prevent abuses of the system. Most basic research isn't just a system of data factories, careful analysis by experts is essential for interpreting the results, and if scientists don't have some assurance that they'll be permitted to publish these analyses before their competitors stomp all over them, the academic system would simply break down. (Or is that what you want?)

    36. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      eviloverlordx (99809) wrote:

      So responding to an AC is trolling?

      interkin3tic (1469267) replied:

      By responding you give them what they want. I'm probably needing to take my own advice, but there's always a chance you actually were confused.

      Um, no, I don't think he was confused. You, on the other hand ...

      (Hint: look at the user ID numbers before you insert foot into mouth ... :)

    37. Re:Sudden Outbreak of Common Sense by tsm_sf · · Score: 1, Funny

      Wrong. If Group B cannot duplicate Group A's analysis of the data, that proves that Group A did something wrong and probably came to the wrong conclusion.

      Or it could prove what I've been saying all along, that Group B is chock full of mouth-breathing knuckle-walkers who I wouldn't trust to clean glassware let alone run projects.

      --
      Literalism isn't a form of humor, it's you being irritating.
    38. Re:Sudden Outbreak of Common Sense by peter+in+mn · · Score: 1

      If a lab has been spending my tax money for 10 years, I want my employees to give me my data right Goddamn now.

      Thus speaks someone who has never worked with data. Data isn't just a number. You really have to know just what question the number is an answer to, how and where it was collected -- and when, and what got done to the number along the way. There have been some public data repositories, but it's hard to write this stuff all down, and analyzing data without knowing where it comes from has lead to some pretty screwy conclusions. This kind of forced public access isn't completely wrongheaded, but there's a lot wrong with it.

    39. Re:Sudden Outbreak of Common Sense by youngone · · Score: 1

      I'm gald you responded, as the Troll was so funny. It really brightened up my day.

    40. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 4, Funny

      Creationists regularly mangle papers, taking quotes out of context and all.

      Get ready for an onslaught of mangled data analysis, with data being taken out of context, the results published to some blog, and people making policy decision based on those blog postings.

      Hmm... I think you've brought up another valid point: some researchers might take the data, rehash it and publish it as their own, getting credit for it, much as you have taken my point, restated it with a minor additions, and got all the mod points for it.

      Which is to say, I see what you did there ;)

    41. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 1

      Um, no, I don't think he was confused. You, on the other hand ... (Hint: look at the user ID numbers before you insert foot into mouth ... :)

      So he's been on slashdot longer, he's less likely to be wrong than I am?

      Nice try, eviloverlordx!

    42. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      No, it isn't. The fault is that the data may contain sensitive information. The Army collects data about enemies, should that be free access for the public? Nope. (I'm not arguing against making university data public, but your logic is flawed)

      While you're absolutely correct in your exception, this discussion is about university researchers. I thought it was a reasonable assumption that they're not collecting and researching classified data.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    43. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      Sure, I'll give you the data. But I wasn't funded to put the data in a format that's easy to understand. I've also got a job, and I don't get paid to support a competitor's data analysis attempts. Good luck.

      You know, actually, I think you were. First of all, if you're PUBLICLY funded, your competitor is part of the public. Too bad.

      Secondly, do you really think I'm going to believe your research if your data isn't properly organized?

    44. Re:Sudden Outbreak of Common Sense by mkiwi · · Score: 2, Funny

      A mathematician, an engineer, and a computer scientist are the final candidates for the top tech spot at a major corporation. They are summoned one by one to be interviewed.

      The mathematician goes to the interview. The person interviewing him is the CEO of the company. Only one question is asked: "What is 1+1?"
      The mathematician pulls out a pen and paper, makes a few scribbles, and says "This is proof that 1+1=2!"

      The engineer goes to the interview next. The CEO asks him the same question, "What is 1+1?"
      The engineer promptly grabs a calculator from his pocket, types in 1+1 and presses the equals sign. He shows the result to the CEO: "This calculation proves that 1+1=2!"

      The computer scientist is last. He is nervous, but fairly calm. The CEO asks him the same question.
      The computer scientist pauses, scratches his head for a second, and pulls out his laptop and asks "What do you want it to be?"

    45. Re:Sudden Outbreak of Common Sense by martin-boundary · · Score: 2, Interesting

      The downside to free and open access to all data is that research groups get grants to collect AND process the data to come up with results. Opening the data up for free access means that other groups, who have more interest in scooping than being right, have more ability to do that scooping. That leaves the people who did the work in the cold. There is good reason to delay opening the data until the group being paid to collect it has a chance to use it.

      Why do you think that delaying is necessarily the correct solution to the scooping problem? There are plenty of alternatives that could solve the problem.

      For example, the lab could license the data to anyone on the condition that someone in the lab (or the lab itself) is listed as a co-author of any paper that uses its data. That way, no scooping is possible, and outside researchers could still analyse the data as soon as they want.

    46. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      "There is good reason to delay opening the data until the group being paid to collect it has a chance to use it."

      yes, but how long should we wait. If a penurious researcher has to wait for a succession of tiny grants and slave labor grad students to work for the stipend offered, it could be years and years.

      It's also not unheard of for the Principal Investigator in charge to essentially hold the data hostage to ensure a series of grants for his proteges to analyze it. It's sort of another form of "buying in" to a contract in the commercial world, hoping you'll "get well" on the inevitable change orders. The principal investigator lowballs the cost of the experiment to get under the grant cap, putting in off-books sweat equity of his/her own and that of the grad students, hoping that "preliminary results will be promising" to encourage the granting agency (or others) to fund later analysis.

    47. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 4, Insightful

      Sure, I'll give you the data. But I wasn't funded to put the data in a format that's easy to understand. I've also got a job, and I don't get paid to support a competitor's data analysis attempts. Good luck.

      Your so-called competitors will be sure to mention your viewpoints when your funding runs out and you apply for more. Not only is your research not easy to understand and you don't let others analyze the data to attempt to reproduce your conclusions, but you think that other members in the scientific community are competitors and you feel a need to sabotage their efforts by making it difficult for them to use taxpayer-funded data to advance science. If science is such a business to you, then how about you fund it all yourself from the profits you make?

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    48. Re:Sudden Outbreak of Common Sense by shaitand · · Score: 1

      In other words, regardless of what facts might actually be contained in this data you've already made up your mind. Anything to the contrary "will undoubtedly cherry pick out from the data, take things out of context from studies supporting climate change as a theory, and those people whose support of climate change is based more off of religion than science will do the same to studies that reach opposite conclusions."

      *wipes a bit of foam from the corner of your mouth with his handkerchief* You don't exactly sound like a neutral undecided to me.

    49. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 1

      Absolutely. The public should have access to the data. Public grants then also need to pay for curating the data. Libraries aren't free, archives aren't free, package data in an actually useful form takes precious time, which is scientists most precious resource. Having data in a form that is useful to the 25 people in your research group is very different than providing data that can be used by thousands of people. It's analogous to the difference between the quick bash script you have that backs up your movies to your external hard drive, and having something that you're willing to distribute to 1000 people and provide support.

      Mod this guy up... definitely excellent points.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    50. Re:Sudden Outbreak of Common Sense by HungryHobo · · Score: 5, Insightful

      not really.
      Your problems with these possible situations are based on the deeply flawed system we have in place now.

      Give academics the respect and credit they deserve for collecting vast quantities of high quality data rather than merely for the 2 page paper they write about some interesting statistical anomalies they found in said data and this ceases to be a problem.

      The way papers are written, reviewed and published today and the way academics are given credit is based on a system hundreds of years old when it costly to print hundreds of pages of boring figures.

      Now data is cheap beyond words. Publishing a few hundred words or a gigabyte is little different when your audience is fairly small and the way academics publish should reflect that but it's too hidebound and dogmatic to do that.

      A professor who does nothing but produce a high quality and hard to acquire dataset deserves credit even if he comes to no conclusions at all.

      The problem is with the system and with the way academics think.
      Not with this possible change.

      Fix your system.

    51. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      Public or available on request for reasonable fee? I don't know about the data in question, but some datasets are truly massive and the required bandwidth/physical media could be quite expensive to the researcher, especially if it is widely requested.

    52. Re:Sudden Outbreak of Common Sense by NeutronCowboy · · Score: 5, Funny

      I think you've brought up another valid point: some researchers might take the data, rehash it and publish it as their own, getting credit for it, much as you have taken my point, restated it with a minor additions, and got all the mod points for it.

      I stand on the shoulder of giants. ;)

      --
      Those who can, do. Those who can't, sue.
    53. Re:Sudden Outbreak of Common Sense by the+gnat · · Score: 4, Insightful

      Give academics the respect and credit they deserve for collecting vast quantities of high quality data rather than merely for the 2 page paper they write about some interesting statistical anomalies they found in said data and this ceases to be a problem.

      The problem is that interpreting raw scientific data is enormously time-consuming, because there's so much information available that we can't possibly assimilate it all. I have a PhD in biochemistry and advanced training in crystallography, but I couldn't look at a ribosome structure and easily figure out what it meant, because I don't know very much about ribosomes. The people solving the structure, on the other hand, have exactly the background necessary to perform detailed analyses, and they will undoubtedly notice things that completely escape me. And I think you're understating the value of the scientific literature. A 2 page paper on statistical anomalies won't get you a faculty position at a major university, but a well-written 10 page paper on the meaning of a crystal structure certainly can. This is even more the case if they took additional time to perform non-crystallographic experiments to verify new hypotheses.

      I don't deny that there are issues with our system, but you're completely missing the point of writing papers. Simply generating massive amounts of data isn't considered science - figuring out what it means is. I say this as someone who is very good at generating data quickly, but not particularly good at interpreting it. Now I write data analysis software instead, and leave the question-asking to more suitable minds.

    54. Re:Sudden Outbreak of Common Sense by nospam007 · · Score: 1

      They had it in their shared Bearshare folder.

      If that's not 'making available' enough then RIAA, MPAA and lots of judges must be wrong.

    55. Re:Sudden Outbreak of Common Sense by Jason+Earl · · Score: 1

      No one is asking to see the data before papers can be published. That's a classic straw man argument. The FoI rules specifically cover ongoing research. In this particular case the group doing the research did so little publishing that they were actually closed down. No one is getting paid to analyze this data any more.

      The basic rule should be simple. Published papers should include source data. Take as long as you want to publish (of course, if you fail to publish, you'll likely end up in a different career), just don't try and publish your analysis of some data set without actually including the data set in question.

    56. Re:Sudden Outbreak of Common Sense by T+Murphy · · Score: 2, Insightful

      People making bad conclusions from good data is better than making (any) conclusions from no or bad data. By using good data, it helps give the proper scientists a chance to use logic and reason to correct people. We can't change the minds of creationists because we are not drawing our conclusions from the same 'data'. People believe in global warming because of data, now deny it because of doubt in the data. They may be impulsive and believe whoever speaks the loudest, but it does imply we can bring them to the next step and compare analyses, not just data.

    57. Re:Sudden Outbreak of Common Sense by drainbramage · · Score: 1

      No, responding to an AC is not the definition of trolling.
      But you knew that, didn't ewe?
      It was the trolling.
      Now go back to your lusting after dan rather.

      --
      No brain, no pain.
    58. Re:Sudden Outbreak of Common Sense by the+gnat · · Score: 1

      No one is asking to see the data before papers can be published.

      The comment I was replying to left that somewhat in doubt. It can genuinely take 10 years to figure out a problem before you're anywhere close to publishing, although in crystallography that is much less common than it used to be. And some publicly-funded researchers do indeed have to release their data immediately, whether or not they're ready to publish. These are special cases, and I'm inclined to favor such a policy anyway, but there are drawbacks.

    59. Re:Sudden Outbreak of Common Sense by interkin3tic · · Score: 0, Redundant

      In other words, regardless of what facts might actually be contained in this data you've already made up your mind.

      I don't think that's an accurate summary of what I said, no.

      You don't exactly sound like a neutral undecided to me.

      I was giving my opinion on the subject, so that inherently isn't neutral...

    60. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      Just because the public collectively owns the data doesn't mean that they need to give it away for free. Especially if they have spent a bit of money collecting it. Allowing the public institution to sell the data and use the money raised for research or teaching isn't that crazy an idea. There are all sorts of things that the public pays for but shouldn't expect or demand free access to. Can I just commandeer a ride in a police car if I need it, after all, my taxes paid for it, so I own it right?

    61. Re:Sudden Outbreak of Common Sense by Vornzog · · Score: 4, Insightful

      I work for a government lab that produces DNA sequences. We are obligated to release our data into a public database as soon as it has been verified for any samples that come from the US, and we release most of our foreign data, too, unless the other country involved gets pissy.

      Nothing good comes of that speed. We get crackpots thinking they've made major discoveries (not one real one yet), we get scooped for major papers (think Science), sometimes by our own collaborators using only our data and none of theirs, and we generally spend a lot of time, effort and *more money* on media spin control. There is such a thing as releasing the raw data too fast.

      We get a *ton* of FoI requests, too - people think we are withholding the good data, or being stubborn by not providing them composite statistics in exactly the format they want to see. The truth is, up until I got involved, the data management technology was so far behind the current bog-standard capabilities of the rest of the world, we couldn't actually answer the questions that were being asked, barring Herculean effort.

      Don't get me wrong, I think we *should* be releasing all of this data - delayed by just a bit. That way the people who generate it would have a better shot to get recognition/credit for their work, the crackpots would have less ammo for their rants, the press would be more likely to get the facts right the first time, and the scientific integrity of the whole process would be upheld, as everyone would get the raw data to review. It'd probably save a ton of money.

      The "reward" for doing publicly funded research is that you keep getting funded.

      Collecting good data is hard work, and the payoff is big publications, which you need if you want to continue getting funded. Once you've got that big publication in your pocket, though, you'd better by coughing up that data set. Otherwise, everything you say is suspect. Kudos to the UK for getting this half-way right, but they'd better set some reasonable constraints on the timing of these required data releases, or face any number of frivolous lawsuits from conspiracy theorists and 'data analysis specialists' who don't want to do any of the hard work themselves...

      I don't care one whit what you think you're entitled to: if you're taking my money, you work for me.

      I don't care if you are a ditch digger or a particle physicist. Doing all the hard work and getting none of the credit sucks regardless of what we are discussing or who is paying the bills. So put up or shut up. Would you be willing to do all of the grunt work in your job, but take none of the recognition? Most people wouldn't - those are the kinds of jobs that make people go 'Postal'. If you aren't doing it (and even if you are), do you really expect anyone else to?

      --

      -V-

      Who can decide a priori? Nobody.
      -Sartre

    62. Re:Sudden Outbreak of Common Sense by Obfuscant · · Score: 1
      For example, the lab could license the data to anyone on the condition that someone in the lab (or the lab itself) is listed as a co-author of any paper that uses its data.

      Ummm, would YOU want to be listed as co-author on a paper that publishes a conclusion about your data that you don't agree with? Do you understand what a legal can of worms is opened up by licensing things at a university, especially when it comes time to enforce?

      As for the other guy who replied to what I said, it's good that you've made your agenda public. To assume that someone who can't replicate your data a "knuckle-dragging mouth-breather" is less than unbiased science.

    63. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 1

      Can I just commandeer a ride in a police car if I need it, after all, my taxes paid for it, so I own it right?

      A police car is there to assist in the protection of everyone, not just a select few. So it's very analogous to the sharing of data. If you really want a ride, I'm sure one could be arranged as long as you don't mind sitting in the back seat and having no choice in the destination.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    64. Re:Sudden Outbreak of Common Sense by cdrguru · · Score: 1

      Most scientific research today is indeed a competition, although not for profit but for grant money and tenure. Failure to beat the competition results in no grants, no tenure and stuck in a dead-end role somewhere far less exciting.

      Check out the AIDS research history. This happens all the time, with just about everything that is in the least ways interesting.

      It has nothing to do with business or profit and everything to do with status and funding. If you work for a university, you are nothing without tenure and grants. If you work for a company, you are nothing without status, reputation and either grants or funding. If you aren't ahead of your competition, you are nothing and will be compensated accordingly. Here is your toilet brush, get to work.

    65. Re:Sudden Outbreak of Common Sense by LingNoi · · Score: 3, Insightful

      some researchers might take the data, rehash it and publish it as their own, getting credit for it

      While making reference to the original data? That's called science.

      While not referencing the original data? That's called plagiarism, it's happened in science before and usually ends your career.

    66. Re:Sudden Outbreak of Common Sense by stuckinphp · · Score: 0

      If it takes 10 years for this group on its own to publish results on global warming wouldn't it be smarter.. you know, for the human race, to release this data to the entire world so that it can be analyzed and inspected at a MUCH MUCH faster open source based rate?

      The research in question is funded by the people for the people to better the people.

      The research in question is NOT funded by the people for the people to better the Achievement Points of some professor/grad the world has never heard of.

      This issue of 'scooping' only comes into existence if the scientists are more worried about their careers/reputation/paycheck/streetcred than the real world changing problem they were asked to figure out / fix.

      In other words, go fuck yourself, you work for me, and this is my data.

      --
      if only
    67. Re:Sudden Outbreak of Common Sense by Ctrl-Alt-Del · · Score: 1

      The problem with the hockey stick was *not* the data, but the code. When the methods that Mann used to produce the code were eventually (and very reluctantly, I should add) made public, it was found that you could feed *any* data into the equations, even phone numbers taken from the phone directory, and you would get a hockey stick graph!

      The point is that for the scientific method to really work and be trustworthy, both the data and the methods used to analyse that data *must* be made available. That means good archiving of data (and floppy disks are not good archival formats); Climategate proved that the CRU and others either didn't look after their raw data, or were willing to lose it rather than hand it over.

      The result now is that most of the data that all computer modelling is being done from is based on homogenised data - in other words, the raw data has been fiddled with, in ways which are not documented. Now no-one can prove that this data being fed into models is even valid! (And trusting computer models is a fool's errand - see the fun and games modelling of volcanic ash dispersal has caused on world air travel in the past few weeks - those models just don't match what actual reality seems to be bearing out).

      To summarise: reliable science depends on the original data and methods being made available, so that other scientists can reproduce the original results, to confirm or deny that those results are valid. Hiding either data or methods, or as in the case of Climategate, wilfully destroying or obstructing the release, of either, goes against the most basic principles of science.

      This ruling is nothing but a good thing.

      --
      "Life is like a sewer - what you get out of it depends on what you put into it" - Tom Lehrer
    68. Re:Sudden Outbreak of Common Sense by LingNoi · · Score: 1

      how and where it was collected -- and when, and what got done to the number along the way

      All of which should have been answered in the research paper that the data was used for.

    69. Re:Sudden Outbreak of Common Sense by budgenator · · Score: 2, Interesting

      What if group B notices that a temperature station one day reports the temperature is -12.4C one day and 10 minutes later it's +12.4 C the next? On 2010-Apr-21 22:10, Drifting buoy 48534 did just that and that's an automated report, imagine the fun and games when human error gets added in! The data is bad, there is a lot of bad data points in the records and the records were never intended for the purpose they are being used for so quality control is even more critical. We really need a large number of human eyeballs looking at the data to find these problems.

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
    70. Re:Sudden Outbreak of Common Sense by HungryHobo · · Score: 2, Insightful

      The original objection was that if the data is hard to come by then it's unfair to academics who wouldn't get the credit after gathering the data.

      Of course simply generating massive amounts of data isn't science but it is a very very very important part of science.

      Is an academic who can write that well-written 10 page paper on the meaning of a crystal structure any less mentally capable because he didn't have the funds or facilities to gather the data he's looking at?

      If you open up the data then someone will undoubtedly notice things that completely escape the handful who got the data in the first place.

      The obvious solution is to give credit where credit is due and respect the ability of some people to perform good experiments.

      If economic systems were run the way academics operate then you'd end up with something like this:

      Nobody gets paid for raw lumber.
      Nobody gets paid for seasoned wood.
      Finished wooden items would be worth a fortune.

      And as a result anyone who wanted to make things from wood would have to own an area of forestry, logging equipment, a saw mill, a kiln and finally any tools for the final step.

    71. Re:Sudden Outbreak of Common Sense by budgenator · · Score: 1

      If Group B cannot duplicate the experiment and get the same data (and knowing that means being able to compare both sets) that calls the experiment as a whole into question.

      Group A can't even duplicate Group A's results.

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
    72. Re:Sudden Outbreak of Common Sense by martin-boundary · · Score: 1

      Ummm, would YOU want to be listed as co-author on a paper that publishes a conclusion about your data that you don't agree with? Do you understand what a legal can of worms is opened up by licensing things at a university, especially when it comes time to enforce?

      No, but I wouldn't mind being listed as a co-author on a paper whose conclusions I *do* agree with :)

      I don't think licensing data (as a standard community wide methodology) would be the can of worms you imply. Ideally, a standard boilerplate would be worked out that represents the standards and best practices of the particular scientific community, to promote sharing of data while giving out credit where it is due. The details would be worked out over an appropriate period of time.

      Since this is slashdot, the GPL and BSD licenses come to mind as successful examples with similar goals of sharing. You'll note that these licenses don't try to regulate all uses, rather they only regulate aspects of distribution/publishing. A corresponding boilerplate for scientific data would naturally concentrate on regulating published papers and books, which is what matters after all to the researchers in the field.

      Given the time it normally takes to review and publish/print papers in a journal, there's ample opportunity to contact the lab whose data is being used with a review copy, if that was actually required by the license. Or to contact the publisher of the journal with corrections.

      Maybe the license would not require a full co-authorship, but a special mention on the title page that their data is being used (eg Title, Author, Data Author, Abstract, Intro, etc.). The purpose here might be to show granting agencies the impact of the data (and the importance of the work that went into collecting it), as well as to make it easy for the data author to show up in bibliographic entries of citing papers.

      The alternatives to data hoarding are somwhere in there, IMHO.

    73. Re:Sudden Outbreak of Common Sense by rjiy · · Score: 1

      Huh? Why include the data provider as an "author"? Just because that's the way it's been done so far doesn't mean it can't change to something more appropriate. Maybe like a data citation.

    74. Re:Sudden Outbreak of Common Sense by AJWM · · Score: 1

      Or it could prove what I've been saying all along, that Group B is chock full of mouth-breathing knuckle-walkers

      That depends on whether or not Group C manages to reproduce A's results, or B's. In the latter case, it would tend to indicate that Group A contains the mouth-breathing knuckle-walkers.

      --
      -- Alastair
    75. Re:Sudden Outbreak of Common Sense by rjiy · · Score: 1

      I guess you're in favor of DRM and 20-year patents and 80-year copyrights as well? Because having access in those cases is also just as "pointless"?

    76. Re:Sudden Outbreak of Common Sense by Mr.+Slippery · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      Except that in some countries, the data collection is funded by selling datasets; and those datasets are sold under license that does not permit re-distribution. So I wonder how this ruling squares with UK and international copyright law.

      And sometimes, the data is archived on dead media (if it's available at all); recovering old data can be complex and expensive. Sure, for new data we can copy it to a website pretty easily (IP issues notwithstanding), but how much taxpayer money should be spent digging up old data, and the hardware to read it, to satisfy conspiracy-theory cranks?

      --
      Tom Swiss | the infamous tms | my blog
      You cannot wash away blood with blood
    77. Re:Sudden Outbreak of Common Sense by AJWM · · Score: 5, Insightful

      Examining old data has one value and one value alone - verifying that the claim made for the data matches up with the data. [...] Access to raw data for any other reason is pointless.

      Hardly. One could analyse the raw data looking for something other than what the original researchers were looking for. There might even be some interesting signal buried in the data that original team, focusing on something else, disregarded as noise. Minute timing errors in, say, solar wind data returned from a spacecraft might turn up some oddity of orbital mechanics, for example. The researcher focusing on the sensor data rather than the timestamps will miss it, but it's all part of the raw data. How many biologists discarded moldy Petri dishes as ruined, without recording that, before Fleming thought to investigate why bacteria didn't grow near the mold?

      --
      -- Alastair
    78. Re:Sudden Outbreak of Common Sense by rjiy · · Score: 1

      Kudos to you sir. "Release the data when you publish the paper" needs to beaten into the heads of some of these scientists.

    79. Re:Sudden Outbreak of Common Sense by Rockoon · · Score: 1

      ..and it doesnt really matter if it was answered or not by the paper.

      Knowing what to do with data is not a prerequisite for obtaining it. Once you open THAT can of worms then you need an arbiter somewhere making judgments about who does and doesnt know what to do with the data, which is exactly what these fucking scientists in TFA were trying to do, and that got bitch slapped for it.

      --
      "His name was James Damore."
    80. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      The research in question is NOT funded by the people for the people to better the Achievement Points of some professor/grad the world has never heard of.

      Pretty much is. Who do you think will get the next batch of money?

      This issue of 'scooping' only comes into existence if the scientists are more worried about their careers/reputation/paycheck/streetcred than the real world changing problem they were asked to figure out/fix.

      Yeah, imagine that, scientists are human. They want to be fed, be happy, achieve things, feel important and all that other human jazz.

      In other words, go fuck yourself, you work for me, and this is my data.

      And you're paying the scientists who get results. If they share data and get scooped than you won't pay them next time. Don't bitch about them being stuck in the system you're paying for.

    81. Re:Sudden Outbreak of Common Sense by TheVelvetFlamebait · · Score: 1

      No, but trolling a troll is still trolling.

      As my Ma always told me, two trolls don't make an insightful.

      --
      You know, there is a difference between trolling and pointing out the flaws in your reasoning. Just saying.
    82. Re:Sudden Outbreak of Common Sense by Rockoon · · Score: 1

      Failure to beat the competition results in no grants, no tenure and stuck in a dead-end role somewhere far less exciting.

      Good. Fuck 'em.

      It used to be that a scientist didn't make a name for himself unless he was remarkably more brilliant than his contemporaries. Now they get to make a name for themselves by social networking, and then conspiring within that good ol' boy club that they form, to maximize grant money.

      Fuck that. Thats got to stop. If you want to play it that way, go into the private sector.

      --
      "His name was James Damore."
    83. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      if you're taking my money, you work for me.

      If I'm taking your money, do I also work for every Chinese taxpayer?

    84. Re:Sudden Outbreak of Common Sense by xtracto · · Score: 1

      The original objection was that if the data is hard to come by then it's unfair to academics who wouldn't get the credit after gathering the data.

      We just have to publish the data in some journal (say as an online appendix of a paper *describing* the methods they used to collect it) and then /if/ such data is very important such papers would get hundreds of references!

       

      --
      Ubuntu is an African word meaning 'I can't configure Debian'
    85. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      Scientists aren't your employee, for the likely pittance you paid in taxes.

      I don't care one whit what you think you're entitled to.

    86. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      He's not talking about people trying to repeat experiments. He's talking about people trying to sidestep the scientific method altogether, and then leveraging some poor university's credibility.

      All science in the world of research driven by political agendas and egotistical arrogance.

      So I'm guessing you're not a scientist, then. But thanks for letting us know how easy it is for you to make blanket statements about an entire profession though, it shows how valuable your opinions are. Just because you got your CCNA on the 4th try doesn't mean you're now an expert on every topic you decide to speak about.

    87. Re:Sudden Outbreak of Common Sense by smidget2k4 · · Score: 1

      Nope. Plenty of classified research goes on in university research labs.

    88. Re:Sudden Outbreak of Common Sense by shadowbearer · · Score: 1

        Wrong. If Group B cannot duplicate Group A's analysis of the data, that proves that Group A did something wrong and probably came to the wrong conclusion.

          No, it doesn't. You are assuming that Group B didn't make any mistakes.

        If Group B cannot duplicate the experiment and get the same data (and knowing that means being able to compare both sets) that calls the experiment as a whole into question.

          In experiments as large as the LHC, how would propose that Group B "duplicates" the experiment?

          This hubbub all came about because of the difficulty in prying the source data out of the hands of the guy who produced the "hockey stick" figures.

        The downside to free and open access to all data is that research groups get grants to collect AND process the data to come up with results. Opening the data up for free access means that other groups, who have more interest in scooping than being right, have more ability to do that scooping.

          Oh? Are you saying that other research done by individuals or groups can't come up with other theories? That's at odds with your statement wrt to "the guy who produced the hockey stick figures"

          Your bias is unclothed.

        It's covered in the book "Broken Consensus" I think it's called.

          Which book would that be? Since you can't seem to be bothered to do a few seconds of research or remember the title, I'll try: Wait; can't find any book by that name, or even with those words included in the title or subject, that pertain to global warming. Perhaps you'd like to provide a real citation? It's possible that I missed it in a quick five minute search.

      SB

      --
      It's old. The more humans I meet, the more I like my cats. At least they are honest.
    89. Re:Sudden Outbreak of Common Sense by nacturation · · Score: 1

      Nope. Plenty of classified research goes on in university research labs.

      Then I stand corrected.

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    90. Re:Sudden Outbreak of Common Sense by nedlohs · · Score: 1

      There's an ever so slight difference between running the LHC and duplicating previously done collisions/measurements and building a time machine in order to setup a different set of weather stations around the globe so you can also gather a few decades of temperature data as well.

      And there's a still a huge possibility of group B proving group A's work wrong. After all the data collection is only one part of the process all the rest is still there to be shown wrong. And of course group B can still do their own data work.

      How can having more access to data be bad for science?

    91. Re:Sudden Outbreak of Common Sense by tyler.willard · · Score: 1

      You're not a climatologist, you're implying that public access will come to erroneous conclusions, and you're complaining about dogma?

    92. Re:Sudden Outbreak of Common Sense by TapeCutter · · Score: 5, Interesting

      "This hubbub all came about because of the difficulty in prying the source data out of the hands of the guy who produced the "hockey stick" figures. It's covered in the book "Broken Consensus" I think it's called. The "hockey stick" is not the "source data", the source data is all of the individual readings from all the instruments, prior to corrections for sampling errors or known issues. One cannot verify the quality of the "hockey stick" result without having the source data and being able to verify the processing steps that were done to it."

      I threw away some mod points because it irks me how unskeptical the garden variety climate skeptic actually is when it comes to accepting the hockey stick has been discredited. Here are a few points you should consider with your skeptics hat on...

      1. Mann's original hockey stick was published in the jounal Nature, they are not well known for publishing shoddy work.

      2. A senate inquisition was held on Mann's paper in which the National Acedemies of science were called in to give expert testimony on the veracity of Mann's paper. As you will no doubt learn when reading the testimony the NAS came down firmly in favour of Mann although they did highlight some minor technical problems.

      3. Given that the NAS were able to agree with Mann's conclusions under oath at a hostile inquisition, how did they do so without access to the data?

      4. The journal science is also not well known for publishing shoddy work. So why did NAS then publish a follow up study by Mann in their journal Science if they were not satisfied he had no only addressed the minor technical problems in the original but also greatly increaed the robustness of the findings?

      5. Why can't I find a listing for a book called "broken consensus" which you cite as a source? Shouldn't you at least adhere to your own standards of evidence?

      6. How do you explain the links to the data and methods found in an article called Dummies guide to the hockey stick on Mann's website?

      7. Why do people belive that some difficult to obtain data (ie: time consuming) from a few nations means that the other 99.99999% of the raw data available on the web is insuffitient to recreate the hockey stick?

      8. Why is McIntrye only interested in "auditing" climate science that disagrees with his opinion? Could this be because his own paper did not stand up to the traditional auditing method called "the test of time"?

      If the above points do not at least cause you to question your sources then I can only conclude your sketics hat must have slipped down over your eyes...

      --
      And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
    93. Re:Sudden Outbreak of Common Sense by goose-incarnated · · Score: 1

      If I haven't been getting modded up, it's cos giants were standing on my shoulders

      --
      I'm a minority race. Save your vitriol for white people.
    94. Re:Sudden Outbreak of Common Sense by the_womble · · Score: 2, Insightful

      I totally agree. If people just start looking at each others data instead of verifying it, a lot of mistakes (or fraudulent data ) will never be caught.

      On the other hand, a lot of errors in interpretation and statistical analysis will be caught.

    95. Re:Sudden Outbreak of Common Sense by TeknoHog · · Score: 2, Funny

      If I have been able to see further, it is due to being surrounded by midgets.

      --
      Escher was the first MC and Giger invented the HR department.
    96. Re:Sudden Outbreak of Common Sense by Xest · · Score: 3, Informative

      The issue with the FoIA in the UK is that there is a clause requiring bodies to only have to comply with the request if the cost of fulfilling it is not more than around £450.

      I've seen first hand local government abuse this by claiming that collation of the data would take 18 hours and that their FoI officer is paid £25 an hour, and hence the cost of providing the data is too high. Quite why it requires someone paid £50k a year to collate some basic data that they should already have collated anyway I've no idea, but still, they use this excuse, and the information commissioner allows such abuse of it.

      So although as you say it's a great theoretical win, I believe it'll make no difference in practice either way due to the ease of which public bodies are able to sidestep FoI requests.

    97. Re:Sudden Outbreak of Common Sense by VShael · · Score: 1

      All science in the world of research driven by political agendas and egotistical arrogance.

      Right. ALL science. Let's deal in absolutes and generalisations, because that's how we get to the truth.

      Wait, what?

    98. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      Get ready for an onslaught of mangled data analysis, with data being taken out of context, the results published to some blog, and people making policy decision based on those blog postings

      The data is allready been prooven to be mangled, taken outof context and published in some papers which get (are put ?) into the hands of people politicians who than make (far-reaching) policy decisions based on this single source.

      The only thing you seem to be saying is that a single source of information and the lack of scrutiny is only a good thing ?

      If that is so, I want some of the (gouverment supplied ?) weed you have been smoking ....

    99. Re:Sudden Outbreak of Common Sense by kandela · · Score: 1

      Except you won't keep getting funded unless you publish. If a larger group with more resources takes your data, analyses and publishes it before you have a chance to, then you won't get any funding, and you can't obtain more data.

      --
      Conservation of angular momentum makes the world go round.
    100. Re:Sudden Outbreak of Common Sense by kandela · · Score: 1

      Asking a scientist to show you their data before they have had a chance to analyse it is like asking a novelist to show you their storyline notes before the book is written. Or asking an office worker or government official to show you an incomplete report.

      Generating the data is part of a process. You wouldn't interrupt any other professional half way through a major project and demand they give you what they've got and then judge them on it.

      'Hey builder, I've paid for the frame you've built for my house. Now, despite our contract, for you to build me a house, I'm going to demand that you give me the frame, so I can get someone else to finish it. No, it doesn't matter if you've already bought the bricks, or if you already understand the plans, I'm sure someone else can do just as good a job.'

      Please, give me a break. Scientists are paid to generate science, not data!

      --
      Conservation of angular momentum makes the world go round.
    101. Re:Sudden Outbreak of Common Sense by jandersen · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      It isn't quite as clearcut as that. First off, the research that goes on at universities is often paid for in part by businesses, and their condition for contributing is of course that they have first access to the results. Secondly, even it some research is paid fully by the public, that is not the only investment - the efforts of the researchers involved represent a significant investment, and it is right that the reasearchers should therefore have a significant say in when and how the data are released.

      Of course, the end goal and the normal working practice of scientists is to publish data and results, but if we want to have scientific research at all, then we have to respect the rights of those who invest their effort, talent and interest in it.

    102. Re:Sudden Outbreak of Common Sense by hazem · · Score: 2, Insightful

      Agreed. Asimov wrote in the forward of one of his robot books, "If knowledge poses a dangerous problem, I can't believe that ignorance is the solution." I think it applies aptly here.

      Sure, some people will accidentally misuse the data, and others (hopefully fewer) will intentionally misuse the data, but for many, having that data available has a great potential for increasing the understanding we all have.

    103. Re:Sudden Outbreak of Common Sense by jcupitt65 · · Score: 1

      Your post makes no sense. You agree that scientists who fail to beat competing groups to publication will lose grant money and generally be failures, but you then bizarrely claim that scientific success comes from hob-nobbing with chums in some kind of fantasy old-boy's club. Science is very competitive and you won't get anywhere much by schmoozing. If you win grants but don't deliver results, you will lose funding.

      As an example, I work the #6 university in the world (arguably). The department next door was recently downsized due to anticipated cuts in core funding. They did it very scientifically: there was an equation with inputs for number of phd students, number of publications in top journals in the last 5 years, and grant income generated in the last 5 years. The researchers were sorted by score and the bottom 25% kicked out.

    104. Re:Sudden Outbreak of Common Sense by 16K+Ram+Pack · · Score: 1

      The downside to free and open access to all data is that research groups get grants to collect AND process the data to come up with results. Opening the data up for free access means that other groups, who have more interest in scooping than being right, have more ability to do that scooping. That leaves the people who did the work in the cold.

      Well, hardly "in the cold". They got paid for producing things which the public used. That's the point of public funding of science - as a benefit to the public, not to ensure that scientists get a nice pat-on-the back.

      If you don't like the deal, go and sit in your own lab paid for by someone else and publish your results and get all the glory.

    105. Re:Sudden Outbreak of Common Sense by rwiggers · · Score: 1

      Almost right.
      Privately funded research (very common in engineering) may withhold the data, which can ever be industrial secret.
      As I see, public funded research is public, so the data is also public.
      But there are gray areas...

    106. Re:Sudden Outbreak of Common Sense by R2.0 · · Score: 1

      "Collecting good data is hard work, and the payoff is big publications, which you need if you want to continue getting funded. "

      Really? When I do hard work, my payoff is...getting paid. Do you not receive a paycheck?

      "I don't care if you are a ditch digger or a particle physicist. Doing all the hard work and getting none of the credit sucks regardless of what we are discussing or who is paying the bills. So put up or shut up. Would you be willing to do all of the grunt work in your job, but take none of the recognition? Most people wouldn't - those are the kinds of jobs that make people go 'Postal'. If you aren't doing it (and even if you are), do you really expect anyone else to?"

      And so to the root of the problem. Somehow, people have gotten into their heads that they deserve a reward for doing what is expected. But the last I checked, a reward is for doing something extraordinary. Doing "hard work" should get you what you have contracted for - period. Do something extraordinary (in the literal sense of "above ordinary") and then one might "expect" a reward.

      The idea of "work for pay" died when the first shitty waiter said "Where's my tip?!" and didn't get fired/smacked.

      --
      "As God is my witness, I thought turkeys could fly." A. Carlson
    107. Re:Sudden Outbreak of Common Sense by itsdapead · · Score: 1

      The public pays for gathering the data, the public should have access to that data. Kinda hard to find fault with that.

      I don't see a problem with that, except for a few, small details:

      • The public will also have to pay for the extra effort involved in archiving data, dealing with disclosure requests etc. If you haven't worked at a University then bear in mind that most research projects are fixed term funding, and when that ends the account goes "poof" and even finding 10 quid to renew your domain name can be a major hassle.
      • For medicine and social sciences, that includes the cost of producing adequately anonymised versions of "raw" data to ensure compliance with scientific ethics and existing laws on data protection.
      • Recognise that not all research funding comes from the government - many commerciallt funded projects (and even some government projects not routed through the HE funding bodies) are, essentially, "work for hire" and/or subject to commercial confidentiality and NDAs.
      • Keep the enforcement and penalties proportionate and don't let this get surrounded by swathes of red tape desiged to cover management asses if someone walks on the cracks in the pavement. (Good luck with that!)
      • Have some protection against "nuicance" requests (one alleged factor in the recent climategate affair) - sadly the easiest way to do that is to charge a fee for access.
      • Make sure that our beloved leaders don't have a "public interest*" get-out to protect their policy-based evidence making, so this can take effect where it really matters and not just create extra paperwork for small projects.

      * as in, "its in the public interest for the public not to take an interest in this :-)"

      --
      In a survey of 100 programmers, 111111 thought that duck-typing was a good idea.
    108. Re:Sudden Outbreak of Common Sense by Vornzog · · Score: 1

      Really? When I do hard work, my payoff is...getting paid. Do you not receive a paycheck?

      You don't seem to understand the business side of science.

      If you work do R&D for a big pharma company, your compensation package is mostly monetary, plus a few papers that look suspect to other scientists because you have a conflict of interest. The company makes money by selling the result of your work.

      If you work in an academic or other 'pure research' setting, your monetary compensation is usually pretty low. The big-name prof on a huge grant may make decent money, but no one else does. The total compensation package is made up of a decent wage (but typically far below that made by people in other professions with similar amounts of schooling), and the lack of monetary compensation is expected to be offset by publishing big papers that *might* generate name recognition and lead to better opportunities for the researches long term - bigger grants, more prestigious positions, opportunities in industry, maybe patentable technology. The funding agencies take in money from taxes, and pay it out expecting that investing in fundamental science will lead to a better future for the nation on a long time horizon - most of the research they fund is not immediately commercializable.

      If you work in a government science job, your first job is to be a public servent, and so you get typical government pay scales - decent money, but still *far* below people with similar amounts of schooling, and you are still expected to make up the difference by publishing, getting name recognition, etc. - just like an academic.

      So, if you can make a ton more money in other professions, or non-academic settings, why does anyone, ever, agree to take less pay for doing fundamental research?

      Pro Tip - if money is the only form of compensation you take, hard core science would not be a good career choice for you.

      In both the government and academic cases you are beholden to some external funding agency to get your paycheck. They'll pay your wage today, giving the appearance of the scientist having a '9-5 job' where the compensation is totally monetary. But if you want to continue being paid tomorrow, you are expected to publish - or perish. The 'perish' part happens less in government, because you still have public services to maintain, but make no mistake - you are still expected to publish, demonstrating that you are spending public funds productively, wisely and deserve to be funded again next year.

      The idea of "work for pay" died when the first shitty waiter said "Where's my tip?!" and didn't get fired/smacked.

      In my world, shitty wait staff don't get tipped, and shitty scientists don't get their grant renewed.

      On the other hand, exceptional service at a restaurant will get you a *much* more generous tip. The equivalent for a scientist is spending years hunched over in the lab, making that big breakthough, publishing a career-making paper, winning the respect of your peers, and thereby gaining access to better funding opportunities to continue your research. Very rarely does that translate into any kind vast personal wealth, though - financial secuturity, maybe, rock star lifestyle, no.

      What does this have to do with releasing data? The same thing it did in my first comment. If you release your data too soon, you don't get the 'paid' for your work (regardless of the forms of compensation you accept). If you don't release your data soon enough, your work becomes suspect, especially if people are having a hard time reproducing it. Releasing the raw data in conjunction with that big paper is critically important for the long term success of science, and doesn't actually hurt you (unless you fabricated something...). Force the release of data too soon, though, and watch the quality of science plummet as it becomes a purely for-profit endevor. It is to the benefit of society to take the a long view of the benefits of science

      --

      -V-

      Who can decide a priori? Nobody.
      -Sartre

    109. Re:Sudden Outbreak of Common Sense by Sans_A_Cause · · Score: 1

      Try telling that to the Police when they stop you. Works every time.

    110. Re:Sudden Outbreak of Common Sense by Shotgun · · Score: 1

      The data won't impact public policy until some conclusion is made and presented to the public. The rule should be to simply make the data available when the paper is published. We don't expect to use a public road built with public money until the road is complete, do we?

      --
      Aah, change is good. -- Rafiki
      Yeah, but it ain't easy. -- Simba
    111. Re:Sudden Outbreak of Common Sense by jd · · Score: 1

      *laughs hysterically at the strawman comparison between preventing people from copying data and saying there is no added value to it - I've not seen anything so deluded since I last visited K5!*

      *looks at your 7-digit UID. Ah, a newbie. That explains it.*

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    112. Re:Sudden Outbreak of Common Sense by jd · · Score: 1

      Not really. Because the dishes were handled in unrecorded ways (the biologists discounted the importance of such things, which is why you had contamination) and because the biologists did not record any information abut what they detected, knowing that milk could get mould spots would tell others nothing new.

      Yes, there may well be an interesting signal there, which is why the original signal MUST be preserved. However, because the conditions of data collecting would have focused around the data of interest, there will be little/no information on conditions relating to any additional signal.

      Where the data conflicts with the conclusion (such as engineers looking at sensor data rather than timestamps and thus drawing false conclusions* because of inaccurate interpretation), referees MUST have the capacity to detect the flaw and be able to block false conclusions.

      *An engineer assuming that the space probes were on the correct trajectory because they shouldn't be anywhere else would be concluding something other than what the data says. In this case, it is less a matter of extracting new data and more a case of preventing the willfully-invalid abuse of the data to make a false claim.

      Interesting signals will almost certainly be in the data, and should be available to those studying the archaeology of science. Absolutely. However, there is one special case I didn't cover and you alluded to:- where data mining is more practical than data replication.

      It's hard to build a new LHC, and the Square Kilometer Array telescope will be a one-of-a-kind. Firing a probe into Halley's Comet's tail won't be possible again for another few decades. There are less extreme, but also difficult to replicate, environments. In these cases, the usual rules need to be bent. It may be possible to produce a similar experiment, but it is not possible to control conditions enough to make an identical experiment (within the bounds of normal experimental error).

      Even then, the LHC costs a lot to run, as do many other high-end labs. Daresbury Laboratory, over in England, is permanently in danger of closure despite having some staggering facilities and some amazingly bright minds.*

      *Yes, I did work for them once.

      As a result, there will be some experiments which churn up the unexpected but will never be repeated. The raw data should be exposed in full, immediately, the moment it is known that the experiment is of this kind. It's a one-off, there's no point in waiting for someone to reproduce the results, but there's valuable information there.

      There will also be experiments which are reproduced but where some element of what is classed as noise is ALSO reproduced. This data, together with sufficient technological information to test if the noise is created by the experiment itself, should ALSO be released. In this case, the data should include all runs in which the excess data is reproduced.

      (Noise is random, so if it were noise, it would not be replicated. Thus, it's something else.)

      But these are all very special cases, not the general rule. I have nothing against specific conditions which trigger a full disclosure of all data, and strongly recommend it. But it should not be the general rule as there'd be too much data (leading to an unmineable flood) and too little new experimentation.

      Indeed, within the special cases I've listed or anything scientifically/academically comparable in circumstance or nature, I would argue immediate full disclosure should be mandatory.

      In all other cases, a sensible, SHORT delay should be introduced prior to full disclosure - long enough for other branches of science to be able to understand mundane causes for extraneous data, thus allowing superior cleaning of the data prior to mining.

      But note especially I do not believe that the release of such data should be prohibited, merely that until the means exists to usefully process it, it's a distraction and not a help. I believe that in such cases, the partial or full release of lab notes

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    113. Re:Sudden Outbreak of Common Sense by lahvak · · Score: 1

      ...if you're taking my money, you work for me.

      Actually, they are working for me. Your taxes were used to build that bridge that goes nowhere. And to fix those potholes on the highway that nobody ever drives on.

      I hired those dudes to not only collect the data, but also to analyze them, and use them to draw some conclusions. They should give me access to the data together with their final results, so I can independently verify that the data indeed imply their conclusion. It would also be good if they released the data to the public when they are done with it, in case someone would like to have a go at the data, perhaps to study some other property of it, but I understand that the data may be of such character that it would be useless for anybody else.

      --
      AccountKiller
    114. Re:Sudden Outbreak of Common Sense by finarfinjge · · Score: 2, Interesting

      Point 1, Nature. Was still publishing articles supporting Piltdown man within 2 years of it being finally accepted as a hoax. They have been fooled before
      Point 2, Senate "inquisition" slammed Mann et.al. (if you are talking about Wegman, he called Mann's work obscure and incomplete with conclusions not supported by the data)
      Point 3, Not sure how you came to the conclusion that calling the conclusions unsupported by the data "agreeing"
      Point 4, See point 1,
      Point 5, guess you didn't try too hard http://books.google.com/books?id=8WqYkGxvPlAC&dq=%E2%80%9CShattered+Consensus:+The+True+State+of+Global+Warming%E2%80%9D&printsec=frontcover&source=bl&ots=veoYFgaLg9&sig=khERol_VbglL4JwcNuzN5JbaLJo&hl=en&ei=UxggS90fksmUB7CyxegF&sa=X&oi=book_result&ct=resul#v=onepage&q&f=false

      Point 6, Given that this article is about the FIRST, RELUCTANT, release of some of the very data that Mann used in constructing the hockey stick, I find it a bit rich that you are willing to claim that the raw data is all available
      The temperature data is available, but the 'hidden decline' tree ring data . . . Not so much
      Point 7, McIntyre's site is quite well known for being especially hard on skeptics who post idiotic comments. Of course, Steve is not advocating a point of view that requires the expenditure of billions of dollars. He is just asking for some discipline in the work being done.
      In his real job, he is required to audit data under rules that would make climate scientists crap. In mining you have to publish your data (drill records) as soon as practicable. Competition?? Tough. Don't show your raw numbers and interpretation methods?? Go to jail. McIntyre audits a lot of data other than climate scientists data. Under rules that people like Jones and Mann would whine miserably about.

      Maybe you might want to take off your own blinders and find a sight in addition to realclimate for information.

    115. Re:Sudden Outbreak of Common Sense by sac13 · · Score: 1

      I don't think that's what he was saying. He's saying this will lend itself to overly simplistic interpretations.

      That happens everyday on both sides of the issue with or without the data. The whole problem with the subject is that it's become a political topic and so now everyone is an expert.

      Of course, that fact won't stop the flaming. And regardless of which side you're on, the politicians that agree with you will ensure that not enough will be done in either direction to deal with or not waste resources on the problem/non-problem.

      They'll carve out plenty of power for themselves under the guise of "fixing" something anyway, though.

    116. Re:Sudden Outbreak of Common Sense by Anonymous Coward · · Score: 0

      If Group A and Group B use exactly the same source data there's no possibility of Group B proving Group A's research wrong.

      Wrong. If Group B cannot duplicate Group A's analysis of the data, that proves that Group A did something wrong and probably came to the wrong conclusion.

      Possibly also wrong. If Group B is suitably motivated, they can intentionally arrive a different result and claim that this calls into question Group A's result. Group A is then forced to debunk Group B's analysis, work for which they most certainly do not have financial support, which further distracts from their own efforts.

    117. Re:Sudden Outbreak of Common Sense by Jason+Earl · · Score: 1

      Sometimes I really wish that I had gone into academia. The idea of being able to get paid for 10 years (or more) before being required to write a single paper seems very appealing.

      That being the case my guess is that more and more prestige is likely to go to those researchers that actually gather the necessary data. Doing a brilliant analyses is good and all, but gathering the necessary data is the truly important bit.

    118. Re:Sudden Outbreak of Common Sense by TapeCutter · · Score: 1

      point 2 & 3 Be the skeptic you claim to be, read the fucking testimony.
      point 6 It is not the FIRST release, it's the realease for dummies.
      point 7. I have a bridge for sale.

      --
      And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
    119. Re:Sudden Outbreak of Common Sense by crdotson · · Score: 1

      A cynic might say that we've already seen "an onslaught of mangled data analysis, with data being taken out of context, the results published to some blog, and people making policy decision based on those blog postings" by the academics.

    120. Re:Sudden Outbreak of Common Sense by SpringRevolt · · Score: 1

      Funny...

      I have a PhD in biochemistry and advanced training in crystallography,

      me too.

      but I couldn't look at a ribosome structure and easily figure out what it meant, because I don't know very much about ribosomes.

      Likewise.

      Now I write data analysis software instead, and leave the question-asking to more suitable minds.

      Me too....

    121. Re:Sudden Outbreak of Common Sense by Obfuscant · · Score: 1
      No, it doesn't. You are assuming that Group B didn't make any mistakes.

      No, I'm assuming that when they fail at their first attempt, they discuss the problem with Group A and any mistake is discovered. If it is Group B's mistake, then they CAN duplicate the result (which is opposite what I said -- 'they cannot'). If it is Group A's mistake, then B cannot duplicate the original result, and neither can A.

      In experiments as large as the LHC, how would propose that Group B "duplicates" the experiment?

      Who said they were going to try? "If they can't duplicate the experiment and get the same result" means they can duplicate the experiment and don't get the same result, not that simply being unable to perform the experiment means the result is bogus.

      Oh? Are you saying that other research done by individuals or groups can't come up with other theories?

      I said no such thing. I was talking about people who AREN'T doing the research taking the data and scooping the original workers. You can do a lot of theorizing with other peoples' data, once they've taken it for you. Every experimentalist knows, taking the data is the hard part. Allowing theorists to scoop the experimentalists means you'll have a lot fewer experimentalists.

      Which book would that be? Since you can't seem to be bothered to do a few seconds of research or remember the title, I'll try:

      You didn't try very hard, did you? Google "consensus global warming book". First result: Amazon.com will sell you "Shattered Consensus", the book I was referring to. I assumed anyone interested enough would look it up themselves.

    122. Re:Sudden Outbreak of Common Sense by shadowbearer · · Score: 1

        Thanks for the reply.

        First off, wouldn't assuming that Group B and Group A are collaborating taint any results that Group B come up with? The whole idea of independent confirmation of results depends on the other groups doing the experiment from scratch and coming up with the same results. Any exchange of information, no matter how innocent it really is, can and will likely bring charges of taint.

        Furthermore, if Group B can't duplicate the experiment, they can't duplicate the experiment. If the experiment is so hideously expensive (or time consuming) to duplicate that no other team can try to do so, then there cannot be any confirmation of the first results, independent or not. That was point about experiments done at the LHC.

        I disagree with your third point about other people using the same data to "scoop" the original teams because often the data set is so large there is no way that one team can possibly cover all the possible theoretical approaches nor analyze all the data. Two cases in point would be LHC and SETI data sets.
        If someone else using the same data set finds something that the original team was unable to either because of time constraints, funding contraints, OR because the "someone else" had smarter or more experienced members on their team, then team A has nothing to bitch about. Those reasons are why scientific data should be open.

        You said it was called "Broken Consensus" "or something like that" - "Shattered Consensus" is not even close to the same title nor meaning of title. Broken != Shattered, the two terms have very different meaning. Broken, in this meaning, would mean there are some people who disagree with the consensus - which is true. Shattered would mean that the majority of people don't agree with the consensus, and that indeed there is no consensus at all. Very different meanings. You of course were able to find the book you were looking for because you already knew what you were looking for; I did not.

        I don't have quick access to the book, I looked over the reviews (the blurb is useless) and from what I can see in there, they are rehashing many of the same issues that have already been dealt with in the five years since the book was published. Several of those misconceptions, such as changes in thermometer technology and the understanding of stratospheric cooling - to name two- were dealt with well before the author wrote the book, so the author obviously was not abreast of current, or even older, papers on the subject.
        In addition, many of the issues I see brought up in the reviews have even been dealt with repetitively here on slashdot in the discussions about global warming.

        In any case, you're going to have to do better than basing your conclusions off of one five year old book, especially in the field of climatology, where things are changing so rapidly ;-)

        Cheers,
      SB

      --
      It's old. The more humans I meet, the more I like my cats. At least they are honest.
    123. Re:Sudden Outbreak of Common Sense by SETIGuy · · Score: 1

      There is a process involved in getting public funding. Rules to be followed. For example, my organization agrees to give me a place to do my work, and to pay to keep the lights on and the phone bill paid. The public (i.e. funding organization) agreed to pay for the scope of work I proposed. They set out requirements I needed to meet.

      What the "public" doesn't get to say is that they're changing the rules after the fact, and now I need to spend more money than I was initially granted on making my data available to everyone. The public wants the data to be made available, they need to pay the cost of making it available. I don't work for free, and I can't pay to hire IT support to maintain a data distribution site without money.

      You want my data to be made available, I'll need at least 1 FTE and a grad student to document data format. That'll be $140k per year (which is about the size of the typical NSF astronomy grant. But it would never get funded because NSF won't pay for IT staff.) How long do you want this data archive to be maintained? Now add the hardware...

      Stop with the you took PUBLIC MONEY crap. You might think I was required to make my data public, but nowhere in the proposal requirements did it say I needed to, and nowhere in my proposal did I say I would. I wasn't funded to do so. If you want the data, make it a requirement of the grant and I would have increased my budget to match. So would everyone else. And then you'd be bitching about how expensive it is.

    124. Re:Sudden Outbreak of Common Sense by finarfinjge · · Score: 1

      Normally, I don't reply to such things, particularly when there is such venom involved. It indicates a closed mind and a lack of tolerance, but anyway, here goes. Copied and pasted from the PDF of the Wegman report I have on my computer (and that I re-read prior to my first posting).

      Findings

      In general, we found MBH98 and MBH99 to be somewhat obscure and incomplete and the criticisms of MM03/05a/05b to be valid and compelling. We also comment that they were attempting to draw attention to the discrepancies in MBH98 and MBH99, and not to do paleoclimatic temperature reconstruction. Normally, one would try to select a calibration dataset that is representative of the entire dataset. The 1902-1995 data is not fully appropriate for calibration and leads to a misuse in principal component analysis. However, the reasons for setting 1902-1995 as the calibration point presented in the narrative of MBH98 sounds reasonable, and the error may be easily overlooked by someone not trained in statistical methodology. We note that there is no evidence that Dr. Mann or any of the other authors in paleoclimatology studies have had significant interactions with mainstream statisticians.

      For one (there are others) of the statutes under which Steve McIntyre must work, see this site http://www.bcsc.bc.ca/mining.asp, also do some reading on national instrument 43-101.

      I can't believe you still labour under the impression that all of the data has been released into the public domain.

      This was indeed the first release of this information.

      Rage is not conducive to clear thinking. It shows in this series of posts of yours, all of which exude a strong dollop of rage.

  2. Publicly funded by sconeu · · Score: 1, Offtopic

    Why doesn't this apply to the BBC?

    --
    General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
    1. Re:Publicly funded by Angostura · · Score: 0

      I think you'll find that the BBC actually broadcasts every program it makes, free to air.

    2. Re:Publicly funded by Monkey-Man2000 · · Score: 2, Informative

      Free if you pay your TV tax or pirate I believe

      --
      This post was generated by a Cadre of Uber Monkeys for Monkey-Man2000 (603495).
    3. Re:Publicly funded by Anonymous Coward · · Score: 1, Interesting

      I personally agree. they collect a huge amount of information about program availability, trends, and many other data services that are not the broadcasted portion of their TV that SHOULD be made available to the public.

    4. Re:Publicly funded by Anonymous Coward · · Score: 1, Informative

      You can sit in the pub to watch the football if you prefer.....

    5. Re:Publicly funded by Karganeth · · Score: 1

      Actually, you don't need a license to watch iPlayer.

    6. Re:Publicly funded by sconeu · · Score: 1

      Let me be a bit clearer.

      I was referring to the DRM on iPlayer.

      --
      General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
    7. Re:Publicly funded by jd · · Score: 1

      Because the BBC isn't a theorum. On the other hand, the BBC funded the development of the Dirac codec. You =DO= use Dirac, don't you? No? Then don't complain about what the BBC does or does not provide.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    8. Re:Publicly funded by OldBus · · Score: 1

      I think you do if it is live, but not if you watch previous programmes - the difference between downloading and broadcast.

    9. Re:Publicly funded by Anonymous Coward · · Score: 0

      and British Aerospace etc who receive shed loads of public money.

    10. Re:Publicly funded by Anonymous Coward · · Score: 0

      I don't care what Wikipedia says, I can still download iPlayer programs on Linux using get_iplayer with flvstreamer (which doesn't implement the DRM bits). So, for the moment at least, I can download iPlayer programs.

  3. yro my ass by meow27 · · Score: 2

    making publicly funded non-military research has nothing to do with privacy. Public money is spent for the public good and there is no good justifiable reason to keep it hidden from the public... especially if its meant for the betterment of society.

    if you want your data to be private, get your own privately funded money

    1. Re:yro my ass by Anonymous Coward · · Score: 0

      I don't think you understand the concept of the YRO category...

    2. Re:yro my ass by geekoid · · Score: 4, Insightful

      errr... no always.

      Putting data into peoples hands whoa aren't experts often leads to bad things. See every non expert who believed Wakefield study because they didn't understand how to interpret data. In that case kids died , and kids are still dying.

      In principle I agree with you, but we live in an are where everyone thinks they are a qualified expert in anything. That simply isn't true, and no good will come out of this.

      The data wan't show a flaw in the study because it wasn't used, but he will inevitably cherry pick data to 'prove' the study is wrong. And people like Hannah Devlin are always happy to publish claims without proper study. So no good can come from this, and people need to understand that.

      It's hard problem to solve.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    3. Re:yro my ass by DAldredge · · Score: 2, Insightful

      "No good"?

      None?

      Are you quite sane?

    4. Re:yro my ass by Anonymous Coward · · Score: 0

      the data needs to be made available to experts who are both interested and not on a payroll tied to an agenda.

    5. Re:yro my ass by Anonymous Coward · · Score: 0

      "Among other facets shared by the various fields of inquiry is the conviction that the process must be objective to reduce biased interpretations of the results. Another basic expectation is to document, archive and share all data and methodology so they are available for careful scrutiny by other scientists, thereby allowing other researchers the opportunity to verify results by attempting to reproduce them. This practice, called full disclosure, also allows statistical measures of the reliability of these data to be established."
      -from Wikipedia's article on scientific method

    6. Re:yro my ass by syousef · · Score: 1

      Putting data into peoples hands whoa aren't experts often leads to bad things. See every non expert who believed Wakefield study because they didn't understand how to interpret data.

      So hang on here, what are people suppose to do? You're blaming these people for believing an expert (and a published study).

      In that case kids died , and kids are still dying.

      Well the blame lies on the expert and on the peer review that wasn't properly conducted, not on the people who believed him!!!

      In principle I agree with you, but we live in an are where everyone thinks they are a qualified expert in anything.

      But you just cited a case in which an expert did no better and children died!?!?

      That simply isn't true, and no good will come out of this.

      The data wan't show a flaw in the study because it wasn't used, but he will inevitably cherry pick data to 'prove' the study is wrong. And people like Hannah Devlin are always happy to publish claims without proper study. So no good can come from this, and people need to understand that.

      It's hard problem to solve.

      Again, don't blame the non-experts for bad behaviour of the experts. If anything this is a case for more openness so that other experts can disprove false claims.

      In any case I'd rather be master of my own destiny and at least try to understand expert data on issues of my choosing, rather than just take it on faith that an expert is doing proper science. Yeah I could make an idiot of myself or harm myself but at least I've got a fighting chance!

      --
      These posts express my own personal views, not those of my employer
    7. Re:yro my ass by attributed+insanity · · Score: 1

      Well the blame lies on the expert and on the peer review that wasn't properly conducted, not on the people who believed him!!!

      Close, but no, the blame lies with the media who persistently presented what was in fact a dissenting view in the face of increasing amounts of evidence that suggested otherwise. I refer you to someone brighter than myself who is known for writing on the subject.

    8. Re:yro my ass by Attila+Dimedici · · Score: 1

      OK, so we have a study that was bad using data that was collected for that study and you use that as evidence that allowing people to see the data from publicly funded studies will result in the same thing? We already have the problem that you say this ruling will create.

      --
      The truth is that all men having power ought to be mistrusted. James Madison
    9. Re:yro my ass by Just+Some+Guy · · Score: 1

      Putting data into peoples hands whoa aren't experts often leads to bad things. See every non expert who believed Wakefield study because they didn't understand how to interpret data. In that case kids died , and kids are still dying.

      What about the legitimate expert who wants access to the data so he can make an informed decision on its validity? Do we lock down the source because some bad people might look at it, or open the source so that the experts can, too?

      --
      Dewey, what part of this looks like authorities should be involved?
    10. Re:yro my ass by Anonymous Coward · · Score: 0

      This means that most scientists should not be using computers.

    11. Re:yro my ass by Comboman · · Score: 1

      Putting data into peoples hands whoa aren't experts often leads to bad things. See every non expert who believed Wakefield study because they didn't understand how to interpret data.

      Are you arguing against your own point? Wakefield collected his own data (and falsified some of it). He was disproved by other researchers who could not duplicate his results with independently collected data, and when his conflict-of-interest was exposed. The Wakefield case should be a huge argument in FAVOUR of forcing researchers to release their raw data. As far as the non-experts go, they don't look at the data anyway; they choose which expert to believe based on who makes the better (or more emotional) argument (or the argument that fits better with their pre-existing opinions). Releasing the raw data won't effect that one way or the other.

      --
      Support Right To Repair Legislation.
    12. Re:yro my ass by TheTurtlesMoves · · Score: 1

      Censorship by any other name is still Censorship.

      Whats next? Approved reading lists?

      Who are you to judge what others can or cannot understand?

      Or should we all just "trust" these learned men and woman who are more qualified to interpret truth for us?

      Guess why the bible was *not* on the approved reading list.....

      --
      The Grey Goo disaster happened 3 billion years ago. This rock is covered in self replicating machines!
    13. Re:yro my ass by sac13 · · Score: 1

      Putting data into peoples hands whoa aren't experts often leads to bad things.

      Especially when you have the data and get to use it to define what "experts" means...

      Without the data being public, is everyone agreeing with the conclusions? Do you think hiding it makes the conclusions more credible?

      Idiots will be idiots with or without information (i.e. Intelligent Design). If you hide it, people that actually know what the data means, but have no access, have reason to doubt. If you publish it and it's correct, your support base becomes stronger and your opposition loses anyone that can credibly make a claim to understanding the research.

      Hiding because, "people can't understand and will think the wrong thing," just makes your position seem quite suspicious. With such a contentious subject, it does even moreso.

    14. Re:yro my ass by syousef · · Score: 1

      Close, but no, the blame lies with the media who persistently presented what was in fact a dissenting view in the face of increasing amounts of evidence that suggested otherwise.

      It's easy to blame the media, but was the paper retracted? Was there any official and easy way to gauge if it still had the support of the scientific community? Was a retraction of some sort printed? Because if not to the lay person and the lay media it all just looks like different groups of reputable scientists squabbling over their pet view point.

      The current scientific publication and peer review system is completely broken.

      --
      These posts express my own personal views, not those of my employer
  4. good news by Anonymous Coward · · Score: 0

    good news for everyone but those jerks who wanted to profit.

    1. Re:good news by zz5555 · · Score: 3, Interesting

      I don't know. The USA (and a lot of other countries) might not be too happy since it means releasing the UK is saying it's OK for these scientists to release the USA's proprietary data. So I guess, you're right in that those jerks like the USA (and a lot of other countries) that wanted to profit from this data will get their comeuppance, but I wonder if we now need to increase taxes in order to pay for these services that used to make a profit. So that means that we all need to pay more money because of this.

      I also wonder what it means for the university to release data that is illegal for them to release. I mean, on one side the court says they need to release it, but on the other side other courts say it's illegal to release it. Should be interesting in the UK for a while.

  5. Many academes want this too... by Improv · · Score: 2, Insightful

    Science journals have long fought this, because their profit model is strongest when they own copyright and are the exclusive publishers of a paper. Peer review and scientific principles don't mesh well with peer review though, and many academes have either "published" their papers on their own websites or found other ways to try to work around the journals.

    Ridding peer review and science of copyright would be a great improvement.

    --
    For every problem, there is at least one solution that is simple, neat, and wrong.
    1. Re:Many academes want this too... by DarkKnightRadick · · Score: 4, Insightful

      no, peer review is good. It helps to point out mistakes or inconsistencies. Getting rid of scientific journals is quasi-good (less profit motive in science, but also less chance to get work out there).

      --
      "There is a way that seems right to a man, but its end is the way of death." Proverbs 16:25 (NKJV)
    2. Re:Many academes want this too... by FlyingBishop · · Score: 1

      Wikipedia has proven that peer review can be supported for almost nothing. If universities got their shit together they could make one massive data store with zero difficulty, and no middlemen making it impossible to find crap without spending hundreds of dollars per researcher for month.

      The storage and administrative costs for all research papers should cost at most $50/researcher, and that's probably enough to build twenty redundant data centers around the world to carry every copy of every research paper published for the past twenty years.

    3. Re:Many academes want this too... by FlyingBishop · · Score: 1

      $50/month.

    4. Re:Many academes want this too... by dangitman · · Score: 1

      Peer review and scientific principles don't mesh well with peer review though,

      Peer review doesn't mesh well with peer review?? What?

      --
      ... and then they built the supercollider.
    5. Re:Many academes want this too... by geekoid · · Score: 1

      No they haven't. They make money from published papers and reputation. Not raw data.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    6. Re:Many academes want this too... by Obfuscant · · Score: 2, Insightful
      Wikipedia has proven that peer review can be supported for almost nothing.

      And that's the value you get from it. Allowing everyone to "peer review" everything results in the "truth" being the result of a majority vote, not the result of it being true.

      Peer review requires peers, not random people off the street.

      The storage and administrative costs for all research papers should cost at most $50/researcher,

      You're confusing the cost of "peer review" with the cost of archiving a paper. Peer review takes place prior to publishing the paper. The value of many journals, compared to "random website" is that there IS peer review, and you are less likely to find random babbling and incoherent thought in the journal.

      However, Open Access is the wave of the future, so you will eventually get peer reviewed work online, like you want.

      Of course, this discussion is about the data behind the papers, not the papers themselves. I don't know of a single paper that includes the "raw source" data it was based on. That's the purpose of the paper, to analyze and theorize.

    7. Re:Many academes want this too... by mjwx · · Score: 1

      no, peer review is good. It helps to point out mistakes or inconsistencies. Getting rid of scientific journals is quasi-good (less profit motive in science, but also less chance to get work out there).

      I think the GP meant getting rid of copyright restrictions in peer review and scientific journals, not getting rid of peer review or scientific journals.

      Peer review is a great thing(TM) and should be applied to everyone, scientists and "sceptics" alike.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    8. Re:Many academes want this too... by melikamp · · Score: 2, Interesting

      This is still an overestimate, I think. The Wiki says there are 5758 higher education institutions in the US alone. The entire budget of the Wikimedia foundation hovers around $10000000 a year, which is ~ $1736 per year per institution. We can have a project that costs 10 times as much as Wikipedia, containing, most likely, more than hundred times more data, for measly $17360 per year per institution. This is about as much as one lucky teaching fellow gets paid. This is such a trivial sum of money for the academia as a whole, Harvard alone could afford it for several decades if they wished so, although it would make a quite a blip on their balance sheet.

      Whatever, Wikimedia is already doing it with textbooks, so that part is taken care of. It would be nice, though, do have a big ass research exchange, kind of like famed JSTOR, but where everything comes with the source attached (original LaTeX, raw media, raw data, etc), and everything is available to everyone (public domain or, better yet, copyleft).

    9. Re:Many academes want this too... by Improv · · Score: 1

      I used to be heavily involved with Wikipedia - it hardly reaches the standard of peer review.

      --
      For every problem, there is at least one solution that is simple, neat, and wrong.
    10. Re:Many academes want this too... by serviscope_minor · · Score: 1

      http://xxx.lanl.gov/

      (my favourite URL on the web). It only has papers (no data), but almost every paper comes with the LaTeX source. In some fields it has little content. In other fields, (HEP) it has essentially replaced jounals. This is what I can only hope happens to my field.

      --
      SJW n. One who posts facts.
    11. Re:Many academes want this too... by DarkKnightRadick · · Score: 1

      Even better than copyleft is CopyFree.

      --
      "There is a way that seems right to a man, but its end is the way of death." Proverbs 16:25 (NKJV)
    12. Re:Many academes want this too... by DarkKnightRadick · · Score: 1

      In that case, it's all good.

      Agreed.

      --
      "There is a way that seems right to a man, but its end is the way of death." Proverbs 16:25 (NKJV)
    13. Re:Many academes want this too... by FlyingBishop · · Score: 1

      But the point is all that is managing access... it should actually be easier than Wikipedia. The journal has basically nothing to do with it.

  6. Good news from the UK by nescientist · · Score: 1, Interesting

    More data in more hands is a good thing. It sounds like this specific case was driven primarily by the nonsensical quackings of a global-warming denialist, but whatever; information is beautiful and the more we share the love the better off we all are.

    1. Re:Good news from the UK by geekoid · · Score: 2, Insightful

      You say that until he gets on a major talk show, talks about his improperly interpret results and suddenly 20 million people are parroting his incorrect results.

      Suddenly it's not a good thing because those same outlets will not give the same time to actual experts.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    2. Re:Good news from the UK by Anonymous Coward · · Score: 0

      That's mister "global-warming denialist," world reknowned mathematician, statistician and climate scientist to you buddy.

    3. Re:Good news from the UK by mjwx · · Score: 2, Insightful

      You say that until he gets on a major talk show, talks about his improperly interpret results and suddenly 20 million people are parroting his incorrect results.

      The problem is that we dont apply the same standard to a talk show as we do to a scientific institution.

      If a talk show spreads incorrect information absolutely nothing happens, if a scientific institution does the same there will be a royal commission, investigation, scrutiny and even if they are found innocent someone's career is still ruined.

      What we need is to get rid of the double standard, lets just say if Box News makes a deliberately misleading statement about the Australian Hoop Snake they should be investigates, charged and the editor, producer and reporter fired and barred from working in the media field again. If we started giving news agencies with the same scrutiny and punishments as universities then the level of misinformation would drop dramatically.

      Published scientific reports should also have the data published publicly, however there should be severe punishments for the misuse of this data to spread misinformation and attempts to ruin careers.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    4. Re:Good news from the UK by bendodge · · Score: 1

      What we need is to get rid of the double standard, lets just say if Box News makes a deliberately misleading statement about the Australian Hoop Snake they should be investigates, charged and the editor, producer and reporter fired and barred from working in the media field again. If we started giving news agencies with the same scrutiny and punishments as universities then the level of misinformation would drop dramatically.

      So would freedom of the press. What you are suggesting is essentially state control of the media, and only a step away from a priori censorship. How much easier to control misinformation if you must submit everything but the weather forecast to the bureau for approval before publishing?

      Second, this research is usually funded with tax money - hence the high standards. Most news agencies are not. You can say pretty much anything you want to on your own dime. It's natural to recoil from that idea, but it was still important enough to the founders of the USA to make it #1 on the earth-shaking Bill of Rights (which is might add is still unparalleled 200+ years later).

      Look around the world. It is still awe-inspiring how much power and trust the US Constitution puts into the people, and how strict are the chains on government. Many people wish for tidiness and control of crackpots in the world. I do not. I think freedom is a far more lofty goal.

      Be careful what you wish for. You just might get it (esp if you have a vote).

      --
      The government can't save you.
    5. Re:Good news from the UK by mjwx · · Score: 1

      So would freedom of the press. What you are suggesting is essentially state control of the media, and only a step away from a priori censorship. How much easier to control misinformation if you must submit everything but the weather forecast to the bureau for approval before publishing?

      No, did you even read my post.

      Priori censorship? I cannot fathom how you managed to reach that conclusion?

      What I suggested is that news agencies be made responsible for what they publish. In the same way that I'm responsible for my actions. I know if I do something I know is wrong I will be punished but that does not stop me if I really want to.

      We simply put in place, the same punishments and scrutiny for news agencies, meaning if they do misrepresent facts they can and will be held accountable after the fact. This is no more censorship then being punished for deliberately lying in court or lying on an advertisement.

      Most news agencies are not. You can say pretty much anything you want to on your own dime.

      Am I permitted to destroy your business with slander that is blatantly untrue, so long as it's on my "cent"? Am I permitted to advertise a price that is false? This is the proverbial "fire in a crowded theatre". Currently, in my country if I were to do either of these actions I will be punished, after the fact as both are wrong.

      Sounds like a system that panders to the whims of the person with the most money. It's systems like this that handed so much power to people like Murdoch. They've failed and now news agencies need to be made accountable for what they publish.

      It is still awe-inspiring how much power and trust the US Constitution puts into the people

      It's just a piece of paper.

      Putting blind faith into a document is as bad as putting blind faith into anything. As the GWB regime proved it's only as good as the people defending it, in other words, how are those free speech zones working out for you. Personally I wont be going anywhere near the US border with all these Philipines, Thailand, Cambodia and Malaysia (quite normal for an Australian to travel around SE Asia) stamps on my passport because as a single male they tend to earn extra scrutiny from an organisation that has no accountability.

      By permitting news agencies to publish information that they know is deliberately misleading, you are basically saying it's OK to murder people even though you know that you are doing wrong. A news agency may still publish information they know to be misleading but they will be punished for it. This does not cut down on personal freedoms one iota, it cuts down on the ability of large organisations to manipulate people into believing false information. Putting some limits on lying in the media will, in fact increase personal liberty.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
  7. Re:Good and bad by oldhack · · Score: 3, Insightful

    "Scientists" scared of goofy analysis are priests, not scientists. Take their funding away and use their PhD parchment for toilet paper.

    --
    Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
  8. Re:Good and bad by Monkeedude1212 · · Score: 4, Insightful

    On the other hand, this will likely produce a whole stream of deliberately inaccurate analyses with ulterior motives behind them.

    But with the data public, it'll be easier to shoot them down for picking, choosing, skewing, and what else.

    There is no reason why this kind of data should ever be "secret"

  9. There are problems with this by cranky_chemist · · Score: 1

    Does this mean every biology, chemistry, physics, and engineering research group (I'm talking about grad students and postdocs, here) would have to open their lab notebooks to anyone who asked? Researchers who ply their trade on the cutting edge of science live in perpetual fear of being "scooped" by another group who publishes their discovery first. These are sometimes literally "races." So now a group at one university could demand access to the notebooks of a group at another university? And vice versa?

    1. Re:There are problems with this by Sparr0 · · Score: 2, Interesting

      Yes, yes, and yes. What is the problem? If they are racing, there is obviously something worth racing TO. If both teams have all the data, that goal will be reached no later, probably sooner.

    2. Re:There are problems with this by xilmaril · · Score: 3, Informative

      Does this mean every biology, chemistry, physics, and engineering research group (I'm talking about grad students and postdocs, here) would have to open their lab notebooks to anyone who asked?

      Researchers who ply their trade on the cutting edge of science live in perpetual fear of being "scooped" by another group who publishes their discovery first. These are sometimes literally "races." So now a group at one university could demand access to the notebooks of a group at another university? And vice versa?

      Not at all.

      It means they have access to each others results and source data when published (once the group is done researching this phase, and is ready to publish). There's no "opening notebooks", simply because that's a terrible metaphor for how data is collected these days.

    3. Re:There are problems with this by sourcerror · · Score: 3, Informative

      You only have to publish your data after publishing your article, which means "you won". You don't have to publish data for a research in progress.

    4. Re:There are problems with this by jd · · Score: 1

      You're right. Nobody uses notebooks. Netbooks, yes, but notebooks are SOOO last decade.

      --
      It's a small world and it smells funny; I'd buy another if it wasn't for the money; Take back what I paid (SoM)
    5. Re:There are problems with this by mjwx · · Score: 1

      Does this mean every biology, chemistry, physics, and engineering research group (I'm talking about grad students and postdocs, here) would have to open their lab notebooks to anyone who asked? Researchers who ply their trade on the cutting edge of science live in perpetual fear of being "scooped" by another group who publishes their discovery first.

      I think this would only pertain to data gathered using public money and work that is published.

      It would be certifiably mad to force all material to be vetted before it is published or at least released for peer review. Unfortunately this is the future I see if the anti-intellectuals are permitted to gain power, notebooks searched for "unapproved" research.

      --
      Calling someone a "hater" only means you can not rationally rebut their argument.
    6. Re:There are problems with this by Interoperable · · Score: 1

      They would probably have to request specific information. That would mean knowledge of the results, presumably due to their publication so worries of being scooped wouldn't really come up.

      --
      So if this is the future...where's my jet pack?
    7. Re:There are problems with this by serviscope_minor · · Score: 1

      Yes, yes, and yes. What is the problem?

      Let me explain what the problem is. I assume you work in the computer field.

      Collecting datat is one of the hardest parts of science. It is difficult, time cosuming and slow.

      Imagine you write code for a living. You are coding up a new application for your company. Coding a whole new application is difficult, time consuming and slow. Let's say you have written 95% of it, the back-end (hard bit) works and is debugged, etc.

      Now let's say you are forced to "share" this code with a co-worker with whom you are competing for not getting laid off. On the back of your hard work, he slaps on a quick, crappy GUI and demos it to the VP before you have a chance to.

      He gets all the praise, you lose your job.

      But what's the problem? The company is racing to get a product out and the goal was reached a few weeks sooner. It's win-win for everyone.

      Except you.

      Repeat the above, substitute data for code, paper for GUI, journal for VP and competing scietist for co-worker.

      Perhaps now you understand.

      --
      SJW n. One who posts facts.
    8. Re:There are problems with this by Sparr0 · · Score: 1

      You can prove you did 95% of the work via source control logs, or code/performance reviews for the weeks/months in question, or with your own 30 second pitch to that VP, or probably a few other ways I haven't thought of. If none of those work, it's obviously a poorly managed company. Be glad to part ways.

  10. Awful summary by Protoslo · · Score: 4, Insightful
    It turns out that "the data" are measurements of petrified tree rings, which were collected in the course of (presumably) a government grant-funded study. Now Queen's University researchers must compile the data for release because of the (UK) Freedom of Information Act. The scientists quoted in TFA apparently did not use the ring data for anything relating to climate studies, but Keenan has that purpose in mind.

    Phil Willis, a Liberal Democrat MP and chairman of the Science and Technology Select Committee, said that scientists now needed to work on the presumption that if research is publicly funded, the data ought to be made publicly available.

    That doesn't seem unreasonable to me. Appendices with raw data are often included already in the online editions of journals. Of course, if the ruling applies to all data generated in the course of a study, whether it is used in publications or not, it could be onerous indeed.

    1. Re:Awful summary by blair1q · · Score: 2, Interesting

      Now Queen's University researchers must compile the data for release because of the (UK) Freedom of Information Act.

      Seems unreasonable. They should charge the requester for any effort needed to "compile" or transmit the data. No reason the public should foot the bill for any particular formatting or delivery.

    2. Re:Awful summary by Jason+Pollock · · Score: 1

      The data used in the research can be gathered from multiple sources, and is not necessarily gathered with funds from the government. Therefore, any requirement for public access would preclude the use of that data in research.

      Since we're talking about weather data, many government weather offices charge fees for access to weather information, however, confidentiality clauses would also appear in health research (personally identifiable records, population studies, etc)

      So, you're left with a choice, don't use the data and don't do your research, or do the research and don't release the data publicly.

    3. Re:Awful summary by pkphilip · · Score: 3, Informative

      Michael Mann used the same tree ring data as temperature proxies for his studies and has published papers on this. But now the very same scientists who collected the tree ring data claim that data cannot be used as a temperature proxies - even though they haven't mentioned a word about how this would invalidate Michael Mann's work.

      http://climateaudit.org/2010/04/21/mann-of-oak/#more-10811

    4. Re:Awful summary by Roger+W+Moore · · Score: 1

      That doesn't seem unreasonable to me.

      Really? Even if you work on a large particle detector like ATLAS and have a dataset of several petabytes? Although I've been out of the UK too long to be allowed to vote any more Phil Willis is technically my local MP so I'd be tempted to write to him....but he is not standing in the current election so its unlikely to do any good.

    5. Re:Awful summary by Jojoba86 · · Score: 1

      The FoI act allows to charge for formatting and delivery of the data, though it should only be the actual costs incurred.

    6. Re:Awful summary by jabuzz · · Score: 1

      Rubbish, If you are a UK citizen you can vote.

    7. Re:Awful summary by blair1q · · Score: 1

      Agree. The US FOIA charge is some ridiculous amount per page of information.

    8. Re:Awful summary by Roger+W+Moore · · Score: 1

      If only that were true - it amazed me as well but the rules are very clear: if you have not been registered to vote in the UK within the past 15 years you do not get to vote. I am a British citizen (and it is my only citizenship) and yet I cannot vote in this election because I was last registered in the UK in April 1995. What it REALLY irritating is that Canadian citizens resident in the UK get to vote in both the UK and Canadian elections whereas because I am a Brit living in Canada I get to vote in neither the UK nor Canadian elections. It really is high time that Britain updated its voting and citizenship laws which seem far more suited for 18th century and the time of the British Empire that the 21st century world. I do still get to vote in European elections - although even that is messed up because I used to live in France but was not allowed to vote there and had to vote back in the UK!

  11. Formatting Standards? by Anonymous Coward · · Score: 0

    I think this should be the case for all government sponsored research.
    This will be a very good thing, as the act of publication relies on condensing/compressing actual data results.
    Often things may be left out simply because the author didnt think they were important.
    I am more concerned with the time and effort it will take to format data for external users.
    An accompanying more detailed methodology will surely have to be provided for the data to be used correctly.

    1. Re:Formatting Standards? by Rising+Ape · · Score: 2, Insightful

      I am more concerned with the time and effort it will take to format data for external users.
      An accompanying more detailed methodology will surely have to be provided for the data to be used correctly.

      That is indeed an issue. Presumably the methodology is already published, as is the rule for scientific papers. What could happen is that competent scientists have to waste their time debunking incompetent analyses by axe-grinding cranks.

      Actually, if the requirement is specified up front as terms for the grant, I'm not opposed to it. I don't think it'll do any good, mind you, as a rule all that's useful is published, and scientists are generally happy to cooperate if you need more, as long as you have honest intent. But the current system is a charter for arseholes using FoI requests to harass scientists.

    2. Re:Formatting Standards? by Bigjeff5 · · Score: 2, Insightful

      What could happen is that competent scientists have to waste their time debunking incompetent analyses by axe-grinding cranks.

      It's much more likely that incompetent scientists will be debunked by more competent analysis, because as soon as there is any controversy regarding a study the scientific community swarms to verify one way or the other.

      Also, it's just as important to know what data was disregarded, and why (there are a plethora of valid reasons, but there are even more invalid reasons) as it is to know what was included. The GP's point about the tree ring data that was collected but never used, why wasn't it used? Was it simply because they weren't interested in doing a tree-ring study, and used the data for something else entirely? Or did it make their model not work quite right so they tossed it out? How is anybody to know if they can't look at the data they collected?

      Furthermore, if the raw data is not provided, you cannot verify that the models and statistical conclusions are correct. What if there is a problem with the model the researchers were using? Well, if you plug the data into a better model, or even just a different model, you'll see a big difference if one of them is wrong. Climate science relies heavily on computer models, and often multiple researchers will use the exact same model in their study, so it's not hard to get a systemic error across multiple studies.

      In other words, how can you verify anybody's science without the original data they observed to begin with? I'm never going to look at this data, I wouldn't have a clue what to do with it, but I know there are a lot of climate researchers who are chomping at the bit to verify these studies.

      --
      Security is mostly a superstition... Avoiding danger is no safer in the long run than outright exposure. - Helen Keller
    3. Re:Formatting Standards? by michaelwv · · Score: 2, Informative

      Science makes progress through experiments. You design an experiment; you figure out what measurements you need to make; you make those measurements according to the requirements and specifications of your experiment; what do you need to control for? what calibrations are important? how much data do you need for a statistically significant sample? The answers to all of these questions are different depending on the experiment you want to do. Using data from someone else experiment means you have to go through all of these steps and then try to account for that fact that they way the data were gathered isn't quite right for what you want to do, you need to control for different things than the original experimenters, etc. This takes generally takes expertise in both the original scientific question and the new one. I get enough citations and questions from good-intentioned, responsible astronomers who use our data in published papers in subtly, but significantly, incorrect ways. I try to deal with such occurrences helpfully, but if often takes a long time to guide the interested fellow astronomer through the relevant literature explaining why what they did isn't quite right. When I write about something in a field that's new to me, I'm quite sensitive to this and try to check extensively that I'm not making a classic 1st-year graduate student mistake in that field. Don't even get me started on all of the email I get with re-analyses of our data by retired engineers.

    4. Re:Formatting Standards? by Anonymous Coward · · Score: 0

      In other words, how can you verify anybody's science without the original data they observed to begin with? I'm never going to look at this data, I wouldn't have a clue what to do with it, but I know there are a lot of climate researchers who are chomping at the bit to verify these studies.

      They verify it by doing their own studies, with their own data, and with their own analysis.

    5. Re:Formatting Standards? by Anonymous Coward · · Score: 0

      Often things may be left out simply because the author didnt think they were important.

      Most experimental work generates a huge quantity of data that really is unimportant because it is an intermediate step. It's critical for the research group to keep a careful record of calibrations and debugging runs on equipment, but much data is collected before bugs are found. That data has little value once the equipment is up and running. In my experience, a very small percentage of data even could be useful to anyone.

    6. Re:Formatting Standards? by Rockoon · · Score: 4, Informative

      That is indeed an issue. Presumably the methodology is already published, as is the rule for scientific papers.

      There is at least one case in =two climate research papers where what the methodologies claimed was impossible because the data to do it didn't even exist. This didn't come out for 16 years, and was only discovered because a FOI request was finally honored.

      In this case, the authors of the papers had claimed that the station data that they used was from stations that had "few, if any, changes in instrumentation, location or observation times." (quote from one paper) and "selected stations have relatively few, if any, changes in instrumentation, location, or observation times" (quote from the other paper)

      "Hey! We only used great data!"

      Now, these two authors used the same data, and one of these authors was actually a co-author of the other paper. These authors are Jones (hello climate gate) and Wang.

      Now, they finally sourced the data as being from the Chinese Academy of Sciences, which coincedentally had co-published a report with the US Department of Energy at about the same time as those two research papers, stating quite specifically that DATA OF THAT QUALITY DID NOT EXIST. The report was specifically about the quality of the Chinese climate record.

      Both papers concluded that the Urban Heat Island effect was minimal. Too bad that they didn't actually have data good enough to draw that conclusion. They said they did, tho.

      None of this would have come out if it wasn't for the Freedom of Information Act. Jones and Wang both obstructed the release of the data (denying FOI requests, etc) for nearly 2 decades.

      This all came out several years ago, but the media didnt give a fuck. They did care about hacked emails tho. Go figure. Now, as it turns out it probably wasn't Jones who was lying his ass off. Wang was a co-author on Jones's paper and supplied the "data." Jones gets credit for having his email hacked.

      --
      "His name was James Damore."
  12. NSF by martas · · Score: 2, Interesting

    does anyone know if the NSF has similar requirements?

    1. Re:NSF by imidan · · Score: 3, Interesting

      The NSF has recently taken more of an interest in research data management. They're definitely starting to make it a requirement of grant funding that the research data be digitally stored, backed up, and, after a cooling-off period to allow the principal researchers to publish, made available to the public. I'm working on a research data management group at my university, and the researchers generally seem open to the idea, though they're loathe to put in any extra effort to make it work.

    2. Re:NSF by guruevi · · Score: 2, Informative

      Yes it does, kinda. Thanks to our publishing overlords however these 'making available' issues are more difficult than just publishing it online or so. The data cannot be made available as long as a publishing house has copyrights on it and the publishing house usually takes copyright for all work for years including data that is not directly published by them especially when the work is or becomes popular. However NSF/NIH grants usually have the requirement to release all data to the public a couple of years (usually around 10 or 25 years depending on the grant) after collection or publishing. But if you don't publish through one of the big names, your career as a scientist usually doesn't go much of anywhere. Also, a lot of machinery can't be afforded on any grant but a governments' (multi-million dollar machines), the device that collects the data could be funded by the NIH and the grant has the requirements to release data 10 years after collection. However in order to make money to keep the system running, the institution needs other funds from other sources each with their own constraints.

      Disclaimer: I am not a scientist but I manage about 60TB of collected data owned or funded by a combination of private/individual funds, internal funds, corporate funds, publishing houses, NIH, NSF and other grants which should or should not be made available to the public.

      --
      Custom electronics and digital signage for your business: www.evcircuits.com
    3. Re:NSF by Anonymous Coward · · Score: 0

      NSF certainly promotes the release of data but its not compulsory (at least in my very non-politically sensitive field). I suspect that might be different for climate scientists or food safety or the like. So far as I know DOE funded groups are under no obligation to release their raw data. In contrast, NASA releases all its science data, raw and processed. However they do so only after a "honeymoon" in which the project scientists get a first shot at it (only fair when you've busted your ass to put together an experiment for years and even decades).

    4. Re:NSF by michaelwv · · Score: 1

      Most scientific journal copyright agreements do not restrict authors right to reproduce data from their papers. The copyright is very specifically on the copy-edited published manuscript. The publisher often _manages_ copyright on behalf of the author or society for copyright enforcement and for granting permission for reproductions. The data can definitely be made available. Read the copyright transfer agreements carefully. The real problem in all of this is that: "here is the raw data file in my custom format" is not that useful. Curation of data is important, data should be publically available, and that needs to be funded.

    5. Re:NSF by michaelwv · · Score: 1

      ... they're loathe to put in any extra effort to make it work.

      They're not rewarded by grants, salary, promotion, respect, or tenure for putting in any extra effort to make it work. What really matters is getting more grant money and writing new papers. They're effectively punished for spending their time doing anything else.

    6. Re:NSF by martas · · Score: 1

      I manage about 60TB of collected data owned or funded by a combination of private/individual funds, internal funds, corporate funds, publishing houses, NIH, NSF and other grants

      Holy crap! man, there are all kinds of people on /., aren't there... thanks for the answer, btw.

    7. Re:NSF by imidan · · Score: 1

      Oh, I agree with you, and I see your point, and I don't think it unfair for researchers to spend their time doing the things that they're specifically rewarded for doing. But if the granting agency requires them to propose and implement a data management plan, then they will be punished for not doing so: they'll stop getting grants. When these policies of the NSF get in to full swing, that should be an obvious incentive.

  13. Not always feasible... by Anonymous Coward · · Score: 2, Informative

    We can agree that the whole scientific process does not make much sense if we have to believe in the interpretations without seeing the actual data. From this perspective it is crucial for all scientific data to be open.

    The other perspective comes from the individual scientist. It might take years to put together a complete data set of a particular phenomenon via experiments, literature review, digging in the ground or looking at the stars. So after looking for something special you finally discover something new and write a small article about it. This will just be something along the lines of: "hey, there is something interesting going on here." Now you go back and look carefully at all your data for similar events, filter out noise because you have a better idea what to look for and then hopefully publish more about. So the next article will not only contain more information but also some analysis about the possible origins of the phenomenon and so forth.

    Imagine you had to open your carefully put together data right the second after you recorded it. Other people might grab your stuff and your research might not even be cited because they just looked at all the steps that you took that were not successful and repeated the experiments or used other available data.

    This interest in keeping your data private cannot be avoided with the current system of judging a scientist by his or her publications.

    1. Re:Not always feasible... by mick232 · · Score: 0

      So what? Someone else will publish something that you didn't find out. Society will nevertheless benefit and thus get what they pay for. They don't pay for the individual benefit of single researchers.

    2. Re:Not always feasible... by Anonymous Coward · · Score: 0

      Scientists get payed for publications, not data.

    3. Re:Not always feasible... by dmwst30 · · Score: 1

      Nice straw man. The scientists never made the defense that they were using the data for a similar analysis that the mathematician (PhD) wanted to perform. If they had simply said they were working on a paper with the data set, and would share the data once they had their paper accepted, everyone would have been happy! Or they could have collaborated, and gotten another paper out of the deal!

      Instead, they flat out, over a period of years, denied him the information. While sharing it with other scientists (Mann) in the community. While ignoring the laws stating they had to share the information.

    4. Re:Not always feasible... by pydev · · Score: 1

      So what? Someone else will publish something that you didn't find out.

      But scientists wouldn't invest the years of hard work to collect the data in the first place if they only get one paper out of it before they get scooped by other research groups, so there will be no data based on which to publish.

      See, scientists don't just work for money, they work for publications, because that determines their career. And the problem is even worse in the UK since number of publications and citations strongly determines everything from pay to whether you keep your job in the UK.

      The current system already discourages deep analysis or anything that requires lots of effort. Many papers are theoretically shallow ideas supported by little data because investing more time and effort simply isn't rewarded. This rule makes the situation even worse.

  14. Re:Good and bad by Rising+Ape · · Score: 3, Insightful

    That doesn't matter. The important thing is that the attacks are made. Even if every one is shown to be completely wrong, people will still remember all those (erroneous) anti-global warming reports. Especially since the media will enthusiastically report the initial attack and relegate the news of its rebuttal to a small paragraph on page 34, if they report it at all.

  15. naive by Anonymous Coward · · Score: 0

    Yeah, sure, it's perfectly easy for people to shoot down bad analyses (which don't just include incorrect selection of data, but also choosing inappropriate statistical techniques, bad models, etc.).

    But the people with the most vested interest in producing deliberately bad analyses are the ones with the loudest megaphones and the greatest access to the press. Scientists all over the world, overwhelmingly, think that there's significant evidence of anthropogenic climate change as the result of our carbon output. Notice, though, that the public at large isn't nearly as convinced - mostly because 1) the science is actually complicated, and 2) because oil companies and their allies have a shitload of money to fund PR campaigns aimed at distracting the issue.

    Make the data public, sure. But don't for a second think that you'll get any improvement in the bullshit factor.

  16. Re:Good and bad by tsotha · · Score: 1

    On the other hand, this will likely produce a whole stream of deliberately inaccurate analyses with ulterior motives behind them.

    I think they're a lot more worried about accurate analyses than inaccurate ones. FUD will be much easier to deal with if it's really FUD because it can be criticized by more people than just the keepers of the sacred data.

  17. Re:Good and bad by Rising+Ape · · Score: 1

    Trol modl? Er, what? I don't think so. Note to moderator: "troll" does not mean "I don't agree".

  18. Publishers had nothing to do with this. by pavon · · Score: 1

    This has nothing to do with journals. The data was not available anywhere - not in a for-pay journal, not on a website, not on request. It was the researchers that refused to release the raw data - the publishers have no motivation to suppress these release, because it is the published paper that earns them money, not the raw data.

  19. Re:Good and bad by Monkeedude1212 · · Score: 1

    The worse it gets, the more me and my colleagues wished we could see the raw data and draw our own conclusions.

    I don't know about you, but I'm tired of listening to reports. I'm tired of hearing scientists say this and scientists say that.

    Give me some FACTS, and I'll draw my own conclusions. If I'm wrong, you can correct me, and show me where I'm wrong. If you can't do that, I know who is lying.

  20. Re:Good and bad by Rising+Ape · · Score: 1

    You can read the publications. Well, the ones that aren't behind journal paywalls, unfortunately, but that's not by the choice of scientists.

    The raw data will not help you, but you can judge the validity of the analyses if you like.

  21. So what was your UK tax burden? by Anonymous Coward · · Score: 0

    So what was your UK tax burden? Can UK taxpayers get US data for free now too?

  22. Re:Good and bad by ClickOnThis · · Score: 1

    On the other hand, this will likely produce a whole stream of deliberately inaccurate analyses with ulterior motives behind them.

    But with the data public, it'll be easier to shoot them down for picking, choosing, skewing, and what else.

    There is no reason why this kind of data should ever be "secret"

    Surely it's not hard to see the dynamic that will unfold from this. Yes, the truth is "out there" for better or for worse. However, scientists will have to spend an enormous amount of time and money defending their work against cheap public shots from unqualified critics, instead of a smaller number of competent but dissenting colleagues. That will mean less time for doing research, preparing publications and writing grant proposals.

    Consider also that even an expert can misinterpret raw data. Usually it takes an intimate knowledge of how the data was collected, the characteristics of the instruments, etc., in order to interpret it properly. Frequently, the only people with the proper knowledge to interpret raw data are those who collected it, built the instruments used for the measurements, or both. Without some kind of curation, raw data can generate far more "noise" than "signal."

    --
    If it weren't for deadlines, nothing would be late.
  23. Funny how nacturation is against sharing by Anonymous Coward · · Score: 0

    Funny how nacturation is against sharing when it comes to other intellectual property, but 150% behind this one.

    1. Re:Funny how nacturation is against sharing by nacturation · · Score: 1

      Funny how nacturation is against sharing when it comes to other intellectual property, but 150% behind this one.

      Ah, I have a secret admirer. How long have you been following me?

      Anyways, if a movie were to be made made entirely through public funds, then you ought to be able to share that movie. Are you unable to see the difference between publicly and privately funded efforts?

      --
      Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
    2. Re:Funny how nacturation is against sharing by LingNoi · · Score: 1

      In the real world however this doesn't seem to happen..

      The BBC in the UK is publically funded and they make money of various sales from their publically funded shows and movies.
      The MET office which collects weather data in the UK sells its more detailed reports on to airports and other businesses.

      The idea is that by making these publically funded organisations more profitable then we can spend less tax on them (like that ever happens). However this situation with universities holding copyright to data is different. It's simple greed for personal gain and as I have a personal grudge against the parasitic universities in england (which take tax payers money and use it for marketing to international students to get even more at the expense of english students. Oh, and the middle management that only care about office politics) I say fuck'em.

  24. Re:Good and bad by jfw · · Score: 1

    "Scientists" scared of goofy analysis are priests, not scientists. Take their funding away and use their PhD parchment for toilet paper.

    Nonsense. I have much better things to do, like reading and posting on slashdot, than respond or deal with every crack pot who has an axe to grind but has no real idea how to do it. Seriously, you want to analyze the data? Go collect it yourself.

  25. Re:Good and bad by Anonymous Coward · · Score: 0

    Mod parent up!

    This is most definately not a troll.

  26. The data spans 40 years by Nicolas+MONNET · · Score: 1

    Do you think grad students were collecting data in the field on iPads in the 1980s?

    Most of the data is probably in the form of moldy old penciled notebooks, core samples, B&W photo negatives and microscope slides. I hate to break it to you, but you know what, except maybe in physics or electrical engineering, not all experimental data was systematically recorded digitally until 15-20 years ago.

    They collated, analyzed their data at the time, published their results in peer reviewed journals, and that was good enough. Now this judgement will require them to waste countless hours digging through old archives, scanning lab notebooks and so on, and for what? For one fucking idiot who apparently didn't even bother to read their latest paper in which they demonstrate that tree ring data for the trees they studied (oak or something) does not correlate with temperature but with summer rainfall.

    1. Re:The data spans 40 years by dmwst30 · · Score: 1

      You're the ignorant fucking idiot here, not the person making the request.

      The request was made back in 2007.

      Mann et. al., in 2008, used Oak ring reconstructions, including several from this same group, freely for temperature reconstruction.

      The paper saying the Oak rings don't correlate with temperature came out in 2009. Nary a peep that the Mann '08 paper was at all flawed as result!

      So they not only shared data with some groups and not others, the request was relevant and would have remained relevant if responded to in a timely manner. The researchers did not even bother to reply that they were actively analyzing the same issues he wanted to examine, they just flat-out refused. If they had replied that they are planning on publishing a paper on the same topic, and given a time-line to share their data, this would have been a non-issue.

    2. Re:The data spans 40 years by wisty · · Score: 1

      In the 1980s, science was about collecting data, then doing some basic statistical tests.

      Now, papers get rejected if the researcher just collects data, but doesn't add some methodological twist into the analysis. No fuzzy k-means cluster Bayesian chi test? No paper for you! (Yeah, that's an exaggeration).

      As methods get more and more convoluted, making the data available becomes more important. Researchers beginning a project can use the data to validate their computational procedures, before they try to replace the data with new sources. At the same time, they are validating that the other guys work was correct. Face it, lots of errors in papers will be due to the analysis these days.

    3. Re:The data spans 40 years by Ctrl-Alt-Del · · Score: 3, Insightful

      Unfortunately, Climategate proved that, at least in the field of climate research, "peer review" is worthless; Mann et al were actively conspiring to ensure that only "friendly" eyes carried out the reviews; anyone thought to be showing signs of scepticism were blacklisted, whether individuals or publications.

      To add to that, Glaciergate proved that much of what was claimed to be peer-reviewed was actually just regurgitated propaganda, often based on anecdotal evidence (reminisces of mountaineers published in a student rag? Puh-lease!)

      So, appeals to authority ("oh but all this research has been peer reviewed") just don't hold any more. Not until all the data and all the methods used to arrive at the results are made available, and the results can be independently confirmed or denied, can we say whether the research was worth the weight of mouldy notebooks it was archived on.

      --
      "Life is like a sewer - what you get out of it depends on what you put into it" - Tom Lehrer
    4. Re:The data spans 40 years by Anonymous Coward · · Score: 0

      Excellent crimestop citizen. Doublethink plusungood. Moderate downdoubleplus.

      Trust authority, they will never lie to you. We have always been at war with CO2. I mean methane.

    5. Re:The data spans 40 years by Grumbleduke · · Score: 1

      fyi, this is how many of us read your comment:

      Unfortunately, Climategate proved blah blah blah I'm pushing an agenda uphill blah blah blah...

      the brain said to stop reading at the second word. momentum brought in the third word which just confirmed the notion.

      Why was this modded flamebait? I know it isn't the most civilised and politely-worded post in this thread, but it does have a point. I'm no scientist (degree in maths) but I have been following this business with the climate change "controversy" from a safe distance, reading both arguments on either side and looking at available (non-technical*) evidence and have come to the conclusion that this "huge, controversial debate" is only going on in the media, not within the scientific community.

      From what I've seen over the last 10-15 years, climate scientists got this sort of stuff pretty much sorted out a long time ago and, much like the same debate that happened over the last few decades with the negative effects of smoking, what we are seeing now is those companies with an interest in this issue waking up to the scientific community and doing everything they can to fight it. I hope we will see the same pattern here; scientist suggests something, scientists investigate, scientists accept; companies kick up a fuss, media 'investigates', media stirs up controversy (it seems to think it exists to do that - and in some ways, it does), overwhelming scientific evidence finally convinces media, companies shut up and try to work solving their problem rather than pretending it doesn't exist. Now, obviously there are still some people who think that smoking isn't bad for ones health and so there will likely be some that continue to dismiss global warming as a fantasy, but there are still those that suggest the Earth is flat or dismiss evolution as a pack of lies...

      Anyways, my point was that the parent is correct in that, by appending "-gate" onto Climate, one demonstrates that this is an issue being discussed in the popular media, with catchy headlines and phrases, not in a scientific community, as such, many people (including myself) are tempted to stop reading after that word appears (although I didn't).

      Having the "raw data" won't help anyone with this. Those who can actually understand it probably already have it (or as close as possible) and don't really care. Those who can't shouldn't be playing with things they don't understand and are only going to create more confusion.

      Oh, and don't get me started on the word "proved"... that has no place outside mathematics.

      *As a mathematician not someone with a phd in climatology, I won't pretend to be able to understand the actual data, and am highly sceptical of anyone not in that field who claims to be able to.

  27. Teaching? by jfw · · Score: 2, Interesting

    OK, why does this argument not also apply to teaching? I am paid to teach and do research from the public purse. My teaching is available to any one who meets certain standards and pays a user fee. Access to data should be the same.

  28. Re:Good and bad by jacksonj04 · · Score: 1

    Unless you happen to be a scientist in a related field, raw data tends to be next to useless. Anybody can draw pretty graphs in Excel and get worried about a rising trend line, declining trend line or anomalous result but it takes someone who knows what they're talking about to explain what they actually mean.

    --
    How many people can read hex if only you and dead people can read hex?
  29. Re:Good and bad by interkin3tic · · Score: 3, Insightful

    But with the data public, it'll be easier to shoot them down for picking, choosing, skewing, and what else.

    Not sure what regulations are on "release all data to the public" but seems like there are loopholes big enough to drive a bus through. For instance, in my field, no one but me knows how many cells I looked at. Maybe that thing I said happens in these cells happens in all those cells. Maybe I looked at 300 before seeing one doing what I said, took a picture of that one, and that was that. All my data would be that one cell I cherrypicked.

    Even if I did take pictures of all 300, no one knows but me. Those other 299 can dissapear.

    If I'm -not- evil though, this could hurt me. If I looked at say 3000 cells, and 10 were doing a thing that I thought was significant, I could have my reasons. Maybe the other 2990 were the wrong cell type or something. Being the expert, that might be obvious to me just from looking at them. A non expert looking at them might not see that. They would just see that out of 3000 cells, I chose the 10 that supported my data. They might call foul without bothering to have me explain myself.

    There's no reason the data should be secret, but most data doesn't stand on it's own, and writing up supporting information to -all data gathered- just isn't going to happen.

  30. So a non expert by geekoid · · Score: 2, Interesting

    wants to use data that wasn't used for climate change and models in order to prove that the studies that didn't use them are flawed.

    Add to that a reporter who continually overstates anything the climate change denilist say, I'm sure it will confuse even more people.
    This should be fun.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    1. Re:So a non expert by Attila+Dimedici · · Score: 1

      They want to show that data that was collected for climate research was not used because it not only failed to support the theory that the researcher proposed but contradicted it.
      Yes, the data was not used for climate change articles, but it was collected for climate change research. Those asking for the data are suggesting that the researchers did something like the following:
      Theorize that "If A, then B". Collect 15 data sets. In 14 of the data sets there is either, "A" but not "B", or there is "not A", but "B". In the 15th set there is both "A" and "B". The researchers publish an article saying "This data proves, 'If A, then B'."

      --
      The truth is that all men having power ought to be mistrusted. James Madison
  31. Re:Good and bad by geekoid · · Score: 1

    Except they aren't experts at knowing what is picking, choosing and skewing and what is a correct and practical analysis of data. Prepare to see a lot of cherry pickling and so called 'experts' interpreting this data incorrectly. Many of whom won't even know what a P value is.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  32. Re:Good and bad by geekoid · · Score: 2, Interesting

    Scientists are always concerned when people who have no idea what they are doing try to interpret data. It has nothing to do with being scared.
    For example:
    Lets say this guy cherry picks some data to support his belief and Opera finds out about his 'findings' and puts him on the air. Suddenly 25 million people who aren't qualified to judge his assessment is not hounding politician over incorrect data.
    I just spent about 10 years watch this very thing happen to Vaccines. Some idiots bad study gets on Opera, and a year later people are dying.

    It's a real and serious problem, and the people causing it(media) are doing nothing to fix it.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  33. You meant "peer review vs. copyright", right? by jonaskoelker · · Score: 1

    Peer review and scientific principles don't mesh well with peer review though [...] Ridding peer review and science of copyright

    I assume the first "peer review" was meant to be "copyright"?

    Also, I think you can do peer review with restrictive copyrights just fine. It's the whole sharing-of-results that goes away if there's a for-profit journal owning all the results.

    Also, the article really isn't about sharing the results themselves, but sharing the data that informs and/or is the foundation for the results. Copyrights really doesn't enter into this question (IANAL TINLA) unless the data is contained within the article. By way of analogy: I can collect data on my computer's fan speed, you can come ask me for that data, and I can say "no way, go 'way"; I don't need copyrights on my data collection to not give it to anyone.

  34. Re:Good and bad by TubeSteak · · Score: 1

    "Scientists" scared of goofy analysis are priests, not scientists. Take their funding away and use their PhD parchment for toilet paper.

    Big business has a long history of setting up think tanks and foundations in order to churn out disinformation.
    In the past I've brought up the tobacco industry as a prime example of business producing bad science in order to stave off regulation.
    Less pernicious, but equally anti-science, are creationists, anti-vaxxers, and those pushing abstinence only.

    I disagree with locking the data behind university walls, but it's amazingly naive to think that scientists shouldn't be scared of "goofy analysis".

    It only takes a few morons in a hurry to poison what could be an otherwise rational debate.
    See: Death Panels

    --
    [Fuck Beta]
    o0t!
  35. Official notice - AYBABTU by NemoinSpace · · Score: 1

    Curiously, the decision was sent rather obscurely... OFFICIAL NOTICE -AYBABTU.

  36. Re:Good and bad by Areyoukiddingme · · Score: 1

    What does a web browser have to do with it?

  37. How about source code by Anonymous Coward · · Score: 0

    If one desires real transparency in an experimental procedure, One should also release the code that turns data in to the published experimental findings. The can be "black boxes" but the source that implements the experimental should be public so that the experiments are reproducible.
    Ultimately any "computational" experiments results should depend only the data and random seed.
    To go even further a autonomous authority (ex a scientific journal) could assign the random seed to any prospective published research and require experimental results derived from published source code, data and the key.
    The random seed is applicable to methods that introduce random elements in to calculation which is quite common in our days.
    The assignment random seed would prevent an experiment with random elements to be repeated until satisfactory results come out.

    It would be quite hard to

  38. Re: data retention now required too? by qubezz · · Score: 1

    There is no law that researchers need to retain their raw field data(?). After I publish my paper, this kind of hassle could be avoided by dragging the 'research data' directory into the rubbish bin on the desktop. Oops, I just dropped those notebooks into the shredder too, they were a fire hazard anyway...

    If someone wants to peer-review, why don't they get their own grad students to drill their own tree cores, measure them, and come up with their own conclusion. It's not like the institution is hoarding a Rosetta Stone for themselves. Collective independent research will converge on the the most likely scenario and prove or disprove the controversial hypothesis put forth, and using data from the suspect study would only pollute further endeavors.

  39. Peers? by Gonoff · · Score: 0, Troll

    peer review is good

    The trouble is that I don't consider most deniers of AGW to be peers of those who are doing genuine research. They are more likely to be peers with their friends/sponsors in the oil industry and other big businesses.

    Just in case someone wonders, no I don't believe that everyone who does not believe in Climate Change is in the pay of the corporates. There will be some few who have done research and come to different conclusions. They are probably in the minority.

    Peer review is productive when someone actually reviews it. That is not what those FOI requests were made for. They are made with the specific aim of disproving a conclusion they don't like. A review is neutral and could end up agreeing with the other person. This is not likely to happen here. They have made up their mind already.

    --
    I'll see your Constitution and raise you a Queen.
    1. Re:Peers? by sl149q · · Score: 5, Insightful

      As opposed to the proselytizers who are funded by the NGO's and the new "Green" capitalists and rent-seekers.

      One of the more interesting bits of the Climategate emails showed that Mann was happy to share his data EXCEPT to people who he thought would disagree with his methods and results.

      And in this case Mann was also the recipient of the tree ring data showing that again if you agreed with the owners ideas he had no problem getting you copies of what you needed.

  40. Re:Good and bad by oldhack · · Score: 1

    The worst thing you can do is to hide your data because some fool may make a hash of it. The proper thing to do is, if an erroneous analysis get enough circulation, you point out the error. Chances are, you won't be alone in pointing out and so you probably won't have to bother yourself - more so for more frivolous and trivially inaccurate arguments.

    --
    Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
  41. It's a step in the right direction... by El+Fantasmo · · Score: 2, Interesting

    I believe that all public universities (in the US) that cannot prove public money was not used on research, should be required to release the findings/data to the public shortly after it is published. Of course there are exceptions for things involving national security and what not.

  42. Conflicting Laws? by Sir+Mal+Fet · · Score: 2, Interesting

    I wonder how this conflicts with the laws about Privacy of Data. For example, if a company shares a dataset that contains sensible information with a University (this CAN be done, at least in my country, with a contract. We compromise to safeguard the data and to not violate Privacy laws, consultants also do this everywhere) for the purpose of developing a model or some other application that needs the data. The professor then publishes a paper with the main (non-corporate secret) results, and uses public funds for an undergrad or something. Does this mean the professor can be sued into giving the information away? Doing this clearly violates the laws on privacy, but would conflict directly with the Freedom of Information Act. Compelling with one law contradicts the other... I do not think that this can be upheld in court for EVERY case, instead it would have to be analyzed in a case-by-case basis using (possible costly) lawsuits. Then again, IANAL, so maybe I'm wrong... (Full disclosure: I am a researcher in data mining)

    1. Re:Conflicting Laws? by OldBus · · Score: 1

      The only law I can see conflicting with the Freedom of Information Act, is the Data Protection Act. Data Protection deals with what you can do (or not) with information about individuals. Data Protection in general will override Freedom of Information (you can't make a FoI request to find out the home addresses of the institution's employees, for example.) However, in this case tree ring data is not about individuals, so Data Protection doesn't apply.

      The only reason for not complying with a FoI request in this case is that it would be too costly to comply. It appears that the Commissioner disagreed with the University about this, so they must release the data.

  43. Re:Good and bad by oldhack · · Score: 1

    Fair enough. Just return the public grant money.

    --
    Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
  44. Re:Good and bad by Areyoukiddingme · · Score: 2, Informative

    If I'm -not- evil though, this could hurt me. If I looked at say 3000 cells, and 10 were doing a thing that I thought was significant, I could have my reasons. Maybe the other 2990 were the wrong cell type or something.

    Of course you would. And if you truly did find such a strange sample set, you would document those reasons with just such a sentence. Maybe they WERE the wrong cell type, and in your paper you would be expected to say precisely that. Odds are fairly good you would have a citation concerning why those cells are the wrong type, if not several, since any such assertion that 99.7% of your sample set is junk would be unusual enough to require justification of your methodology. Perhaps no better cell culture or separation method is available. This should be easy enough to document and explain. It took me just one sentence fragment, after all.

    Most likely, if there really isn't any better method, there have been multiple papers describing what the limitations are and why, in an effort to formulate a better method. There is probably also active, ongoing research into creating a better method, since any line of inquiry with such poor sources is bound to attract attention. To paraphrase Heinlein: To score an academic coup, find out what everyone agrees is impossible. Then do it.

    Yes, you're probably going to experience an uptick in the noise floor if you're a UK researcher. The timecube guys are out there, and audible now. But complying with the UK directive is easy. Provide the data. You're not required to address specific demands of every crank who claims your data proves the existence of aliens. The people who control your tenure/salary/book deals/whatever read your paper, saw your cite, and moved on. The guy worried about aliens doesn't affect much of your life. Just your inbox.

  45. Re: data retention now required too? by michaelwv · · Score: 1

    Most ethical standards of scientific societies are quite clear that retaining records and the original data are a vital part of the scientific process.

  46. Re:Good and bad by Anonymous Coward · · Score: 0

    Studies on private data of people, like medical data, personal data, etc?

  47. Re:Good and bad by Anonymous Coward · · Score: 0

    The point is that the data presented and even peer reviewed isn't good enough. I've seen lots of errors in published articles, and I would have wanted both code and raw data in order to judge the validity.
    How can you even claim the raw data would not help me or others? You are just making this up without any argument.

    And open publishing is mostly the choice of scientist.

  48. Re:Good and bad by sl149q · · Score: 1

    From Climate Audit: http://climateaudit.org/2010/04/21/mann-of-oak/

    Notwithstanding the considered opinion of Baillie and Wilson that oaks are “virtually useless as a temperature proxy” and “dangerous” to use in a temperature reconstruction, no fewer than 119 oak chronologies were used in Mann et al 2008.

    Among Mann’s oak chronologies were three Baillie chronologies: brit008 – Lockwood; brit042 – Shanes Castle, Northern Ireland; brit044 – Castle Coole, Northern Ireland.

  49. Re: data retention now required too? by sl149q · · Score: 4, Insightful

    If people cannot replicate your results it isn't science.

    And with Climate Science part of the process is showing how you collected and interpreted the data. If you are not willing to share the raw data so other researchers can attempt to replicate your methods and results then don't bother publishing.

  50. Re:Good and bad by Anonymous Coward · · Score: 0

    That's charmingly naive or deliberately ignorant of the state of affairs in our modern society.

    Only the initial report gets made - that someone's "proven" something or "found a mistake" in published research. When what the person's found is shown to be his own error, even a deliberate distortion, that's seldom reported, and never as widely or clearly.

  51. Re:Good and bad by finarfinjge · · Score: 3, Insightful

    MOD PARENT UP!!

    The problem that the climate scientists have created for themselves is that they are hiding the data from everyone. Up until a few months ago, these requests were relatively rare. Some of the requesting parties actually have fairly strong credentials. Steve McIntyre may be hated by the folk at realclimate, but he is an IPCC reviewer. To stonewall him is a little different than refusing to provide it to Jenny McCarthy.

  52. Re:Good and bad by craklyn · · Score: 1, Insightful

    For many academic scientists (i.e. professors, post-docs, graduate students), a part of their pay is the ability to publish their research findings. It takes long thought and work to devise and carry out experiments which gather pertinent data. It's not unreasonable to allow some time for these scientists to analyze their data and properly understand it.
    If you mandate all data be immediately made public, the researcher can be "scooped" by anyone. This is bad for science because it removes the incentive to actually gather the data. This is one argument for why data may be kept internal, at least for a while.

  53. This is definitely a good thing by OrwellianLurker · · Score: 2, Insightful

    I am a pretty big cynic, and I remain unconvinced that AGW is a significant problem. It doesn't help that the raw data isn't disclosed. I wish scientists would go back to doing science and quit trying to be policy makers.

    --
    'Political power grows out of the barrel of a gun.' - Mao Tse-tung
    1. Re:This is definitely a good thing by allawalla · · Score: 1

      If only policy makers would stop being beauty queens, and the general public pageant voters. We should all get back to our proper places...

  54. Re:Good and bad by martin-boundary · · Score: 2, Informative

    That doesn't matter. The important thing is that the attacks are made. Even if every one is shown to be completely wrong, people will still remember all those (erroneous) anti-global warming reports.

    People don't matter. Science doesn't advance by asking what Aunt Rosie from Ohio thinks about a particular result. Those who matter are scientists, and scientists read peer reviewed journals. Peer review is all about filtering out all those attacks so that nobody who matters needs to read them.

  55. MOD PARENT UP by Anonymous Coward · · Score: 0

    Now if only the same rules were applied to the fraudsters who promote evolutionism as it is to the fraudsters who promote global warming. Instead, the evolutionists will just hide behind the same bullshit arguments about the science being "settled" and we will have another 200 years of lies before the real truth comes out.

    Why is this marked Troll?? It's a beautiful example of satire, and a perfect argument against climate deniers.

  56. Re:Good and bad by pkphilip · · Score: 1

    This is a straw man argument. The data has to be public - I don't want the scientific community impose their own version of the nanny state.

  57. Re:Good and bad by TofuDog · · Score: 1

    Scientists are universally interested in protecting science (disclosure; IAAS). The problem is not in sharing data with other scientists (i.e., those trained in data analysis and objectivity), it's sharing the data with "cynics" who have a conclusion they'd like to cherry-pick supporting data for. It won't pass peer review, but that won't stop an ideologue from posting his "analysis" on the web, etc. and feeding non-objective BS into the policy debate.

  58. Please no. by HornWumpus · · Score: 1

    An archive of professors 'teaching' would be useless except as a cure for insomnia.

    Putting together a proper multimedia course is hard work.

    Putting up 3000 bad videos with even worse sound of 99% bad lecturers all teaching the same material differently and in different order is damn near useless.

    So build a web sight, convince the profs to all contribute (yeah right), collect data and rank the videos in order of quality and organized by relevance to each other.

    The undergrads will still sleep off their hangovers through the original lecture.

    You know its true.

    --
    John McAfee 'It was like that time I hired that Bangkok prostitute; to do my taxes, while I fucked my accountant'
  59. why are they so afraid of releasing the data? by Adult+film+producer · · Score: 1

    Whats the big deal? Publish the data.. so what? Maybe someone will find a few errors and then it can be corrected. IMHO its a great idea... if some climate denier wants to mangle the data they should be free to do so, so what? Its not like their bullshit will be published in scientific journals.... it will die a quiet death in the extremists blogs.

    Nothing to be afraid of here.. in fact, these guys should be proud of the work they did and be willing to give to everyone that asks.

    1. Re:why are they so afraid of releasing the data? by ub3r+n3u7r4l1st · · Score: 1

      because by releasing the data, the world will know the entire research is a scam trying to defraud USA and China hard-earned money funneling to cap & trade scheme setup by Al-Gore, Goldman Sachs and alike.....

    2. Re:why are they so afraid of releasing the data? by pydev · · Score: 1

      Generally, people often don't like to publish data because they invested a lot of time in gathering the data. If they have to publish it, presumably along with the first paper they write, many other groups can take the data and start analyzing it and publishing papers on it without investing work in gathering their own data. Since scientific careers stand and fall with publications, that's a serious concern to scientists.

      I like the idea of all data being public, but I don't know what a good solution is or what the consequences of this decision are.

    3. Re:why are they so afraid of releasing the data? by 16K+Ram+Pack · · Score: 1

      Probably because he might find a problem with it.

      The bottom line is that there is a massive failure rate with experimentation. The guy might discover nothing. He might come up with some nonsense. But what if he finds something useful?

      And there are strong arguments against "only official scientsts should do this stuff". The man who solved the problem of determining longitude wasn't a scientists but a clockmaker and the scientists tried to discredit his method in favour of their own for years, despite it being far more accurate.

  60. Re:Good and bad by Anonymous Coward · · Score: 0

    "Scientists" scared of goofy analysis are priests, not scientists. Take their funding away and use their PhD parchment for toilet paper.

    The issue is that much of the work in science is producing a high quality interpretation of data. That requires knowledge of the field, the methodology used to collect those data, statistics, good judgment, etc. Conversely, it takes nothing to create a poor interpretation of data that is indistinguishable from the real thing. That is what is potentially dangerous.

    Of course science invites criticism, but it really only benefits from well informed criticism. And the criticism needs to be subject to criticism. Everything else is just a lot of noise. Better access to data is fine, at face value, but it will only ever really help in a meaningful way when it comes coupled with good critical thinking from all parties involved.

  61. Raw data can be useless by Roger+W+Moore · · Score: 4, Interesting

    Opening the data up for free access means that other groups, who have more interest in scooping than being right, have more ability to do that scooping. That leaves the people who did the work in the cold.

    That is not hard to achieve: someone has to make an FoI request, the cost to prepare the data has to be estimated, someone has to get hired to collect and format the data and then the data is released. That can take a considerable amount of time.....but that's not the only issue. In my field of particle physics raw data is generally useless unless you understand how it was collected and how to analyse it.

    Even assuming that you had several petabytes of disk/tape available to store it, raw data from ATLAS would be completely useless to you unless you really understand the detector "warts and all". Trying to understand this data without access to the detector itself and the ability to test and cross-check ideas looking at (and sometimes carefully tweaking) the hardware is literally impossible....and that is before you get into the thorny international issues about who did what and so whether it falls under any one country's laws.

    These issues were discussed on a previous experiment I worked on in the US and the conclusion was that it did not serve the public to have data released in just about any form: the raw data was useless and even the processed data still had considerable "quirks" which required understanding (e.g. acceptance drops at detector boundaries etc.). This was aptly demonstrated by a pilot project which resulted in no interest at all from the public but which worryingly attracted a few nutters who were more interested in proving their pet theory than in doing science.

    So while I am very sympathetic to the "the public paid for it the public should be able to access it" argument I do not think that the public's interest is best served by releasing raw data in all (most?) cases. The best way to serve the public interest is to ensure that results and ideas arising from that research are freely available to all and allow the public to build on that.

    1. Re:Raw data can be useless by budgenator · · Score: 1

      If you want ground station air temperature data it's at ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/, what's there is as close to raw primary data as anybody has. Everybody else's products have been "value added" i.e., cooked, homogenized, and adjusted.

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
    2. Re:Raw data can be useless by Citizen+of+Earth · · Score: 1

      Everybody else's products have been "value added" i.e., cooked, homogenized, and adjusted.

      ... and pruned and faked to fit their pet theory.

    3. Re:Raw data can be useless by Anonymous Coward · · Score: 0

      You've explained why releasing data to the public is less good for them than it might appear (since, in most cases, it's not going to be any good to them). But you haven't given a reason why it's actually bad - so surely the balance still falls in favour of making it available?

    4. Re:Raw data can be useless by Anonymous Coward · · Score: 0

      [In my field of particle physics] raw data is generally useless unless you understand how it was collected and how to analyse it.

      True, so what? I assume that most people interested in "the raw data" would be intelligent enough to recognize that and would ask/research for information about the missing details of the experiment. So the producer/publisher of the raw data just have to provide (surprise!) a bit more data to the public. I certainly hope, that the rules for public disclosure of scientific data would be interpreted aiming in this direction instead of being governed by the "you would not understand it, anyway" attitude.

      and that is before you get into the thorny international issues about who did what and so whether it falls under any one country's laws.

      Point taken, but this is another (sad) issue. My, admittedly idealistic, stand on this that the results (raw, medium, well done, over cooked etc.) of science should be available to everybody with no restrictions.

    5. Re:Raw data can be useless by Anonymous Coward · · Score: 0

      I do not think that the public's interest is best served by releasing raw data in all (most?) cases

      That is not true for neuroscience data sets, where stable ontologies are in the works to provide good annotation and thus a priori structured data sets. In other words, publically available neuroscience data sets are very useful, and can be directly analyzed if familiar with the data set (as always). Even stronger, making such data available will counter possibly cleaned up models and allow to rerun the analysis.

      I'm not familiar with your data sets, however for neuroscience or cognitive psychology, such availability would be highly beneficial (more over for running meta-analysis).

    6. Re:Raw data can be useless by gnud · · Score: 1

      The FoI act doesn't really exist in the UK, though.

    7. Re:Raw data can be useless by jabuzz · · Score: 1

      Not been to the UK recently have you

    8. Re:Raw data can be useless by jcupitt65 · · Score: 1

      The UK FoI act was passed in 2000, here's the page for making requests:

      http://www.justice.gov.uk/requestinginformation.htm

    9. Re:Raw data can be useless by gnud · · Score: 1

      oh save me, they stole the name.

    10. Re:Raw data can be useless by budgenator · · Score: 1

      A lot of it is faked, especially arctic data, how likely is a civilian contractor working on the DEW line or a Russian out in Siberia to go outside, in -40 degree weather and possible get eaten by a polar bear to read a thermometer verses get writing what they got yesterday +- 1 or 2 degrees?

      --
      Apocalypse Cancelled, Sorry, No Ticket Refunds
  62. Re:Good and bad by oldhack · · Score: 1

    "... it takes nothing to create a poor interpretation of data that is indistinguishable from the real thing."

    Good thing you posted as AC, cuz that's one stupid statement - you are implying the science is crap.

    --
    Fuck systemd. Fuck Redhat. Fuck Soylent, too. Wait, scratch the last one.
  63. Think about what you are asking by Roger+W+Moore · · Score: 2, Insightful

    If a lab has been spending my tax money for 10 years, I want my employees to give me my data right Goddamn now. .....if you're taking my money, you work for me.

    Just stop and think for a second about exactly what it is that us scientists are being paid to do. We are NOT being paid to collect data we are being paid to figure out how the world works and how to apply that knowledge for the betterment of mankind. The data is an end towards that means.

    Now, do you REALLY want us to spend a serious fraction of our time and money preparing and making available the raw data in a form which will probably be useless to you instead of analysing and coming up with results which you are far more likely to find useful? Is that REALLY the best way for us to serve the public interest?

    Examples of how this could go horribly wrong immediately come to mind: it could delay finding medical cures as researchers spend time releasing, instead of analysing data, companies could request the data and develop/patent drugs which YOU will then pay through the nose for, nutters will start horribly misrepresenting the data to "prove" their pet theory on warp drive etc. etc. How does any of this serve the public interest?

    If you want an even clearer example: taxpayers fund each country's intelligence agencies. So does this mean that since you own all the data every tax payer should be able to request to see it whenever they want? Obviously not because it would not be in YOUR best interest for such data to be public. While the reasons are different the conclusion is the same for scientific data. It may be your data but you are paying us to collect it, analyse it and come up with results which ultimately improve yours, and everyone else's, standard of living.

    1. Re:Think about what you are asking by rjiy · · Score: 2, Insightful

      Now, do you REALLY want us to spend a serious fraction of our time and money preparing and making available the raw data..

      Nope. We expect you folks to spend some time thinking up a way so that you don't spend any time at all on "preparing" the supposedly "raw" data _and_ still make it available to the desirous public. Like you know putting up a file on a website with some footnotes. I hear universities have some websites.

    2. Re:Think about what you are asking by Rockoon · · Score: 1

      Now, do you REALLY want us to spend a serious fraction of our time and money preparing and making available the raw data in a form which will probably be useless to you instead of analysing and coming up with results which you are far more likely to find useful?

      So you are the grand arbiter, eh? YOU decide whats useless to others and what isn't, on my dime?

      You are soaking up public money, so there is a very serious conflict of interest if YOU get to play arbiter. Obviously you think that you are smarter than every other person, but here you are wanting to 'prepare raw data.'

      Its raw data. You don't need to prepare it. You just make it available. Its really that fucking simple. Not so smart.

      --
      "His name was James Damore."
    3. Re:Think about what you are asking by smidget2k4 · · Score: 1

      As per FoI requests, data needs to be prepared in some form. I couldn't just send you a 60GB .txt file full of numbers and no information about it. Compiling that information into a useable form takes time and money that could be much better spent on doing research. Data can be incredibly complex and require years of focused background in the subject to understand.

      And if you want to bitch about how we're using YOUR money to do work, then you should fucking pay us more. Shit wages. Especially for anyone who isn't tenure track. Academic research amounts to slave labor already. If you want to get the information, pay someone to come down to the lab and prepare it for you.

    4. Re:Think about what you are asking by qc_dk · · Score: 2, Informative

      I don't think you understand how scientific funding works. I am not given a lump sum and then told to go figure something out. This is how it works in the EU:

      I am given a sum of money. This has to be accounted for. There are a number predefined areas where I can spend this money. During this project I will have to fill in time sheets detailing what I'm spending my time doing. All the different work areas will have spending limits. I.e. I can't just put some more time into community outreach(like preparing data) at the cost of Research and development time. There will be a number of milestones I have to reach along with something called deliverables. Deliverables can be reports or code or raw data if the EU has decided it was of interest. At the same time I also have to prepare papers and so on to keep my position at the university.

      Where do I account for the time spent on giving out data because some random person wants it? Answer: I can't so it will have to come out of my own pocket or I could commit fraud and put my time down into one of the projects and newsflash I'm not going to risk jailtime.

      The fact is that while the public is funding the science, what they are funding is SPECIFICALLY not the data distribution. You don't believe you should have access to Pfizers raw data even though it is funded by the public(through the purchase of medicine), do you? Because what you paid for was a pill and not the data.

      If you believe in this so strongly, please lobby research grant givers to always include funding for public data dissemination, no matter whether the grant giver thinks the data will be of use and has included it as an accountable deliverable.

    5. Re:Think about what you are asking by Rockoon · · Score: 1

      Where do I account for the time spent on giving out data because some random person wants it?
      Everywhere. You account for all of your time.

      If the data is somehow in some random inexplicable state right this second, and your excuse is that you didnt have enough money allocated for "community outreach", then I believe that you should have your grant money pulled immediately.

      In the private sector, YOU GET FIRED if your data never makes it to a usable state. There isnt special "outreach" time for this. Keeping careful and detailed track of your data IS PART OF THE SCIENTIFIC PROCESS and is DONE IMMEDIATELY. This falls under "research." You keep your shit organized.

      That community outreach time you need.. if you dont have enough to take that ALREADY carefully documented and tracked data that you ARE SUPPOSED TO HAVE and stick it on an FTP site, then you need more than 15 minutes of time. Is that what you have? Less than 15 minutes of "outreach" time? Really?

      Stop making excuses. Put the data up. Click click click. Done.

      --
      "His name was James Damore."
    6. Re:Think about what you are asking by Rockoon · · Score: 1

      As per FoI requests, data needs to be prepared in some form. I couldn't just send you a 60GB .txt file full of numbers and no information about it.

      Why is your data in a 60GB completely undocumented text file? You've already failed to fulfill your responsibility if thats the state its currently in.

      In science, WE DOCUMENT THE DATA.

      --
      "His name was James Damore."
    7. Re:Think about what you are asking by qc_dk · · Score: 1

      Where do I account for the time spent on giving out data because some random person wants it?

      Everywhere. You account for all of your time.

      BUT WHERE? I can't argue that it is research time. I can't argue that it is a support action. Unless it is directly covered in the description of work(600 page legal document). So where do I account for it? Remember, I will have to argue against the EU commission that I decided that the agreement we had was stupid because some random person wanted some intermediate data, or I will have to pay out of my own pocket for the time spent. If I don't account for my full time I won't get my paycheck reimbursed fully.

      If the data is somehow in some random inexplicable state right this second, and your excuse is that you didnt have enough money allocated for "community outreach", then I believe that you should have your grant money pulled immediately.

      Why? If that was not stipulated in the grant?

      In the private sector, YOU GET FIRED if your data never makes it to a usable state.

      Exactly. The operative word being "makes". What data that must be published and what results that are required are stated in the funding contract. That will be published. That is the contract I have agreed to.

      That community outreach time you need.. if you dont have enough to take that ALREADY carefully documented and tracked data that you ARE SUPPOSED TO HAVE and stick it on an FTP site, then you need more than 15 minutes of time. Is that what you have? Less than 15 minutes of "outreach" time? Really?

      I have no "outreach" time accountable on the two EU projects I'm going to be in in FP7(the current research funding framework programme in the EU).

      Stop making excuses. Put the data up. Click click click. Done

      And if I have access to code and data developed in the US, that could be used for nuclear weapons simulation? If I were to release that to the public I would get a quick visit from representatives of the United States Government in dark suits, who would like to have a "little talk".

      Before I'm putting anything up that isn't covered by my contract, I'll need lawyers and an OK from the US partners and government. That's a little more than 3 clicks and 15 minutes. And I'm not paying the lawyers!

    8. Re:Think about what you are asking by smidget2k4 · · Score: 1

      It would probably be a dump of a large, fully annotated and understood database system. Or there are plenty of files I have that are several hundred megabytes where the columns are x y z and are motion tracking files, which are fully annotated in perfect order as far as protocol is concerned, but if I simply handed you that file with no preparation of what any of it actual is, then you have no idea what the hell to do with it.

      When we analyze that data, we know exactly what the limitations of the system that collected it are, what parts of the data are good and which parts are bad, what went wrong during that study, etc. But do to privacy issues, we cannot turn over videos of the subjects performing the tasks (this is an IRB rule, not ours) without consent of the subjects. Handing you a text file without the video is worthless. Data without context, understanding of exactly how every piece of it was collected, or understanding of its limitations is meaningless at best. At worst, it is dangerous in the hands of someone with a larger megaphone who does not understand the data (as the AGW deniers love to do).

      Like I said though, I think that data should be released, but releasing data as soon as it is generated rather than after the relevant papers have been published is an undue burden on the already heavily time constrained researchers.

    9. Re:Think about what you are asking by Rockoon · · Score: 1

      BUT WHERE? I can't argue that it is research time.

      Yes you can. It *is* research time. Keeping your data organized and usable IS PART OF RESEARCH.

      Why? If that was not stipulated in the grant?

      Because you are doing science, asshole. Thats right.. the gloves are off, because you are also doing it on public money. You are doing FUCKING SCIENCE. THE DATA IS THE MOST IMPORTANT THING yet you seem to think that you get to treat the data like shit, being completely careless with it, not even having any of it organized, ..

      Its this fucking simple:

      YOU ARE NOT THE RESEARCH. THE DATA IS. IT SHOULD ALWAYS BE ORGANIZED, EVEN IF YOU ARE A SLOPPY INCONSIDERATE FUCK THAT WASTES PUBLIC MONEY.

      --
      "His name was James Damore."
    10. Re:Think about what you are asking by Rockoon · · Score: 1

      It would probably be a dump of a large, fully annotated and understood database system. Or there are plenty of files I have that are several hundred megabytes where the columns are x y z and are motion tracking files, which are fully annotated in perfect order as far as protocol is concerned, but if I simply handed you that file with no preparation of what any of it actual is, then you have no idea what the hell to do with it.

      How hard is it to create readme.txt, and then type in "This is motion tracking data, format is X Y Z, its from ..."

      Give me a fucking break. Stop making excuses for why you do not want to release your data. Your excuses are so weak that it is clearly obvious that YOU SIMPLE DONT WANT TO RELEASE THE DATA. YOU SEEM TO THINK THAT ITS YOURS.

      --
      "His name was James Damore."
    11. Re:Think about what you are asking by smidget2k4 · · Score: 1

      Again, releasing data is fine. I am all for it. I think data should absolutely be released once the lab that generated is finished with it, and that that should be shortly after the initial batch of papers is published. This also gives the scientists time to fully understand and test the data, and understand the limitations of how it was collected.

      Data should be released, yes. And like I said, you can always pay more taxes so labs can hire a grad student or someone to release data for them as soon as it is generated.

      But then you'll just complain about the data you can't get because of privacy constraints or NDAs. Labs that generate data should have first dibs on that data, but I absolutely agree with you that is should be released on a reasonable time scale after their publications from it were made. However your demands of releasing it immediately are unreasonable and counter-productive to science.

    12. Re:Think about what you are asking by qc_dk · · Score: 1

      BUT WHERE? I can't argue that it is research time.

      Yes you can. It *is* research time. Keeping your data organized and usable IS PART OF RESEARCH.

      No it isn't unless it is specifically stipulated in the contract I have signed with the funding agency.

      Why? If that was not stipulated in the grant?

      Because you are doing science, asshole. Thats right.. the gloves are off, because you are also doing it on public money. You are doing FUCKING SCIENCE. THE DATA IS THE MOST IMPORTANT THING yet you seem to think that you get to treat the data like shit, being completely careless with it, not even having any of it organized, .. Its this fucking simple: YOU ARE NOT THE RESEARCH. THE DATA IS. IT SHOULD ALWAYS BE ORGANIZED, EVEN IF YOU ARE A SLOPPY INCONSIDERATE FUCK THAT WASTES PUBLIC MONEY.

      Ah, so because I'm in research I should work for free and pay the cost of publishing all intermediate data? And might I add on something completely useless because some random person on the internet believes he knows better how the public is served on a project he knows nothing about than the people giving out the grant and the independent reviewers (often "competing" scientists with intimate knowledge of the field) hired by the grant giver to insure the public interest is served?

      You don't seem to be able to get through your head that spending time and money on publishing intermediate data would be wasting public money. But, if you feel so strongly about it why don't you contact the politicians who have decided the framework for publicly funded research? Or, maybe you could try to become a reviewer on these projects.

      But until you change scientific funding to a massive waste fest, I'll keep to the contract I have signed with the funding agency.

    13. Re:Think about what you are asking by Roger+W+Moore · · Score: 1

      So you are the grand arbiter, eh? YOU decide whats useless to others and what isn't, on my dime?

      No, I do not get to decide, governments do based on the public interest (at least that's the theory). The reason for this is the it is not just your money but mine and every other taxpayer's as well. So as both a taxpayer and a scientist I certainly have a right to express an opinion. If governments want to up research expenditure to pay to make raw data available then that is their decision but as a tax payer I think it would be a complete waste of money (that's an opinion based on experience of actually using scientific data myself which, since you seem to be very anti-science I suspect you lack). As a scientist I am not bothered as long as extra funds are made available to hire people and buy the equipment to do it.

    14. Re:Think about what you are asking by Rockoon · · Score: 1

      Again, releasing data is fine. I am all for it. I think data should absolutely be released once the lab that generated is finished with it, and that that should be shortly after the initial batch of papers is published.

      Oh come on.. rationalize this time frame ("shortly after we publish") for us.

      Are you honestly claiming that its not reasonable to have the data ready, prior to publishing? Also, how can the peer review process work if the data isnt available until after publishing?

      Apparently there isnt any review of the data after all, right?

      I'm calling bullshit. You just want to be able to publish before releasing because you think that you have more rights over the data than the people that paid for it.

      --
      "His name was James Damore."
    15. Re:Think about what you are asking by smidget2k4 · · Score: 1

      Generally by the time publication comes around, you've already prepared your data for presentation because you have a good idea whether or not the reviewers are going to ask for your raw data/processing code/whatnot. That varies depending on the data set and the reviewers.

      The biggest part that is the problem of releasing the data right away is because you don't know if the data is crap without a lot of vetting. Once and a while, it doesn't get caught until a fresh pair of eyes (reviewers) take a look at it and pick up on something maybe your team hasn't thought of.

      The other problem is then you would create a whole industry of research snipers who would poach papers from researchers, thus eliminating the benefits of gathering huge, difficult to collect data sets. There would be no incentive to spend 10 years gathering data, only to have your research sniped as you are writing up your papers.

      Sure, the data belongs to the people (and rightfully so, information should be free), but you have to give some incentive to researchers who are already severely underpaid and overworked to painstakingly collect data. Pretty much the only thing they have is recognition.

    16. Re:Think about what you are asking by Rockoon · · Score: 1

      The biggest part that is the problem of releasing the data right away is because you don't know if the data is crap without a lot of vetting. Once and a while, it doesn't get caught until a fresh pair of eyes (reviewers) take a look at it and pick up on something maybe your team hasn't thought of.

      umm... releasing the data generates "fresh pairs of eyes", pretty much the ultimate in "vetting."

      So this is not a reason not to release the data.

      The other problem is then you would create a whole industry of research snipers who would poach papers from researchers, thus eliminating the benefits of gathering huge, difficult to collect data sets. There would be no incentive to spend 10 years gathering data, only to have your research sniped as you are writing up your papers.

      The benefits of gathering huge "difficult to collect" data sets with PUBLIC MONEY, is that PUBLIC MONEY pays your salary. Period.

      --
      "His name was James Damore."
    17. Re:Think about what you are asking by smidget2k4 · · Score: 1

      Generally you want the other pair of eyes to also be an expert in your field, so they can criticize your work. If bad data is released into the wild, it is very hard to get back and make sure nothing gets published or released with it.

      Also, if money was what researchers were after there are plenty of other fields where you can make tons more money for much less work.

      You would be removing the main incentive for researchers; prestige amongst their peers and a feeling of having contributed to the furthering of human knowledge by not allowing them to publish their own data sets. You would also be removing many of the checks and balances internal to the system that help weed out a lot of crap.

      Also, I don't get why "release when published" is so crazy. Do you go down to Washington and demand that your Senate rep opens up Word on his computer to show you the half completed/fleshed out draft of the bill he was working on? Do you want to go into the police station and demand to know their list of informants and undercover cops because they are paid with public money? Anything with the military?

      Your position is unreasonable and counterproductive.

  64. Re:Good and bad by shaitand · · Score: 1

    It is neither the duty nor the right of scientists to be the gatekeepers of public interpretation or opinion.

    That is akin to claiming that only ABA lawyers should be permitted access to the law or computer technicians access to benchmarks.

    As far as I am concerned the media can stick anyone they like in front of a microphone to speak on any issue they like. It IS the duty of the VIEWER to access the competence of the individual speaking.

    If you have a problem with the fact that viewers do not do their due diligence than I suggest supporting efforts to add critical thinking and local the curriculum in our grade schools.

  65. Re:Good and bad by shaitand · · Score: 1

    In that case the results will be natural selection at its finest. What is the problem here?

  66. Re:Good and bad by shaitand · · Score: 1

    "If I'm -not- evil though, this could hurt me. If I looked at say 3000 cells, and 10 were doing a thing that I thought was significant, I could have my reasons. Maybe the other 2990 were the wrong cell type or something. Being the expert, that might be obvious to me just from looking at them."

    And no doubt it will be equally obvious to all your expert colleagues should have to defend your decision. The support for your conclusions would thus be in the data.

    If you are basing decisions on things that are not in the data then you're a quack or a fraud and its best it be uncovered anyway.

  67. Re:Good and bad by shaitand · · Score: 1

    "You're not required to address specific demands of every crank who claims your data proves the existence of aliens."

    This annoys me since intelligent life elsewhere more likely than not exists (regardless of whether or not it mutilates our cows or is physically possible for us to reach).

    How about this instead:

    "You're not required to address specific demands of every crank who claims your data proves the existence of a deity."

  68. No we don't! by Roger+W+Moore · · Score: 1

    Science journals have long fought this, because their profit model is strongest when they own copyright and are the exclusive publishers of a paper. Peer review and scientific principles don't mesh well....

    You are getting somewhat confused: raw data is not the same as published papers. If anything releasing raw data would increase the number of papers being published - although likely at the cost of greatly reducing the signal to noise ratio! Peer review and scientific principles mesh incredibly well. Science is founded on reproducibility so you have to explain your work in a manner that your peers can understand and follow in order for them to be able to reproduce and check your conclusions: without that it is not science. That does not mean that peer review cannot be abused but if you find a rotten apple that does not mean that you should never eat an apple again.

    Regarding copyrights you are correct that many of us academics are finding ways to get around the restrictions and/or getting increasingly annoyed with them. However this only applies to published, scientific papers NOT the raw data. So far nobody has come up with a good way to still provide peer review without ending up with a paid journal or one in which you pay to publish (which raises different ethical issues) but I have heard that people are working on developing solutions.

    1. Re:No we don't! by Improv · · Score: 1

      In my research in neuroscience, I've found that with some funders, they actually do want our raw data (anonymised) for public access. This may be a lot of work (particularly when an fMRI study is a bit over a gig per subject) to upload for reasonable studies, but we are happy to provide - there's more than one way to analyse the information, and it makes reproducibility even easier.

      --
      For every problem, there is at least one solution that is simple, neat, and wrong.
  69. Re:Good and bad by shaitand · · Score: 1

    That sort of data is only useful if it is anonymous and double-blind anyway.

  70. Re:Good and bad by Anonymous Coward · · Score: 0

    So, we're supposed to believe that your selection of 10 supporting cases out of your sample of 3000 is correct just because you're an "expert"? Have you heard of the fallacy of authority?

    If you're data doesn't stand on its own (and sure, I'll grant you there might be perfectly reasonable situations where that might be the case) then it is open to criticism, and RIGHTLY SO! Your claim to being an expert doesn't grant you a free pass, sorry to tell you. If that criticism indeed happen then you're free to either a)writing up supporting information to the data UNDER QUESTION, no need to support all data gathered, or b)ignore the criticism, say, on the basis that it comes from non-experts. Your peers will decide if they're OK with what you say, either way. But keeping your government grants without being able to properly justify why it's OK to cherry-pick 10 out of 3000, expert or no expert, ain't gonna happen either ...

  71. For those who don't already know... by damn_registrars · · Score: 1

    The US has had a similar policy for some time. If your research is funded by the NIH (the largest funding source for life science research) you are required to make your published works available freely. Even if you publish in a high-impact expensive journal, your published works must be available freely through the NIH.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
  72. NIH does by damn_registrars · · Score: 1

    does anyone know if the NSF has similar requirements?

    I don't know if NSF does, but NIH definitely does.

    --
    Damn_registrars has no butt-hole. Damn_registrars has no use for a butt-hole.
  73. No problem by PPH · · Score: 1

    Just post the results on the Usenet.

    --
    Have gnu, will travel.
  74. Supreme Court by nometacognition · · Score: 1

    A friend had a bitter complaint about the recent United States Supreme Court ruling about dog fighting videos. She thought that any video that depicted cruelty to animals...yada...yada...yada. If people actually looked at what words were spoken and written by the people producing data, this question about public release would go away. It is that people are not able to look at the data that there are requests for it. If professional statisticians are not working on the data, at least let amateurs. I will get a citation (of the global warming statistics questions) if you absolutely are unable to find it yourself on google/yahoo/bing/askoxford.com

  75. NOT ONLY DATA, METHODS WANT TO BE FREE!! by xtracto · · Score: 4, Insightful

    Simply generating massive amounts of data isn't considered science - figuring out what it means is. I say this as someone who is very good at generating data quickly, but not particularly good at interpreting it.

    Spot on. I have a PhD in Comp. Sci. (Multi-Agent Systems / Market Based Control). One of the things you learn (maybe in you Universitity degree courses or in your first paper presentation) is that data does not mean *anything*, what matters is the interpretation of such data.

    Nevertheless, I am of the opinion that programs used for the generation / manipulation of such data should also be free / scrutinable. Specially those developped during the research as they are also being paid by the tax payers money.

    In the field I am working now (Agent based computational economics) a lot of people do these so called agent-based simulations, then they write a nice paper about what their simulations showed and try to publish it. The problem is that they keep their code! and in that respect they are deffinitely removing a good chunk of the "methods" part of their research. It is absolutely impossible to duplicate that work without the code.

     

    --
    Ubuntu is an African word meaning 'I can't configure Debian'
    1. Re:NOT ONLY DATA, METHODS WANT TO BE FREE!! by HungryHobo · · Score: 1

      I've been trying to implement the methods described in comp-sci research papers this year as well and it's remarkable how many of them

      1: Try to make what they're doing sound more complex than it really is by describing an algorithm with half a page of mathematical symbols where 3 lines of pseudocode would describe what they're doing far more clearly.
      2: Don't include their actual code.
      3: Don't include their data.

      And from implementing a few and eventually falling back on emailing the authors I've come to the conclusion that

      [1] is most common when what they're doing is actually fairly simple and they don't want to admit it.(bonus points if they spend half the paper describing in detail the steps taken by some standard library functions which we really don't need described since it only muddies the waters when trying to distinguish those steps from the important parts of their paper)
      [2] is most common when their methods actually have some significant shortcomings which are only evident when you have working code to test with.
      [3] is most common when the tests are the most vaguely described.

      Do phd students think nobody see's through this???

  76. Re:Good and bad by smidget2k4 · · Score: 1

    Hire people to get the data to you then. I would imagine that most researchers would be fine releasing data if the time and money burden of doing so was lifted from them.

  77. Private funding to public universities by Bazman · · Score: 1

    People should note there is a big drive at UK universities to get more outside funding from corporations and other external funding bodies. Only last week I heard that one of our guys was close to clinching a deal worth several million from a multinational company that would employ several new people and research students.

    Do you think their data will be made publicly available?

    I recently made a grant application to one of the UK research councils (govt funding bodies) and explicitly wrote in that code would be open-sourced and data would be made available. Got the grant :)

  78. Re:Good and bad by smidget2k4 · · Score: 1

    I think he meant "indistinguishable to the layperson". That is a real issue. Jenny McCarthy can say all she wants, and to Joe Blow, it is all the same as what scientists say. Scientists know she is full of shit, but people not trained in science might not be able to tell.

  79. Re:Good and bad by Anonymous Coward · · Score: 0

    There are some studies in which such raw data could be partially patient identifiable - for instance, in rare diseases and small studies.

  80. replicability by pydev · · Score: 1

    People should not forget that climate science is an exception. In most sciences, this isn't necessary because experiments are supposed to be replicable by other scientists. If the experiment can't be replicated, the original paper and conclusions are invalid. Publishing the original data really serves little purpose and may actually discourage (necessary) replication of the experiments.

    Climate science is a special case because it is based on massive observational data that cannot be replicated and that was derived from tax-payer funded satellites and other data sources.

  81. Mathematician Douglas J. Keenan? by Anonymous Coward · · Score: 0

    Really? From his web site "I used to do mathematical research and financial trading on Wall Street and in the City of London; I now study independently"

    What could possibly go wrong?

  82. Universities in the UK by peetm · · Score: 1

    >>...which are all public in the UK

    Not so - there's one private university!

    --
    @peetm
  83. I could do that all day long for $200/hr by syousef · · Score: 1

    You know the multi-billion dollar LHC? Guess what they did their first physics on. Not finding new exotic particles

    I could do that all day long for $200/hr. I'm not finding new exotic particles right as we speak. And last night I did it in my sleep. No takers??? Okay how about a one off special just for today $50/hr?

    --
    These posts express my own personal views, not those of my employer
  84. More details by Sara+Chan · · Score: 2, Informative

    I am the story's submitter. My original submission included a link to the mathematician's web page about this; the page has many more details. There have also been other news stories, e.g. at the BBC.

    The UK Freedom of Information Act has exemptions for data that has not yet been used in publications, vexatious requests, etc.

  85. Privacy trumps publication... by Anonymous Coward · · Score: 0

    One place where your black and white view doesn't work is medical research. Much of the data needed for evidence based medicine and other research is full of private patient data.

    Many researchers need to access "secret" data under restrictions set by confidential data use agreements, or they cannot do the linked analysis necessary for their research. This data cannot be released to the public without violating the privacy of patients, and the "anonymization" techniques people use to try to remove this privacy aspect also destroy much of the utility of the data for future research. It may allow someone to repeat/verify some analysis results, but they cannot for example validate whether the cleaning and anonymization process was correct or whether it damaged the data in a way that affects all results. They need access to the raw data for full verification, as well as for novel retrospective studies.

    To really support the desired scientific process, you need this kind of private data to be put into a sort of trusted escrow. Curated and made available to future researchers, but only under the approval of institute review boards who make sure that appropriate ethical and legal considerations are met. It's not the same as publishing, i.e. releasing to the public.

  86. How's this for bad? by Roger+W+Moore · · Score: 3, Interesting

    But you haven't given a reason why it's actually bad

    It wastes scientists' time that would be better spent analysing the data rather than releasing it, it wastes money collecting and disseminating the data, it pollutes the real scientific results with those of nutters trying to prove their pet theory and, in the case of commercially useful data, it risks having companies use the data to develop something commercially useful that will then be locked away behind patents and the public will be charged through the nose for.

    There is also the more subjective, human issue that if you don't let people who have worked like crazy to get the data have at least the first shot at analysis then recruiting scientists is going to become extremely hard and motivating them to perform large-scale experiments will be even harder if they just have to give the data away - why would you bother if you can just sit around and get the data as soon as it is collected?

    Is that bad enough? There are ways you could mitigate some of the above but the bottom line is that nothing is free: it will cost more money to make the data publically available and, as a taxpayer myself, I see no real benefit from doing it and some serious potential pitfalls.

  87. Proved? by Nicolas+MONNET · · Score: 1

    The only thing that is proven is that denialists are funded by the oil industry and the same "libertarians" funding the teabaggers, such as the bootstrappy Koch heirs.

    At the end of the day, the opinion of the vast majority of scientists, and in paricular that of almost all climatologists, has not changed. A few years ago most national academies of sciences issued a joint statement supporting the IPCC, each with the overwhelming approval of their members. Has any of them gone back?

    No, it's just fucking criminal PR bullshit, the same kind that was used to justify the Iraq war or implement liberticide "anti-terrorist" policies.

    1. Re:Proved? by 16K+Ram+Pack · · Score: 1

      The only thing that is proven is that denialists are funded by the oil industry and the same "libertarians" funding the teabaggers, such as the bootstrappy Koch heirs.

      Motivation of those who would find holes in science is irrelevant. All that matters is if the science is right or not.

      At the end of the day, the opinion of the vast majority of scientists, and in paricular that of almost all climatologists, has not changed. A few years ago most national academies of sciences issued a joint statement supporting the IPCC, each with the overwhelming approval of their members. Has any of them gone back?

      That's just an appeal to credentials, that those who are the members of the right guild, union or club believe something. Again, completely irrelevant to whether something is scientifically correct or not

      No, it's just fucking criminal PR bullshit, the same kind that was used to justify the Iraq war or implement liberticide "anti-terrorist" policies.

      It doesn't matter what your personal opinion is of climate science. One thing is fact: The CRU cannot produce a detailed document showing the process at arriving at their results. They modified raw temperature data and have nothing to show for how that was done.

      You try that in a well-regulated software environment... giving your results to an external auditor without a program spec, or handing over details about a drug to the FDA with a bit that says "some magic happens here", or a patent to the patent office which says "woo cloud".

      I'm not saying the results are wrong or that the CRU aren't bright, honest people. But if we're going to spend billions of dollars on this problem, then the science that it's based on should be a lot more robust than that.

    2. Re:Proved? by Attila+Dimedici · · Score: 1

      And AGW proponents are funded by oil companies at about 10x the amount the those who are "denying" AGW, so who has more economic interest in the results they are producing? I always love these people who complain that those who are skeptical of AGW are just doing it for financial reasons, but the proponents (who are receiving much more funding) are completely unaffected by financial considerations.

      --
      The truth is that all men having power ought to be mistrusted. James Madison
    3. Re:Proved? by Just+Some+Guy · · Score: 1

      The only thing that is proven is that denialists are funded by the oil industry

      I couldn't possibly care less. I don't give a rat's butt if it's the Einstein Is A Dumbass Council that demonstrates that it's really E=mc^2.1 if their results are independently verifiable.

      --
      Dewey, what part of this looks like authorities should be involved?
    4. Re:Proved? by Nicolas+MONNET · · Score: 1

      but the proponents (who are receiving much more funding) are completely unaffected by financial considerations.

      Fuck off. This is simply bullshit, and you know it. Exxon has given dozens if not hundreds of millions to denialists.

      You scumbag reichwing corporate shills.

    5. Re:Proved? by Nicolas+MONNET · · Score: 1

      That's just an appeal to credentials, that those who are the members of the right guild, union or club believe something. Again, completely irrelevant to whether something is scientifically correct or not

      That's parroting typical of a right wing authoritarian. You people are no better than herd animals.

      Nobody but you is saying that something is right because such or such authority says it is. I'm responding to the allegation that "Climategate" proved anything.

      Now go back to your hole.

    6. Re:Proved? by Attila+Dimedici · · Score: 0, Flamebait

      Yes, Exxon is giving millions of dollars to those who are researching to disprove AGW, but Exxon is giving hundreds of millions of dollars to those who are proponents of AGW. Sorry to disrupt your nice little world view, but there is much more money in being an AGW proponent than in being an AGW skeptic.

      --
      The truth is that all men having power ought to be mistrusted. James Madison
  88. Re:Good and bad by kandela · · Score: 1

    Two things:

    1. Great, I want all the data on how to make an atom bomb provided in a neat easy to use format.

    2. I accept that the viewer is ultimately responsible for their own due diligence, but why am I paying for a newspaper if the media aren't being held to reasonable standards of diligence?

    --
    Conservation of angular momentum makes the world go round.
  89. How come this isn't the case for open source? by Anonymous Coward · · Score: 0

    How come this isn't the case for open source? And there's an awful lot more data than source code, and that data needs some serious maths to work with, therefore the number of people who CAN look at the data is a far, far, FAR smaller section of the people who have the data available than have the Linux source code.

    Yet the less useful data is a great potential for ALL OF US, yet source code is worthless.

    Yeah, right.

  90. Re:Good and bad by 16K+Ram+Pack · · Score: 1

    Except in the case of MMR/autism, it wasn't just some crank that started things using data. It was a doctor writing a peer-reviewed paper for The Lancet. It was actually the credentials of a real doctor that lent the sceptics case so much weight.

  91. Not to interrupt some lovely rants by overshoot · · Score: 1
    but part of the problem has been that people are requesting data from those who don't have the legal right to redistribute it. In at least one instance, the publishing authors were using data under NDA from another source. The requester wanted the data set as compiled from several sources and the responders pointed the requestors to original sources, including those who did have rights.

    Much drama ensued.

    --
    Lacking <sarcasm> tags, /. substitutes moderation as "Troll."
  92. Re:Good and bad by shaitand · · Score: 1

    "1. Great, I want all the data on how to make an atom bomb provided in a neat easy to use format. "

    Your point being? That comment doesn't relate to anything I said.

    "2. I accept that the viewer is ultimately responsible for their own due diligence, but why am I paying for a newspaper if the media aren't being held to reasonable standards of diligence? "

    I don't know. Do you read The Weekly World News? Deciding which news sources have such standards is part of the due dilligence of the consumer.

  93. The QUB's argument for denying him a copy is ... by rgviza · · Score: 1

    the time it would take to gather the data and put it on disk for him. The question I have is if this data has been used in climate models, it's already gathered and consolidated. Why not just run a copy of the data for him, which was used to do the climate models? That couldn't take longer than an hour to put on a dvd, since it's already sitting on a hard drive somewhere. It has to be or they couldn't have used it for the climate modeling.

    I smell a rat... Why don't they want anyone from the outside to verify their conclusions? It would only validate their findings and conclusions if someone else were to run the numbers. Why would they not want anyone else to have the data? It's in their best interest, if they are honest, to give out as many copies as they can. In fact they should have a zip file sitting on their web server for anyone and everyone to download it so they can get back to their research and not be bothered by data requests.

    --
    Don't kid yourself. It's the size of the regexp AND how you use it that counts.
  94. Re:Good and bad by Anonymous Coward · · Score: 0

    Data is not a set of instructions, so your request makes no sense.

  95. Re:Good and bad by interkin3tic · · Score: 1

    So, we're supposed to believe that your selection of 10 supporting cases out of your sample of 3000 is correct just because you're an "expert"?

    More or less. I was the one capturing the data. If I rejected the other 2990, I probably had a reason, one that may not be obvious to someone not manning the microscope or someone who has not stared at millions of those cells for hours at a time.

    If I were to publish a paper saying "These 10 cells out of 3000 are the only ones that matter" then I'd have to support that claim. If, on the other hand, I'm publishing a paper that doesn't explicitly claim that those 10 out of 3000 cells are the only ones relevant, then I'm not going to explicitly show why those other 2990 cells aren't relevant. The data is incomplete without supporting figures showing what was obvious to me.

    To make it a little more concrete, say I'm saying "Cells condense their DNA before dividing" and looked at cancer cells in a dish (all vertebrate cells I'm famililar with do in fact condense their DNA before dividing). I look at the 3000 cells and see that only 100 are dividing, based on what the cells look like. I'm not going to prove that the other 2900 are not dividing, it should be obvious to anyone familiar with that cell line. Of the 100 that are dividing, lets say 50 aren't marked properly with the signal I'm looking at, could be a DNA marker, so they get thrown out. Of the 50 that are dividing and are marked properly, maybe 30 are on top of one another preventing good analysis, and 10 of the ones remaining are cells that look unhealthy, like they're dying. That leaves 10 cells out of 3000.

    If you're not a cell biologist, most of that wouldn't be obvious to you, some of those criteria wouldn't even be obvious to you unless you were a cell biologist familiar with that specific cell line.

    To be convinced that the rejections were appropriate, either you could become an expert in that cell line, which I find unlikely, or I could write up 5 or 6 supporting figures demonstrating that non-dividing cells looked like this etc. That's a lot of wasted time that I'd rather be wasting on slashdot.

  96. You don't even know what it means by Nicolas+MONNET · · Score: 1

    Just like the author denialists, you're just parroting talking points. Of course you don't care what national academies of sciences think, you only care what your masters think. Atta boy.

    Any way, I was responding to the assertion that "Climategate" has proved anything. It hasn't.

    1. Re:You don't even know what it means by Just+Some+Guy · · Score: 1

      I'm not parroting anything, and I'm not a denier. My point is that valid results are still valid even when someone we dislike achieves them. Again, if the oil companies (or Greenpeace or Al Gore or PETA) publish independently verifiable results, then their work stands.

      As applies here, if the national academy publishes their data and methodology so that third parties can examine them - even third parties who disagree with their conclusions - and their results hold, then it lends credibility to their hypothesis. Hiding it with the excuse that they don't want people taking it out of context does not lend credibility.

      And again, I'm not a denier. While I'm not entirely convinced by AGW proponents, the preponderance of evidence makes it seem pretty darn likely. I'm very interested in how this episode plays out, though.

      --
      Dewey, what part of this looks like authorities should be involved?
  97. Think harder! by Roger+W+Moore · · Score: 1

    We expect you folks to spend some time thinking up a way so that you don't spend any time at all on "preparing" the supposedly "raw" data _and_ still make it available to the desirous public. Like you know putting up a file on a website with some footnotes. I hear universities have some websites.

    Congratulations - you have identified two media by which the information could be disseminated. Now, would you like to explain how it is possible to write the documentation, in a form that the general public can understand, without taking any time? ...and before you claim that we scientists should just find a way please remember that we are scientists, not magicians, and so are limited by physical reality. I could also point out that time spent designing any such system would also be time NOT spent doing science which is what we are actually getting paid to do.

  98. Beyond School Physics by Roger+W+Moore · · Score: 1

    "This is motion tracking data, format is X Y Z, its from ..."

    You seem to have the impression that science is just like the physics you may have done at school. The experiment I work on has 1MB data per event using a complex, compressed data format and even then you need access to a parameter database to know the configuration of the detector given any particular data run. A couple of comments at the top of the file, or even a document describing the format is not enough. You have to understand the detector and have access to the database. It would require CONSIDERABLE effort to make this even accessible to the general public....but I tell you what if you have the tens of petabytes of data storage needed and the budget to fund the required development we'll give it a try.