NSF Audit Finds Numerous Cases of Alleged Plagiarism
sciencehabit writes "The National Science Foundation (NSF) is investigating nearly 100 cases of suspected plagiarism drawn from a single year's worth of proposals funded by the agency. The cases grow out of an internal examination by NSF's Office of Inspector General (IG) of every proposal that NSF funded in fiscal year 2011. James Kroll, head of administrative investigations within the IG's office, tells ScienceInsider that applying plagiarism software to NSF's entire portfolio of some 8000 awards made that year resulted in a 'hit rate' of 1% to 1.5%. 'My group is now swamped,' he says about his staff of six investigators."
You know, the summary and the article look very familiar. I wonder if they copied anything or if they wrote it all themselves?
And let's not forget: this is our tax dollars at work.
What a shame.
There is a difference.
Give me Classic Slashdot or give me death!
When researcher's lives are ruled by arbitrary metrics on volumes of papers published, people will cheat. It's certainly true in computing. Try picking a few papers from the ACM Digital Library and start following the references. Rehash after rehash of other people's papers. Particularly their own...
It isn't really a scandal until the cases of plagiarism are confirmed. I once tested some plagiarism software on published academic economics, and it produced many false positives, many of which required some knowledge to interpret. Notice that a grant application may seem to be a somewhat "safer" place to plagiarize, since only a few people will see the application. However, those few might well include the borrowed from author - the granting agency will be sending the proposal for review to many researchers who have written on the topic before..
Are they just using a web service such as turnitin.com? I've used that for classroom assignments, and it has a rather high rate of false positives - even when factoring out direct quotes that students love to use to much to fill space.
i don't know karate, but i know ca-razy
How much self plagiarism and how much false positives?
I wonder what the plagiarism rate was in applications that did not receive funding.
I'd love to help any way I can.
Scientists have no rational basis to distinguish between "ethically right" and "that which benefits the organism". Thus conventionally immoral things like lying, cheating, stealing, are not a moral concern for a scientist (aside from arbitrary preference). Why shouldn't they plagiarize?
That's why you read the papers.
Science is NOT 'believing what they tell us at face value'. Science IS asking (and getting) evidence before accepting their conclusions (the concept of repeatability). Yes, scientists are annoyingly human and therefore bedeviled with the same positive and negative attributes as the rest of humanity.
I'm not entirely sure what your point is. You seem to be confusing journalists with scientists. There is an unfortunate tendency for the press to dig around an abstract, find some enticing sentence and tart it up well past the point where the authors would even admit to writing about it - then have it repeated in the Internet echo chamber until it's a cure for cancer, scabies and annoying foot perspiration. But that isn't what science is.
Go to a conference about something and listen to the question and answer period. That's science.
Faster! Faster! Faster would be better!
The cases grow out of an internal examination by NSF's Office of Inspector General (IG) of every proposal that NSF funded in fiscal year 2011
It seems to me they are running the tool against things that are already funded. Wouldn't it make more sense to run the tool when recieving any proposal and then pass on the results to whoever is deciding if a proposal should be funded?
Scientists have been proven to be human beings after all. They can lie and cheat like the rest of us. So that begs the question. Why should we all believe what they tell us at face value rather than using our deductive reasoning to figure out whether what they say is plausible? Why shouldn't we ask for hard evidence before accepting their conclusions at face value? If they are just as fallible as anyone else then why should we believe what they say rather than judging whether what they are saying makes any sense?
Well, you don't need to take what they say at face value(if you did, we could skip the tedious 'experiments' and 'peer review' and 'writing scientific papers' and 'data' and other boring stuff). More generally, the reason we don't generally use deductive reasoning is that it's somewhere between cumbersome and useless outside of toy examples written for deductive logic exercises. Induction is kind of lame by comparison; but it has the advantage of actually providing us with information about the world...
And might not grant proposal writers be purposefully including snippets of text or stylistic flourishes or word-usage choices characteristic of those high-level academicians whom they expect to be reviewing the grant proposal?? If they do, the reviewer might see that the proposer has some genius in them, since they are obviously on the correct trail and path!!! If they used techniques or buzzwords that are not "au courant" or standard canon fodder [joke, joke, pun intended], then they'd be seen as idiots. This "partial plagiarizing" may be seen as paying "homage" or "paying tribute" to the "gods of the academy" so that these lesser beings may enter the "pantheon of the greats!" [can you tell that I've been reading up on the french pantheon in my french classes?]
Ignoring the obvious...I point to this:
Why should we all believe what they tell us at face value rather than using our deductive reasoning to figure out whether what they say is plausible?
How is that any different from what scientists do? Apparently it was never turtles, but scientists all the way down.
Short answer: You shouldn't
Long answer: You really shouldn't listen to them when it comes to areas outside their limited domain expertise. Some climatologist (to pick a current example) might tell you something about pollution or climate change, but you shouldn't listen to him on his policy advice. He is not an expert in economics, social studies, government, bureaucracy, history...
The reality:
The public perception of *truth* remains as elusive as ever despite our ever increasing access to information. Ultimately, all knowledge relies upon trust of the person/institution delivering it. As institutions are made of people, it ultimately rests with people.
B would imply that so many allegations of plagiarism was found, rather than so many instances of possible plagiarism. I think alleged is fast becoming the most misused word in American, right next to "begs the question".
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
A plagiarism hit rate of only 1 to 1.5 percent is not that high, considering that many research grants are based upon the same core studies, use similar methods (e.g. "We will use a mass spectrometer with 8 plates of xxx"), and refer to prior studies in much the same way.
You call it plagiarism. I call it a good reason to retest your plagiarism software.
A more serious problem is duplication of human subjects in study designs. Many people with rare or recessive genetic problems like to volunteer for research studies, and may show up in samples from different labs, yet be the same person. A good study design accounts for that and validates that subjects (other than twins) are not in fact the same person represented in multiple studies. Same goes for any study with limited population sets that have restrictive conditions, such as longer lived animals.
Not as much of a problem with short-lived animals or cultures, as they tend not to be reused.
-- Tigger warning: This post may contain tiggers! --
Also, on further consideration, one of the problems with scientific research recently, is the lack of "duplicative" studies.
Seeing results from only one lab of a scientific hypothesis only proves that it deserves further study. To study it, and "prove" it, you need to replicate (duplicate) the study.
We should, in fact, see MORE studies with similar wording and language, in that they should have more than one study test the hypothesis. A study of the same condition should have a high "plagiarism" rate, since in fact it is a DUPLICATE of the original study, and will quote the same objectives, the same core research papers it is studying, and so on.
The LACK of such duplication implies that the science is weak. For examples of weak science, look at the S Korean and Chinese research into cloning. We only had single studies, and they were faked. You need independent labs with different scientists studying something to show that it is in fact Science.
-- Tigger warning: This post may contain tiggers! --
So that begs the question.
*twitch*
Why shouldn't we ask for hard evidence before accepting their conclusions at face value?
Go ahead and do so, if you want - no-one's stopping you. You could even go to college, get that degree, spend 10 years out in the field and do everything else the first guy did to check that it's all true. Personally I'm happy to accept the existence of electrons and quarks on trust at this point.
If they are just as fallible as anyone else then why should we believe what they say rather than [confirming] whether what they are saying [survives further scientific scrutiny]
That's what all the other scientists in the world are for.
systemd is Roko's Basilisk.
Individual scientists are often wrong. This is occasionally due to malfeasance, but more often it is just the state of the art. The truth is not always completely self evident and often multiple theories compete.
Science, as a human endeavor, makes progress because results are always being verified by multiple parties. Funding proposals and peer reviewed publication are only a part of the progress. The key operation is reproducible results. Other people use existing results as part of their own work. If the outcome differs from previously known work, it will be reported. This is normal and expected. Eventually there is a consensus and and the scientific community moves forward.
Bad behavior by individuals slows down the process, but it does not derail it. Resources and time are wasted, but as long as the scientific method is employed, the results are trustworthy.
Your positions is one sometimes taken by jealous members of the social sciences and humanities, particularity in academia. (This is one of my problems with Michel Foucault and postmodernism/structuralism.) They see the respect, status and money going to technical fields, so they try and reframe science as having no more validity then any other intellectual area. The existence of modern society proves this wrong, but since they reject the scientific paradigm they seem to have no trouble ignoring external facts. (Actually Foucault has a lot of value when his work is applied to culture and society, so I should not be quite so harsh.)
That's the long answer. The short answer is that you're a troll, and appear to short in mental stature and emotional maturity.
Why is Snark Required?
Let's hope that such detection techniques can soon be applied to Patent Office applications too :-)
They need to crowdsource this work. I'm sure many people would volunteer.
It's hard to tell from the summary or article what is going on here. I suspect a decent fraction of these may be people submitting proposals under different programs for similar or overlapping projects. Sometimes a scientific project will kind of fall between programs and people will submit more-or-less the same proposal to two different parts of the NSF, hoping for funding from one. Given the low funding rate and the great deal of uncertainty about funding (thanks, oh-so-functional Congress!) it is pretty common for people to submit to multiple programs or to have several co-Principal Investigators, each with a component of a larger-scale project. And people definitely recycle their earlier proposals, funded or not. There are also often required sections on ``broader impact'' that are important in many fields that may not have much in common with the specific proposal and may be copied from other proposals. To me, there is a huge difference between ``self-plagarism'' or duplication, between recycling a broader impact statement from a colleague, and between outright plagiarism, unknown to the person who is being copied, with genuine scientific theft of ideas. From what is described, it sounds like they aren't yet distinguishing between these cases.
There was a good description on the arXiv about good plagiarism detection methods and tuning parameters for efficient detection of duplication and plagiarism, applied to a good part of the body of arXiv submissions. That algorithm is run now, which is why you see those ``article has significant text overlap'' messages, detailed here.
It's psychosomatic. You need a lobotomy. I'll get a saw.
In heavily technical fields you just need to have a description of your topic. Normally, this one should not change between papers. But today, due to "autoplagiarism" rules, one is compelled to spend lots of time and imagination to find new ways of presenting the same old thing. The bad guys that plagiarized (often by reinventing the wheel while knowing it exists) are not really affected by the new rules, because they just continue to do their stuff. The ones who get hurt are the honest researchers (fewer and fewer).
Try this: Have 100 people write a 4 page book report about a novel they all read. Then ask them to describe one single chapter. Then narrow it down to one scene in one chapter, described using in 4 pages. Now submit these to a plagiarism detector. You will have a lot.
Every time a grant is written the same core research is typically cited. How many ways can you describe the same results without triggering a 1%, 10%, 20% match with the last grant proposal / paper, etc you wrote? We struggle with this every time we submit papers, let alone grant proposals. It's a necessary evil, but the tools get so strict that using the same symbols in equations can trigger a 15% match on a theory-heavy paper. A match like that can significantly bias an editor / reviewer into thinking the work is derivative (no pun intended).