NSF Audit Finds Numerous Cases of Alleged Plagiarism
sciencehabit writes "The National Science Foundation (NSF) is investigating nearly 100 cases of suspected plagiarism drawn from a single year's worth of proposals funded by the agency. The cases grow out of an internal examination by NSF's Office of Inspector General (IG) of every proposal that NSF funded in fiscal year 2011. James Kroll, head of administrative investigations within the IG's office, tells ScienceInsider that applying plagiarism software to NSF's entire portfolio of some 8000 awards made that year resulted in a 'hit rate' of 1% to 1.5%. 'My group is now swamped,' he says about his staff of six investigators."
You know, the summary and the article look very familiar. I wonder if they copied anything or if they wrote it all themselves?
And let's not forget: this is our tax dollars at work.
What a shame.
There is a difference.
Give me Classic Slashdot or give me death!
When researcher's lives are ruled by arbitrary metrics on volumes of papers published, people will cheat. It's certainly true in computing. Try picking a few papers from the ACM Digital Library and start following the references. Rehash after rehash of other people's papers. Particularly their own...
It isn't really a scandal until the cases of plagiarism are confirmed. I once tested some plagiarism software on published academic economics, and it produced many false positives, many of which required some knowledge to interpret. Notice that a grant application may seem to be a somewhat "safer" place to plagiarize, since only a few people will see the application. However, those few might well include the borrowed from author - the granting agency will be sending the proposal for review to many researchers who have written on the topic before..
Are they just using a web service such as turnitin.com? I've used that for classroom assignments, and it has a rather high rate of false positives - even when factoring out direct quotes that students love to use to much to fill space.
i don't know karate, but i know ca-razy
How much self plagiarism and how much false positives?
I wonder what the plagiarism rate was in applications that did not receive funding.
I'd love to help any way I can.
Scientists have been proven to be human beings after all. They can lie and cheat like the rest of us. So that begs the question. Why should we all believe what they tell us at face value rather than using our deductive reasoning to figure out whether what they say is plausible? Why shouldn't we ask for hard evidence before accepting their conclusions at face value? If they are just as fallible as anyone else then why should we believe what they say rather than judging whether what they are saying makes any sense?
Jesus was a compassionate social conservative who called individuals to sin no more.
Scientists have no rational basis to distinguish between "ethically right" and "that which benefits the organism". Thus conventionally immoral things like lying, cheating, stealing, are not a moral concern for a scientist (aside from arbitrary preference). Why shouldn't they plagiarize?
The cases grow out of an internal examination by NSF's Office of Inspector General (IG) of every proposal that NSF funded in fiscal year 2011
It seems to me they are running the tool against things that are already funded. Wouldn't it make more sense to run the tool when recieving any proposal and then pass on the results to whoever is deciding if a proposal should be funded?
And might not grant proposal writers be purposefully including snippets of text or stylistic flourishes or word-usage choices characteristic of those high-level academicians whom they expect to be reviewing the grant proposal?? If they do, the reviewer might see that the proposer has some genius in them, since they are obviously on the correct trail and path!!! If they used techniques or buzzwords that are not "au courant" or standard canon fodder [joke, joke, pun intended], then they'd be seen as idiots. This "partial plagiarizing" may be seen as paying "homage" or "paying tribute" to the "gods of the academy" so that these lesser beings may enter the "pantheon of the greats!" [can you tell that I've been reading up on the french pantheon in my french classes?]
B would imply that so many allegations of plagiarism was found, rather than so many instances of possible plagiarism. I think alleged is fast becoming the most misused word in American, right next to "begs the question".
sed -e 's/Chuck Norris/Rajnikant/g' joke > fact
A plagiarism hit rate of only 1 to 1.5 percent is not that high, considering that many research grants are based upon the same core studies, use similar methods (e.g. "We will use a mass spectrometer with 8 plates of xxx"), and refer to prior studies in much the same way.
You call it plagiarism. I call it a good reason to retest your plagiarism software.
A more serious problem is duplication of human subjects in study designs. Many people with rare or recessive genetic problems like to volunteer for research studies, and may show up in samples from different labs, yet be the same person. A good study design accounts for that and validates that subjects (other than twins) are not in fact the same person represented in multiple studies. Same goes for any study with limited population sets that have restrictive conditions, such as longer lived animals.
Not as much of a problem with short-lived animals or cultures, as they tend not to be reused.
-- Tigger warning: This post may contain tiggers! --
Also, on further consideration, one of the problems with scientific research recently, is the lack of "duplicative" studies.
Seeing results from only one lab of a scientific hypothesis only proves that it deserves further study. To study it, and "prove" it, you need to replicate (duplicate) the study.
We should, in fact, see MORE studies with similar wording and language, in that they should have more than one study test the hypothesis. A study of the same condition should have a high "plagiarism" rate, since in fact it is a DUPLICATE of the original study, and will quote the same objectives, the same core research papers it is studying, and so on.
The LACK of such duplication implies that the science is weak. For examples of weak science, look at the S Korean and Chinese research into cloning. We only had single studies, and they were faked. You need independent labs with different scientists studying something to show that it is in fact Science.
-- Tigger warning: This post may contain tiggers! --
Let's hope that such detection techniques can soon be applied to Patent Office applications too :-)
They need to crowdsource this work. I'm sure many people would volunteer.
It's hard to tell from the summary or article what is going on here. I suspect a decent fraction of these may be people submitting proposals under different programs for similar or overlapping projects. Sometimes a scientific project will kind of fall between programs and people will submit more-or-less the same proposal to two different parts of the NSF, hoping for funding from one. Given the low funding rate and the great deal of uncertainty about funding (thanks, oh-so-functional Congress!) it is pretty common for people to submit to multiple programs or to have several co-Principal Investigators, each with a component of a larger-scale project. And people definitely recycle their earlier proposals, funded or not. There are also often required sections on ``broader impact'' that are important in many fields that may not have much in common with the specific proposal and may be copied from other proposals. To me, there is a huge difference between ``self-plagarism'' or duplication, between recycling a broader impact statement from a colleague, and between outright plagiarism, unknown to the person who is being copied, with genuine scientific theft of ideas. From what is described, it sounds like they aren't yet distinguishing between these cases.
There was a good description on the arXiv about good plagiarism detection methods and tuning parameters for efficient detection of duplication and plagiarism, applied to a good part of the body of arXiv submissions. That algorithm is run now, which is why you see those ``article has significant text overlap'' messages, detailed here.
It's psychosomatic. You need a lobotomy. I'll get a saw.
In heavily technical fields you just need to have a description of your topic. Normally, this one should not change between papers. But today, due to "autoplagiarism" rules, one is compelled to spend lots of time and imagination to find new ways of presenting the same old thing. The bad guys that plagiarized (often by reinventing the wheel while knowing it exists) are not really affected by the new rules, because they just continue to do their stuff. The ones who get hurt are the honest researchers (fewer and fewer).
Try this: Have 100 people write a 4 page book report about a novel they all read. Then ask them to describe one single chapter. Then narrow it down to one scene in one chapter, described using in 4 pages. Now submit these to a plagiarism detector. You will have a lot.
Every time a grant is written the same core research is typically cited. How many ways can you describe the same results without triggering a 1%, 10%, 20% match with the last grant proposal / paper, etc you wrote? We struggle with this every time we submit papers, let alone grant proposals. It's a necessary evil, but the tools get so strict that using the same symbols in equations can trigger a 15% match on a theory-heavy paper. A match like that can significantly bias an editor / reviewer into thinking the work is derivative (no pun intended).