Slashdot Mirror


Study of Massive Preprint Archive Hints At the Geography of Plagiarism

sciencehabit writes with this excerpt from Science Insider: New analyses of the hundreds of thousands of technical manuscripts submitted to arXiv, the repository of digital preprint articles, are offering some intriguing insights into the consequences — and geography — of scientific plagiarism. It appears that copying text from other papers is more common in some nations than others, but the outcome is generally the same for authors who copy extensively: Their papers don't get cited much. The system attempts to rule out certain kinds of innocent copying: "It's a fairly sophisticated machine learning logistic classifier," says arXiv founder Paul Ginsparg, a physicist at Cornell University. "It has special ways of detecting block quotes, italicized text, text in quotation marks, as well statements of mathematical theorems, to avoid false positives."

3 of 53 comments (clear)

  1. Re:who cares about plagiarism by PvtVoid · · Score: 4, Insightful

    Why does anyone need 'credit' for ideas?

    Because it allows funding agencies, university tenure committees, etc. to determine which people are contributing useful new science to the world, and which people are dead wood sucking at the teat of an academic salary without creating anything useful to anybody.

  2. I have studied the issue extensively by Anonymous Coward · · Score: 4, Funny

    And, I have found that copying text from other papers is more common in some nations than others, but the outcome is generally the same for authors who copy extensively: Their papers don't get cited much.

  3. Some countries' education systems reward parroting by hey! · · Score: 5, Interesting

    Some countries place a high premium on memorizing and repeating back the teacher's words. These countries still produce their share of good and bad engineers, but they're sometimes bad in unrecognizable ways.

    I once hired a software engineer from a third world country who had an encyclopedic knowledge of design patterns. You could name any pattern in the GoF *Design Patterns* book and he could reel off the UML without hesitation and give a convincing sounding explanation of how the pattern worked. But when I started inspecting his code, I quickly realized he had no understanding of what any of it meant. It was just pictures and words he'd memorized, an impressive and prodigious feat, but ultimately useless to me.

    Now I should say I've hired some very good software engineers from this country; it's not that they don't make good engineers over there. For most people the discipline to absorb a lot of information yields many benefits. But this guy was an outlier; he managed to get a master's degree over there in a subject he had no practical understanding of whatsoever.

    --
    Post may contain irony: discontinue use if experiencing mood swings, nausea or elevated blood pressure.