Hoax-Detecting Software Spots Fake Papers

← Back to Stories (view on slashdot.org)

Hoax-Detecting Software Spots Fake Papers

Posted by Soulskill on Friday March 27, 2015 @09:07AM from the solving-the-problems-we-created-for-ourselves dept.

sciencehabit writes: In 2005, three computer science Ph.D. students at the Massachusetts Institute of Technology created a program to generate nonsensical computer science research papers. The goal was "to expose the lack of peer review at low-quality conferences that essentially scam researchers with publication and conference fees." The program — dubbed SCIgen — soon found users across the globe, and before long its automatically generated creations were being accepted by scientific conferences and published in purportedly peer-reviewed journals. But SCIgen may have finally met its match. Academic publisher Springer this week is releasing SciDetect, an open-source program to automatically detect automatically generated papers. SCIgen uses a "context-free grammar" to create word salad that looks like reasonable text from a distance but is easily spotted as nonsense by a human reader.

7 of 61 comments (clear)

Min score:

Reason:

Sort:

Got it... by Anonymous Coward · 2015-03-27 09:15 · Score: 4, Funny

Software detecting papers written by software -- in the dark.
Chicken chicken Chicken? by Irate+Engineer · 2015-03-27 09:22 · Score: 3, Funny

Chicken chicken, (chicken) chicken?
https://www.improbable.com/airchives/paperair/volume12/v12i5/chicken-12-5.pdf

--
Left MS Windows for Linux Mint and never looked back!
Vote for Bernie in 2016!
Evil tech? by Anonymous Coward · 2015-03-27 09:26 · Score: 5, Interesting

The purpose of the scam papers was to expose scam journals.
The purpose of this new software seems to be to all scam journals to continue scammng.
So it's an evil software, that should not have been developed, right?
I mean, if you were doing actual peer review, none of this would pass even a half-sentient peer's inpection.
Re:Results? by I'm+not+god+any+more · 2015-03-27 09:30 · Score: 5, Funny

1. The first thing SCIgen should do is to incorporate SciDetect, to make sure that their random papers pass the SciDetect test.
2. SCIDetect should then improve their algorithms, and SCIgen should again take a snapshot of SciDetect source code and incorporate it.
3. Run this loop a few times and what we'll have is some serious papers
4. Profit!!!
It is too much trouble to fix the problem by Attila+Dimedici · 2015-03-27 09:31 · Score: 4, Interesting

Springer reveals that they are not interested in fixing the problem revealed by SCIgen, they just want to prevent that software from demonstrating that they have not fixed it. They aren't going to change the review process to ensure that they no longer publish papers which are nonsense. No, they developed software to eliminate those papers which were generated by other software.

--
The truth is that all men having power ought to be mistrusted. James Madison
Interesting Response by Roger+W+Moore · 2015-03-27 09:34 · Score: 4, Insightful

arXiv is not peer reviewed. What I found interesting though was the response of the publisher: write a program to detect fake papers. Even the most simplistic peer review - i.e. reading the paper - would immediately catch these papers. If they need to write a program to catch fake papers then their peer review model is essentially worthless and frankly a journal that poor is no better, and liekly worse, than arXiv: at least arXiv doesn't pretend to have peer review.
Re:Results? by phantomfive · 2015-03-27 09:40 · Score: 3, Interesting

Of all the problems you might find at arXiv, I don't think "auto-generated papers going undetected" is one of their problems.

ArXiv's problem is recognizing when human-written, realistic sounding papers are actually BS.

--
"First they came for the slanderers and i said nothing."