Slashdot Mirror


Fake Scientific Paper Detector

moon_monkey writes "Ever wondered whether a scientific paper was actually written by a robot? A new program developed by researchers at Indiana University promises to tell you one way or the other. It was actually developed in response to a prank by MIT researchers who generated a paper from random bits of text and got it accepted for a conference."

277 comments

  1. Yes! by stupidfoo · · Score: 4, Funny

    I am always wondering what those damn robots are up to!

    1. Re:Yes! by Anonymous Coward · · Score: 1, Interesting

      ...or we could have a human just read the damn thing.

      Novel idea.

    2. Re:Yes! by lixee · · Score: 1

      I for one, welcome our paper writing robot overlords.

      --
      Res publica non dominetur
    3. Re:Yes! by martinultima · · Score: 1

      You know, I thought about that... but it just makes too much sense for a bunch of half-insane hackers like us!

      --
      Creative misinterpretation is your friend.
    4. Re:Yes! by Krakhan · · Score: 2, Funny

      ROBOT HOUSE!!!

    5. Re:Yes! by PastAustin · · Score: 0
      I am always wondering what those damn robots are up to!


      You must be speaking of the slashdot community.. I ran the first page of comments (minus html and white space and other things (score, etc) through the computer writing detector and got this:

      This text had been classified as INAUTHENTIC with a 26.0% chance of being authentic text

      We're all robots.
      --
      Firefox 2.0 - Spell Rightly.
    6. Re:Yes! by jguthrie · · Score: 1

      I submitted a half-dozen of the documents I've written over the last few years (mostly business reports of one sort or another) and never got a score higher than 35% for any whole document.

    7. Re:Yes! by Schemat1c · · Score: 2, Funny

      I am always wondering what those damn robots are up to!

      They use old people's medicine for fuel.

      --

      "Nobody knows the age of the human race, but everybody agrees that it is old enough to know better." - Unknown
    8. Re:Yes! by ArieKremen · · Score: 1

      Same here!!!
      I ran my PhD thesis, papers, and other technical/scientific documents I prepared, and all came back with less than 40% authenticity, i.e., 60% chance that a robot wrote them.
      I had people telling me I am insensitive, but now I have it black on white and with references!!!

      --
      -- Cave quid dicis, quando, et cui
    9. Re:Yes! by portforward · · Score: 1

      I turned in a rough draft of a final paper for a class and it came back with a 30% chance that the paper was authentic. I thought, "oh no, what if my professor uses this tool and thinks that I did something bad". Then I remembered I had the text from my great, great, great grandfather's obituary written in the late 1800's, and it came back with a score of 26%. I don't think that the tool is very acurate for non-technical papers.

    10. Re:Yes! by Ruff_ilb · · Score: 2, Interesting

      They did; the board that accepted the MIT paper, not consisting of specialists in the field, was likely confused by the pseudo-scientific gibberish they encountered. By mastering the methodology for the typical unification of access points and redundancy, the MIT students were able to effectively enter the scientific conference.

      --
      http://www.TheGamerNation.com/Forums
    11. Re:Yes! by SEWilco · · Score: 1
      I submitted a half-dozen of the documents I've written over the last few years (mostly business reports of one sort or another) and never got a score higher than 35% for any whole document.

      WOULD.. YOU.. LIKE.. TO.. REWRITE.. A.. DOCUMENT?

    12. Re:Yes! by Josh+Hiles · · Score: 1

      Wait so basically we've programmed a robot to detect deceptive use of robots. Anyone else here the Twilight Zone theme? Really though, can the Turing Police be far behind?

    13. Re:Yes! by MarkChovain · · Score: 2, Funny

      I for one, welcome our paper writing robot overlords.

      I for one am a paper writing robot overlord, you insensitive clod! I for one welcome our new video game consoles. They are called "hands". Shouldn't it be something like this will ever happen then you will see that they bring things out in managable increments. Sure it is a biggish program, but many lone hackers have written one in under one person/year.

    14. Re:Yes! by Ariane+6 · · Score: 1

      My LPSC abstract came back at 45.6%. It's quite technical. Weird.

    15. Re:Yes! by Anonymous Coward · · Score: 0

      That's actually not true. The papers by MIT where submitted to non-reviewed "conferences" of a kind that are gennerally academic junkets and ways to get things "published" that would not normally pass muster.

    16. Re:Yes! by why_it · · Score: 1

      I am not sure how to start a new thread in this discussion, so I will just reply to the first post. I am one of the people that designed the program and would like to thank everyone for their comments. They prove to be a little more honest, if not more insightful, than the ones we received when having our paper pier reviewed. I would like to address some of the comments made on the web-site. First off the web page is not designed to detect new, poetry, or blogs, although it might incorrectly classify text from these sources as authentic academic scientific papers. It is also not designed to detect authentic text based on abstracts or parts of papers. The web-site is not perfect, and we realize this. Secondly I would like to address those of you who have discovered that repeating parts of inauthentic papers, or simply repeating parts of a paper many times can break the web-site. This is something that we realized a few months ago, but never did anything to fix. We never imagined that we would receive so much attention because of the web-page. This was more of a side project for all of us. We have created a remedy to this problem, and if anyone detects that the problem still exists please let us know. I would also like to apologize to anyone who has had their own personal works classified as inauthentic, weather it be because of the faultiness of our predictor or...because of other reasons

    17. Re:Yes! by why_it · · Score: 1

      let me rephrase a bit of what I said earlier. everyone comments prove to be very insightful, honest and sometimes funny, having our paper peer reviewed (by actual humans by the way) proved to be very insightful also. I do not want to cut short the input that we have received from submitting our paper to a credible journal ran by actual peers in our field

    18. Re:Yes! by Anonymous Coward · · Score: 0

      I was testing the bot and seems to work

      I just enter the article from here: http://news.com.com/2100-1028_3-6064016.html
      "Congress readies broad new digital copyright bill"

      and it say
      "This text had been classified as
      INAUTHENTIC
      with a 33.9% chance of being authentic text"

      seems to be very accurate.

  2. That's good and all by XxtraLarGe · · Score: 4, Funny

    but I wonder if it can tell if a paper was written by a million monkeys pounding on typewriters?

    --
    Taking guns away from the 99% gives the 1% 100% of the power.
    1. Re:That's good and all by denverradiosucks · · Score: 4, Funny

      Obligatory Simpson's Quote

      Monkey's typing on a typewriter as Mr. Burn's is working on the next great american novel:

      Burns: This is a thousand monkeys working at a thousand typewriters. Soon they'll have written the greatest novel known to man.
      (monkey smoking cigar typing on a typewriter)
      Burns: Lets see. It was the best of times, it was the BLURST of times! You stupid monkey! (Smacks monkey upside his head)

    2. Re:That's good and all by visgoth · · Score: 4, Informative

      Oh, I'm sure the work of monkeys is quite easily identifiable.

      --
      My patience is infinite, my time is not.
    3. Re:That's good and all by Rakshasa+Taisab · · Score: 2, Funny

      I kinda enjoy getting mod points, it would be sad if they replaced that feature.

      --
      - These characters were randomly selected.
    4. Re:That's good and all by Anonymous Coward · · Score: 0

      I just hope it doesn't work on thesis...

    5. Re:That's good and all by Anonymous Coward · · Score: 0

      Point of order: He crumpled up the paper and threw it at the monkey; he didn't smack it.

    6. Re:That's good and all by drinkypoo · · Score: 1

      There's already a program that determines the likelihood that two articles are written by the same author. All that is needed is to combine it with a http query to slashdot...

      --
      "You're right," Fisheye says. "I should have set it on 'whip' or 'chop.'"
    7. Re:That's good and all by Exocrist · · Score: 1

      Hehe that was pretty good.

      The problem with the "infinite monkeys and time" situation is that it assumes that each key is as likely to be pressed as any other, and it's all random. As we can see, "S" was greatly favored.

    8. Re:That's good and all by iNetRunner · · Score: 3, Funny

      Seems like it would be easier to develop a program that automatically detects /. dupes.. but no.

      *At least the million /. pounding monkeys detect it..*

      --
      Store with salt
    9. Re:That's good and all by demonbug · · Score: 1

      Yup, it's inauthentic.

    10. Re:That's good and all by Anonymous Coward · · Score: 1, Informative

      The infinite monkeys and time situation(actually, you just need one or the other - or rather, you need infinite "monkeytime", which you get either if you have infinite monkeys, or if you have infinite time and an assurance that the monkey population will not die out) does not depend on a uniform probability distribution for letters. It just assumes that the probability of the desired combination of characters occuring is nonzero. The argument could be made that this is really quite likely given that it is quite thinkable that one of the things a monkey could do with a typewriter is hit keys largely at random, and thus that there is a nonzero probability of it keeping up this behaviour for long enough to produce a combination of the desired length.

      (Once you have the desired probability P (- <0,1] of the combination arising in finite time t, then for all multiples kt, where k is a positive integer, of t the probability of it arising at least once is equal to or greater than 1-(1-P)^k, which approaches 1 as k(representing time) approaches infinity. Thus, the limit probability is 1.)

      (Note, however, that many estimates of how likely it is to produce some given work in finite time do indeed depend on a uniform distribution, even though the underlying thought experiment does not.)

    11. Re:That's good and all by mikiN · · Score: 1

      When I first saw that link I thought you ran the Constitution through the detector.
      I wonder what the verdict would be, though.

      --
      The Hacker's Guide To The Kernel: Don't panic()!
    12. Re:That's good and all by mikiN · · Score: 1

      Hmmm, something got lost in edit. 'This tool' refers to the CS Paper Generator used by the group in their earlier prank.

      --
      The Hacker's Guide To The Kernel: Don't panic()!
    13. Re:That's good and all by duh+P3rf3ss3r · · Score: 1

      I just ran the contents of your post through the BS Paper Detector and I got:

      "Your entry is too short."

      I bet you hear that a lot, though, right? *nudge-nudge wink-wink*

      --
      Give a man a match: warm him for an instant. Douse him in petrol and set him aflame: warm him for the rest of his life.
    14. Re:That's good and all by nihaopaul · · Score: 1
      well i hope it can tell if the paper was written by a million monkeys because i would love to use it on spam, now that is the same as a million monkeys pounding on a typewitter as the following email


      De i ar Home Ow q ne n r ,

      Your cr k edi k t doesn't matter to us ! If you O a WN real e s st c at z e
      and want IM w MED m IA x TE ca n sh to sp i en n d ANY way you like, or simply wish
      to LO v WER your monthly pa s yme v nts by a third or more, here are the de w als
      we have T c OD i AY :

      $ 4 o 88 , 000 at a 3 , g 67% fi f xed - ra o te
      $ 37 e 2 , 000 at a 3 , 9 h 0% v k aria a ble - ra z te
      $ 49 w 2 , 000 at a 3 v , 21% int b ere h st - only
      $ 2 k 48 , 000 at a 3 a , 36% fi n xed - ra u te
      $ 19 k 8 , 000 at a 3 e , 55% vari q able - rat w e

      H g urry, when these d r eaIs are gone, they are gone !

      Don't worry about a n ppro z val, your c n redi c t will not d c isquali u fy you !
    15. Re:That's good and all by 1u3hr · · Score: 1
      The monkeys and Shakespeare meme has an obscure history. This page tracks it back to statistical mechanics, ca. 1913.
      [Babelfish translation]
      Let us conceive that one drew up a million monkeys randomly to be struck the keys of a typewriter and that, under the monitoring of illiterate foremen, these monkeys typists work with heat ten hours per day with a million typewriters varied types. The illiterate foremen would gather the blackened sheets and would connect them in volumes. And at the end of one year, these volumes would be to contain the exact copy of the books of any nature and all languages preserved in the richest libraries of the world. Such is the probability so that it occurs during one very short moment, in a space of some extent, a notable variation of what statistical mechanics regards as the most probable phenomenon
      Émile Borel, ``Mécanique Statistique et Irréversibilité,'' J. Phys. 5e série, vol. 3, 1913, pp.189-196.
      It's interesting that the sense of the idea has been reversed: originally it was meant to show how staggeringly unlikely it was that an ordered situation would emerge by chance. Now it often is used to imply that if you throw enough monkeys/time/computing power at a problem that you can brute force it.

      It's part of geek pop culture now; I noticed a reference in an episode of The Lone Gunmen a couple of years ago.

    16. Re:That's good and all by visgoth · · Score: 1
      Its rather interesting that now its come to mean brute forcing.

      This page does quite a nice job showing that even with 17 billion galaxies filled with 17 billion planets, each with 17 billion monkeys hammering away, the odds are so infinitesimally small that its safe to say its impossible.

      --
      My patience is infinite, my time is not.
    17. Re:That's good and all by oz1cz · · Score: 1
      You know, one of my friends actually managed to get an inifinte number of monkeys together. He gave them an infinite number of typewriters, and they started typing.

      My friend walked around the infinitely large room an looked at what the monkeys were typing.

      Monkey #1: asoici32{ d$$ d7 pp\df8TRTREG
      Monkey #2: ))G YYYn r{ @cfv 9 ds89xc89 j r!vnb

      And so on.

      Eventually he reached monkey #32198734267244672 and his heart leapt:

      Monkey #32198734267244672: To be or not to be, that is the quesPw''P3 i55oJJ Jrewrtw+&3492viudsfkj326 ius9843

    18. Re:That's good and all by Bloke+down+the+pub · · Score: 1

      I thought the monkeys/shakespeare conjecture had already been proved, if you account for genetic drift?

      --
      It's true I tell you, feller at work's next door neighbour read it in the paper.
    19. Re:That's good and all by Bloke+down+the+pub · · Score: 1

      Seems your friend should have checked their CVs more carefully and recruited those with a literature or journalisnm background rather than perl programmers.

      --
      It's true I tell you, feller at work's next door neighbour read it in the paper.
    20. Re:That's good and all by Aspirator · · Score: 1

      Therein lies the problem with the infinite monkeys typing Shakespeare.

      Undoubtedly they will indeed do so, but the script will buried in an
      infinite pile of gibberish and other shorter masterpieces by the time
      it happens.

      Sifting through the output to find the desired product is a much more
      onerous task than producing it in the first place.

  3. Turing test? by Nesetril · · Score: 5, Insightful

    so can a robot write a paper and then decide whether the paper was written by a robot (itself)?

    --
    Jesus said to his disciples: "If you don't have a sword, sell your cloak and buy one" - Luke 22:36
    1. Re:Turing test? by ironring2006 · · Score: 2, Informative
      Speaking of Turing, this showed up in the references for the automatic paper that I generated:

      Turing, A., Wilkes, M. V., Nehru, B., Wang, F. Z., Subramanian, L., Zhao, W., Beaman, N. A., Turcotte, B. A., and Wu, V. Refining consistent hashing and 16 bit architectures with SandyEos. Journal of Efficient, Highly-Available Communication 1 (Apr. 2002), 50-62.
      Glad to see he's still contributing to the field from the grave!
    2. Re:Turing test? by Redwin · · Score: 1

      Well here is a random paper generator that has managed to get generated papers published, no idea what this robot would think of it though.

      --
      Warning, comments may not have been passed by the sanity department of my brain.
  4. Testing... by OakDragon · · Score: 2, Interesting
    "We believe that there are subtle, short- and long-range word or even word string repetitions that exist in human texts, but not in many classes of computer-generated texts that can be used to discriminate based on meaning."

    RESULTS: FAKE

    Yep, it works!

  5. A USEFUL application... by Flimzy · · Score: 2, Funny

    When will MIT modify this technology to filter all the spam from my mailbox?

    1. Re:A USEFUL application... by ignatz72 · · Score: 1

      LOL, if you've tried it with your own work as I have, you'd better be worried about the fact that only the SPAM would pass thru the SPAM filter...

      It graded a paper of mine as 26.7% authentic. Which confirms what I've suspected for a while now, I AM A SPAMBOT.

    2. Re:A USEFUL application... by Khashishi · · Score: 1

      the day when I start using robots to churn out spam. darn, I guess I'll have to come up with some other scheme.

  6. Discrimination by hsmith · · Score: 5, Funny

    I hope the ACLU will ensure that discrimination against metal people will not be allowed to continue.

    1. Re:Discrimination by martinultima · · Score: 1
      “Score:2, Troll”


      You damn metal guys ALWAYS get modded higher for that stuff!
      --
      Creative misinterpretation is your friend.
    2. Re:Discrimination by OglinTatas · · Score: 1

      I hope the ACLU will ensure that discrimination against metal people will not be allowed to continue.

      That is people of metal, you biologist

    3. Re:Discrimination by Iron+Condor · · Score: 3, Funny
      That is people of metal, you biologist

      I think the preferred term is "Ferro-Americans".

      --
      We're all born with nothing.
      If you die in debt, you're ahead.
    4. Re:Discrimination by Anonymous Coward · · Score: 0

      I prefer "Guest Humans"..

    5. Re:Discrimination by Anonymous Coward · · Score: 0

      \m/ Rock on!

    6. Re:Discrimination by Aspirator · · Score: 1

      Ah... I had obviously misheard:

      And so, my Ferro-Americans, ask not what your country......

      things are becoming clearer to me now.

  7. I have a better solution by Anonymous Coward · · Score: 0

    How about an intelligent human who understands the field actually READING the paper before putting it in a "peer reviewed" journal.

  8. An interesting experiment by fm6 · · Score: 4, Funny

    Has anybody fed Dvorak's latest column to this program? I've often wondered if he actually writes his columns, or just generate verbiage at random.

    1. Re:An interesting experiment by irregular_hero · · Score: 5, Funny

      "This text had been classified as
      INAUTHENTIC
      with a 24.9% chance of being authentic text"

      No kidding.

    2. Re:An interesting experiment by Anonymous Coward · · Score: 0

      74.9% FAKE.

      Watch this get modded Informative.

    3. Re:An interesting experiment by Anonymous Coward · · Score: 0

      This text had been classified as
      INAUTHENTIC
      with a 25.7% chance of being authentic text

    4. Re:An interesting experiment by Anonymous Coward · · Score: 3, Funny

      Yep, I tried that too.
      I also tried another article from ABC News about meat eaters contributing to global warming (http://abcnews.go.com/Technology/story?id=1856817 &page=1). It was inauthentic/28.8%.

      Looks like they have a crafty team of robots there at abc :)

    5. Re:An interesting experiment by DerCed · · Score: 1
      Actually, I get

      This text had been classified as INAUTHENTIC with a 25.7% chance of being authentic text


      *lmao*
    6. Re:An interesting experiment by Ontain · · Score: 2, Informative

      that's not surprising. i did a few articles and they come up in the 20ish percent range. this detector isn't very good.

    7. Re:An interesting experiment by MindStalker · · Score: 1, Informative

      It was intended to classify scientific studies. Not articles.

    8. Re:An interesting experiment by FhnuZoag · · Score: 2, Funny

      I liked the vast global robot conspiracy explanation better.

    9. Re:An interesting experiment by Rimbo · · Score: 1

      Holy crap. I just verified that, except I got a 25.7% chance of being authentic.

      Wow!

    10. Re:An interesting experiment by syousef · · Score: 1

      It appears to be very dependent on the length of the article as well with short ones being deemed inauthentic. I passed my astronomy masters papers through it, and it did well predicting they were genuine (scores of 80%-96%), except for a short 4 page proposal which it said was fake (scored around 46%). However for the papers that passed if I just put the abstract in that would fail.

      --
      These posts express my own personal views, not those of my employer
    11. Re:An interesting experiment by jacquems · · Score: 2, Funny

      I tested it on the text from the Time Cube index page, and it was rated as AUTHENTIC with a 95.3% chance of being an authentic paper.

    12. Re:An interesting experiment by RM6f9 · · Score: 1

      So, insanity/stupidity wind up being more "real" than most actual academic papers? Remove the Dali and Leary algorithms, guys!

      --
      Take the 90-Day Challenge! http://rwmurker.bodybyvi.com/
    13. Re:An interesting experiment by fm6 · · Score: 1

      "Authentic" in this context means original work that came out of the brain of human being. Time Cube is certainly that!

  9. How about . . . by denverradiosucks · · Score: 1

    How about a paper detector that will decide between 1 and 2 ply?

    1. Re:How about . . . by hal2814 · · Score: 1

      Hey, if you don't like 1-ply you can always fold it in half.

    2. Re:How about . . . by mypalmike · · Score: 2, Funny

      Hey, if you don't like 1-ply you can always fold it in half.

      And if you don't like 2-ply, you can separate the sheets. Keep in mind that this works best before you wipe.

      --
      There are 0x40000000 types of people: those who understand 32-bit IEEE 754 floating point, and those who don't.
    3. Re:How about . . . by Anonymous Coward · · Score: 0

      I use 3 ply, you insensitive clod!

  10. My take on the subject by yfkar · · Score: 1

    Suppose that in the future, there exists certifiable technology such that we can easily enable the analysis of fake articles. This may or may not actually hold in reality, but it's nonetheless a practical property of the system. Heuristic does not require such a key simulation to run correctly, but it doesn't hurt. This may or may not actually hold in reality. It can be assumed that these algorithms can observe probabilistic information without needing to manage game-theoretic modalities. Thus, the methodology that MIT uses is solidly grounded in reality.

  11. Re:too bad this technology... by Anonymous Coward · · Score: 0

    ..isn't going to be used to show that iraq is a fake war (with a lot of real costs and lives)

    Your ideas are interesting and I want to subscribe to your newsletter

  12. Oh, come on! by Anonymous Coward · · Score: 0

    ... are you implying that the guy who reads those papers is unable to identify if they were automatically written?

    -- Serhei

  13. Typos by Nom+du+Keyboard · · Score: 1
    "We believe that there are subtle, short- and long-range word or even word string repetitions that exist in human texts, but not in many classes of computer-generated texts that can be used to discriminate based on meaning."

    Do robots make typos? Do they make the same typos each time, or different ones?

    Therein lies the true heart of a proper detector.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
    1. Re:Typos by DanTheLewis · · Score: 1

      Do robots make typos? Do they make the same typos each time, or different ones? Therein lies the true heart of a proper detector. I don't make typos, but that doesnt mean I'm not a robot.

      --

      Q: What did the comedian say to the crowd?
      A: If I knew, this joke would be funny.
    2. Re:Typos by dlakelan · · Score: 2, Informative

      Do robots make typos? Do they make the same typos each time, or different ones?

      Based on the slashdot articles that get posted. I would say YES.

      Actually it's pretty easy to add random convincing misspellings to text, you could use a database from something like usenet, and a spell checker to map misspelled words to their real counterparts, and then have a straightforward algorithm for replacing some set of words with misspellings, and you could tune that for consistency. It would be easier than many other aspects of faking papers.

      --
      ((lambda (x) (x x)) (lambda (x) (x x))) http://www.endpointcomputing.com a scientific approach to custom computing.
    3. Re:Typos by brian0918 · · Score: 4, Funny

      E-mail spambots have been making typos for years.

    4. Re:Typos by cdrdude · · Score: 0

      Heh. Today, a friend of mine had his gmail send an auto-reply to all incoming messages. Five words. Two typos. One grammatical mistake (I'm not too sure about that one, but another friend claimed it was there). But 20% of his writing was mispelled!!

      I'm going to make fun of him for that for at least a week... :-)

      --
      This sig is neither interesting, nor humorous. Including meta-humor.
    5. Re:Typos by xenn · · Score: 1

      you misspelled misspelled

  14. The mentioned robo-article by johnfink · · Score: 1
  15. Sadly, It appears that I am a robot. by cbelt3 · · Score: 3, Interesting

    I've taken a long posting that I wrote on my blog and dropped it into the site. And I am Inauthentic. Now I understand the "Bladerunner Moment" comment in the article. I shall begin to surround myself with oddly colored polaroids and snapshots of theoretically implanted ancestors.

    The nice thing is that we've finally settled the argument if machines can be made to drink beer and like it !

    1. Re:Sadly, It appears that I am a robot. by Profane+MuthaFucka · · Score: 1

      I just did the exact same thing, and it scored 30.4%, rated inauthentic. I agree completely with these results. Everything I write is counterfeit because I wrote it myself.

      --
      Fascism trolls keeping me up every night. When I starts a preachin', he HITS ME WITH HIS REICH!
    2. Re:Sadly, It appears that I am a robot. by cmorgan47 · · Score: 1

      i, also, am a robot.

      and enjoy bitching about my robot wife. it gave me a 21% chance of being authentic.

      --
      no i have not shot my gun in the air and gone 'Ahh!'
    3. Re:Sadly, It appears that I am a robot. by SpectreHiro · · Score: 1

      Ha! I'm 12.3% more authentic than you... Although I wonder if part of the problem is that we're not submitting "scientific papers." The fact that I was writing about pr0n probably should've tipped the detector off. *shrug*

      --
      You can't win, Darth. If you mod me down, I shall become more powerful than you could possibly imagine.
  16. And now waiting for the slashdot version by plankrwf · · Score: 1

    You know, the one that verifies whether a comment is from a 'real person' or from a 'offworlder', also known as 'troll'.

    Too bad the prototype at http://montana.informatics.indiana.edu/cgi-bin/fsi /fsi.cgi thinks most comments are too short...

    Roel

  17. Great... by random_amber · · Score: 1

    With a small mod to this program, we can now get rid of all the androids that post on /.

    1. Re:Great... by apt142 · · Score: 1

      You mean these guys?

    2. Re:Great... by linvir · · Score: 1
      No, these guys.
      Keep challenging yourself is definitely the machine. This is your head, or half-crouch position. The best dancers use only does require at :up::right:. You would I see what looks a good doing crazy if you're a result. Add your head bobbing in the Philippines did this trick to hold onto. of breakdancing/b-boy'ing competitions. Those help with your performance. If the beat, instead of the dance. When you have in places where the point of the people who are citing BSB, N-SYNC videos has very simple leg work, but if don't have fun. Second, by going in special spots, throw a spin on one is in times in case you play.
      Death to the trolltalk crapflooders!
  18. Re:First Post! RouterSlayer Rules! by Flimzy · · Score: 1, Funny

    ... for very large values of "1".

  19. See what it says about slashdot by Locke2005 · · Score: 2, Funny
    According the the program, the comments to this article are rated as follows:

    This text had been classified as INAUTHENTIC with a 32.2% chance of being authentic text

    Bearing in mind that text over 50% chance will be classified as authentic, this add credence to the theory that slashdot comments are generated by monkeys randomly typing on keyboards.

    --
    I've abandoned my search for truth; now I'm just looking for some useful delusions.
    1. Re:See what it says about slashdot by Anonymous Coward · · Score: 0

      Look in the mirror, buddy.

    2. Re:See what it says about slashdot by Anonymous Coward · · Score: 0

      djdfjkhf rfriojo frrfik ikfdljlfd fggjkgl gfgljgijk fkfjlgr r9r9 dfiofggj diljdd
      lkdjdkjl MOLLY WANT BANANA logsdjisdglkj gfdlgjglfj fglfgkjgfkj fklfgjglfkjgf
      fgdkjgdl;kgf fglkjgflkgdfj fglkjfglgf fglkjgflgfkj fgkljfglgfkj

  20. Sounds like a major innovation in input screening by mi · · Score: 1
    From e-mail spam, to Slashdot submissions, to "letters to editor", to political petitions.

    Or is this just another application of Bayesian filters again?

    --
    In Soviet Washington the swamp drains you.
  21. Result: Darwin is fake, so is the bible by Anonymous Coward · · Score: 0

    I'm sure the ID people will apply it to "Origin of species", while others will apply it to the bible.

  22. Re:too bad this technology... by TheRealMindChild · · Score: 1

    Way to plug your political agenda.

    What we really need is a fake Extacy detector. The world would be a better place.

    --

    "When life gives you lemons, don't make lemonade. Make life take the lemons back!" -- Cave Johnson
  23. It Caught Mine by darthservo · · Score: 1
    I tried turning in my paper, but this program caught it and I failed.

    I guess I should have put a little more effort into faking it, instead of just printing out one of the word salad spams I got in my inbox.

    --

    Prove it.

  24. Duh? by Anonymous Coward · · Score: 1, Insightful

    Light on details but I assume that to create this thing a bunch of known-real and known-fake papers were analzyed and they just found patterns that were indicative of a fake. They wouldn't even have to understand why...just dump a bunch of statistics and choose the ones more associated with the fake pile. Just like reverse-engineering spam filters, it could be circumvented by discovering whatever the "real" properties are and focusing on them.

  25. Self defeating? by benhocking · · Score: 5, Funny

    It seems like it wouldn't be too difficult to modify the MIT program to use this new anti-robot robot to write papers that this anti-robot robot would not be able to detect. Ideally, this would be done with a learning algorithm (so that it could easily be extended to other anti-robot robot programs), but reverse-engineering the anti-robot robot (by humans) should also provide a solution.

    Now that Indiana U has thrown down the gauntlet, I wouldn't be surprised if MIT responds. Hopefully it will result in an even better paper-writing robot. Ideally, it will lead to dissertation-writing robots. :)

    --
    Ben Hocking
    Need a professional organizer?
    1. Re:Self defeating? by cp.tar · · Score: 4, Interesting

      I recently had to check out an essay-grading robot for my Introduction to Natural Language Processing class.

      I'd fed it the introduction of a randomly generated essay. It got a 4/5 on all counts.

      I figure, if teachers are going to use robots to grade essays, we should use robots to create them in the first place.

      --
      Ignore this signature. By order.
    2. Re:Self defeating? by AnalystX · · Score: 1

      You're right. That's essentially what I brought up in my other post (before seeing this post). This is like the plight of spam filtering, virus detection, or any kind of warfare-counterwarfare scenario. As soon as one side comes up with a defense, the other side studies it and makes adjustments to compensate. With as varied as the results can be from author to author, I don't see a reliable detection solution coming anytime soon.

    3. Re:Self defeating? by bokmann · · Score: 1

      Douglas Hofstadter would be proud of you.

    4. Re:Self defeating? by mctk · · Score: 5, Funny

      Eventually my students won't have to write papers and I won't have to grade them! Think of the potential application of this technology towards education!

      --
      Paul Grosfield - the quicker picker upper.
    5. Re:Self defeating? by quokkapox · · Score: 1
      The day is coming when you'll have to submit an authentic scientific paper in order to comment on a slashdot story.

      On that day, I'll be long dead and so will my Moravec-inspired uploaded mind-children.

      --
      it's a blue bright blue Saturday hey hey
    6. Re:Self defeating? by BraksDad · · Score: 3, Insightful

      Maybe after a string of anti robot robots, MIT would come up with a robot that would generate a real scientific paper!

      next comes your anti robot robot
      then the anti anti robot robot robot
      and of course the anti anti anti robot robot robot robot
      and the anti anti anti anti robot robot robot robot robot
      ...
      I could go on since cut and paste is so easy ;-)

      Perhaps it would be a million anti's followed by a million and one robots before something useful came out of such an exercise, but wouldn't it be cool to witness?

      --
      Slowly waving my hand - "This is not the sig you are looking for."
    7. Re:Self defeating? by rebelcan · · Score: 1

      potential application of this technology towards education

      The Kansas Board of Education gets another shot at Creationism in the classroom?

      --
      God is dead -- Nietzsche
      Nietzsche is dead -- God
      Zombie Nietzsche lives! -- Zombie Nietzsche
    8. Re:Self defeating? by mctk · · Score: 2, Funny
      Only if their re-writing robots are designed intelligently...

      Okay, actually I just wanted to comment that I love the sig.

      --
      Paul Grosfield - the quicker picker upper.
    9. Re:Self defeating? by Frumious+Wombat · · Score: 2, Interesting

      Personally, I'd be more interested in modifying this for Fraud Detection. The robot looks over your data and text, and decides, "Sorry Dave, a leap of faith has occurred here." Presumably, at that point the robot locks you out of your lab.

      This could lead to a whole series of literary robots: The Too Many Coincidences in Fiction Detector, The Humanities Thesis Verbiage Reducer, The This Movie Is Going to Suck No Matter Who Acts In/Directs It Detector, and so forth.

      --
      the more accurate the calculations became, the more the concepts tended to vanish into thin air. R. S. Mulliken
    10. Re:Self defeating? by Anonymous Coward · · Score: 0

      I look forward to the first ACM/IEEE International Robot/Anti-robot Conference, which is completely organized and participated by robots. Robot-generated CFP, robot program committee, robot generated papers and reviews, and ofcourse robot generated presentations at a cyber hotel. the only real thing is the registration fee. To be fair, the conference even adopts double-blind reviews. I suspect the acceptance rate will be as low as that of SIGCOMM or MOBICOM, because I heard robots are even meaner than human beings, and robots are completely devoted to generating and recycling papers.

    11. Re:Self defeating? by pclminion · · Score: 1

      Clearly, what we need here is an anti- anti-robot robot robot.

    12. Re:Self defeating? by kabz · · Score: 1

      That is pure fantasy. Everyone knows that the true *academic* way to grade papers is to toss them down a nearby stairwell...

      Hence my *lead-weighted* document folders. Bwahahahah.

      --
      -- "It's not stalking if you're married!" My Wife.
    13. Re:Self defeating? by jdgeorge · · Score: 1

      Now that Indiana U has thrown down the gauntlet, I wouldn't be surprised if MIT responds. Hopefully it will result in an even better paper-writing robot. Ideally, it will lead to dissertation-writing robots. :)

      Hmmm.... Have you ever read a dissertation? You'd have a hard time convincing me that such a robot hasn't been in common use for quite a while.

    14. Re:Self defeating? by marcosdumay · · Score: 1

      You are assuming that P == NP here. Or that the bot that creates the paper has infinite time to run.

      On this specific situation, it may be usefull used with a learnning algorithm. But not on a general case.

    15. Re:Self defeating? by frdmfghtr · · Score: 1
      Eventually my students won't have to write papers and I won't have to grade them! Think of the potential application of this technology towards education!


      This reminds me of a movie where a few students started sending tape-recorders to class instead of themselves. Gradually the scene had the professor lecturing to a room full of tape recorders. The last step in this scenario was a tape of the lecture being played to a room full of machines taping it.

      (Dammit if I can't recall which movie that is though.)
      --
      Government's idea of a balanced budget: take money from the right pocket to balance...oh who am I kidding?
    16. Re:Self defeating? by zCyl · · Score: 1

      Hence my *lead-weighted* document folders. Bwahahahah.

      Aren't the ones at the top of the stairs supposed to get the highest grades?

    17. Re:Self defeating? by bj8rn · · Score: 1

      In your dreams. Come to think of it, not even there.

      --
      Hell is not other people; it is yourself. - Ludwig Wittgenstein
  26. Some fields don't have those by spun · · Score: 1

    Literary criticism, for instance. Lit. Crit. papers never make sense so only some form of advanced computer algorithm would be able to tell if a paper was written by a human.

    --
    - None can love freedom heartily, but good men; the rest love not freedom, but license. -- John Milton
  27. Doesn't Work by iDope · · Score: 1

    Says "INAUTHENTIC" for the text on this page: http://www.lipsum.com/ BTW, shoudn't it be "Unauthentic".

  28. The program is a failure. by im_thatoneguy · · Score: 2, Interesting

    Apperantly I'm on average 49% artificial, based on school papers I wrote. I dub thee program: a failure.

    1. Re:The program is a failure. by tomstdenis · · Score: 1

      Not so. The Miller-Rabin algorithm will randomly allow 25% of all composites through.

      The trick to reading the results is when it says "definitely fake" it's fake. Otherwise you ignore the result as either "not-fake" or inconclusive.

      Tom

      --
      Someday, I'll have a real sig.
    2. Re:The program is a failure. by im_thatoneguy · · Score: 1

      Well... actually the 49% was based on a very small sample, when I enlarged it to my entire class (just for fun) it moved down to about 20%... pretty conclusively innacurate.

    3. Re:The program is a failure. by tomstdenis · · Score: 1

      Yeah, my point though is the result is not aggregate it's binary. So either you get "yes this is fake" [say above 80% probability] or it isn't [below 80%].

      I don't know what the threshold for this test is but it's likely not around 50%.

      Tom

      --
      Someday, I'll have a real sig.
    4. Re:The program is a failure. by vishbar · · Score: 1

      Or maybe you write shitty papers?

      *ducks*

      --
      Ride the skies
    5. Re:The program is a failure. by im_thatoneguy · · Score: 1

      Oh most definitely. But still real. :)

  29. Only works for scientific papers by gurps_npc · · Score: 4, Informative
    If you try to use it on any human written NON scientific paper, such as Lincoln's gettyburg address, it almost always considers it false.

    I suspect that it is looking for the conventional thinking with conventional word structure. As such, it is NOT a good idea i

    --
    excitingthingstodo.blogspot.com
    1. Re:Only works for scientific papers by nasor · · Score: 4, Informative

      No, it doesn't even seem to work on scientific papers. I submitted four papers from the latest issue of Inorganic Chemistry and it thought 2 out of 4 were false:

      Inauthentic: Assembly of a Heterobinuclear 2-D Network: A Rare Example of Endo- and Exocyclic Coordination of PdII/AgI in a Single Macrocycle.

      Inauthentic: Pyrazolate-Bridging Dinucleating Ligands Containing Hydrogen-Bond Donors: Synthesis and Structure of Their Cobalt Analogues

      Authentic: Manganese Complexes of 1,3,5-Triaza-7-phosphaadamantane (PTA): The First Nitrogen-Bound Transition-Metal Complex of PTA

      Authentic: Structure, Luminescence, and Adsorption Properties of Two Chiral Microporous Metal-Organic Frameworks

      Based on this (small) sampling, the program doesn't appear to do any better than if it were to guess randomly. I wonder if this thing is even supposed to work, or if it just returns a random result based on a hash of the paper or something?

    2. Re:Only works for scientific papers by Anonymous Coward · · Score: 0

      Try some *computer* science papers, for example, from the ACM / SIGGRAPH stuff:

      http://www.cs.brown.edu/~tor/sig2004.html

      of course, use Google to get a text/html version that you can paste into the test form.

      I tried several and they all rated 90+% authentic.

    3. Re:Only works for scientific papers by Anonymous Coward · · Score: 0

      Unless you copy-and-paste Lincoln's gettyburg address 5 times into it. Then its got a 90.8% chance of being an authentic.

    4. Re:Only works for scientific papers by jackchance · · Score: 1

      I submitted the introduction to my phd thesis (in behavioral neuroscience). And it only got a 22.9% chance of being authentic. Their algorithm sucks.

      --
      1 1 2 3 5 8 13 21 34 55 89 144 233 377 610 987 1597 2584 4181 6765
    5. Re:Only works for scientific papers by Hoch · · Score: 1

      Sure, it is the algorithm. Thats what a dirty robot would do, huh.

      --
      2*31*37*263
    6. Re:Only works for scientific papers by shabushabu · · Score: 1

      This is utter crap! I tested this on 4 of my papers, 3 of them got scores of 18.7%, 24.3% and 25.9%! One got a score of 62.3%. This is a gimmick (and a very badly executed one)!

    7. Re:Only works for scientific papers by jacquems · · Score: 1

      I got some interesting results testing it on various documentation texts.

      I tested it on a background information chapter about firewalls from our reference guide, and it gave it a 94.7% chance of being authentic. The style was mainly descriptive, so I guess it could be considered similar to a scientific paper. The chapter was written and edited by several different people.

      I also tested it on a how-to chapter from our online help, which was written entirely by me. It got rated with a 97.8% chance of being authentic. It was written in an instructional style, so I don't think it was that similar to a scientific paper.

      On the other hand, I tested it on some marketing material from this page (I omitted the table because of formatting issues), and it was rated as inauthentic with only a 36.7% chance of being authentic text. I guess theoretically we could replace our marketing department with robots, and nobody would notice the difference.

    8. Re:Only works for scientific papers by zurmikopa · · Score: 1

      It seems to think that my blog has a 94% chance of being "a human-written authentic scientific document" ...

  30. damn subject by turtleAJ · · Score: 0

    I'm positive /. has a very similar program used to write-up stories.

  31. Incase anyone was wondering... by GillBates0 · · Score: 1
    ...I just extracted the text from the PDF version of their paper on the subject (titled "Using Compression to Identify Classes of Inauthentic Texts") and ran it through the detector.

    It passed with a "90.1% of being an authentic paper.

    --
    An Indian-American Hindu committed to non-violent thought/speech/action alarmed by the global explosion of radical Islam
  32. Declaration of Independence by Math421 · · Score: 1

    I fed it the Declaration of Independence.

    It said it was INAUTHENTIC with a 24.5 chance of being authentic text.

    1. Re:Declaration of Independence by AnalystX · · Score: 1

      It knows you didn't write it.

    2. Re:Declaration of Independence by RoadWarriorX · · Score: 1

      That's funny. I tried the Declaration of Independence also, the software found it to be INAUTHENTIC with 36.5%chance of being authentic. It's not only incorrect, but inconsistent also.

      Let's GNU plot! ;-)

  33. What a Downer by neonprimetime · · Score: 1

    This kinda makes me feel stupid ... I had an undergrad paper accepted at a conference and I went and presented there ... I always felt good about that, cause I was sure some prof read it, enjoyed it, and thus the reason I got the call to present. I guess I was wrong ... they don't read em, they just pick random ones out of the pile.

    1. Re:What a Downer by zippthorne · · Score: 1

      Don't feel too bad, at many conferences, the "Pros" make up the presentation on the spot while the preceeding presenter is presenting his research and insight..

      --
      Can you be Even More Awesome?!
  34. evolution of bots by Odocoileus · · Score: 1

    What I think is interesting is that program is like an environmental hurdle for computerized paper writers everywhere. Now the bots must become more advanced.

    --
    ...
  35. Doesn't Make Sense by AnalystX · · Score: 1

    Why does anyone think this could effectively detect "robot" papers? All a robot has to do is incorporate this litmus test into its writing algorithm and make recursive changes until the paper passes.

  36. Fake Scientific Paper Detector by Svlad_Cjelli1972 · · Score: 1

    Bah! let me know when they invent a real one.

  37. Dvorak FAILS! by Anonymous Coward · · Score: 0

    Ask and ye shall receive:

    This text had been classified as
    INAUTHENTIC
    with a 25.7% chance of being authentic text

    -RouterSlayer

    1. Re:Dvorak FAILS! by Anonymous Coward · · Score: 0

      uh.. RouterSlayer, why did you post anonymously?

  38. Teen idols fleeing the country... by turtleAJ · · Score: 0

    Oh shit!

    We've just found out how Britney Spears and Puff Daddy's groups make their album's lyrics!

    They're out of the job for sure... ;p

  39. surely already done? by user24 · · Score: 1

    plenty of plagiarism detection software out there; if the prank was really just random bits of (I assume pre-existing and public) text, then all the program need do is search google for a few random snippets, no?

  40. Think of the monkeys... by Anonymous Coward · · Score: 0

    The Monkey Anti-Defamation League has retained our law firm of Dewey, Cheatum, and Howe, to represent them in this matter.

    We demand you cease and desist from defaming Monkeys by comparing them to Slashdot posters. If this defamation continues, we will be forced to persue legal action. Or to mod you off topic because TFA is about robots, not monkeys.

    Sincerely,

    John Q. Cheatum, IV, Esq.

  41. Ah.... by BaronSprite · · Score: 2, Funny

    Maybe slashdot can start running it on their links for "cold fusion in 1 year!".......

  42. I could use ... by slashdotmsiriv · · Score: 1

    a crappy paper detector ...

  43. Israel bombs Iran with US Air Support by Anonymous Coward · · Score: 0

    Israel bombs Iran with US AWACS support. It's hit the fan.

  44. Re:too bad this technology... by xxdinkxx · · Score: 0, Offtopic

    it seems that some people on slashdot are so mindset-boolean. that it never occured to them that the statement was serious, but the way it was stated was a joke. Go watch some of the absurd things "papa bear" as colbert calls him says... hasn't anyone ever heard the expression never feed a troll.. have a good day.

  45. Re:too bad this technology... by Anonymous Coward · · Score: 0

    Well, it is a fake war in the sense it is based on multiple false premises and that its only real purpose was the short term political gain necessary to (just barely) get George Bush elected for a second (and hopefully, if the Constitution is still in effect in 2008, final) term. The continuing chaos and loss of lives is also beneficial to George Bush in that he is able to give more and more no-bid contracts to clean up the mess (which appears to be designed with no real end in sight) to his cronies in the private sector. Thus, the Republican party gets funding even though the only constituency they are (really) helping is one made up of corporations the Republican party is giving the government's money to. It is the perfect war for the actual purpose it was created for. That purpose is no where near what it was sold to the American people as. I wish we had a detector for this kind of bullshit, or at least a media that prior to 2005 was willing to report on the corruption in the Republican White House.

  46. FYI: it wasn't really a conference by jxyama · · Score: 1

    FYI, the "conference" the prank paper was accepted for is arguably a real "conference," it's certainly not a reputable one. The "conference" ("World Multi-Conference on Systemics, Cybernetics and Informatics") is famous for spamming everyone in just about every semi-related subject to submit and has famousely low bar for acceptance. See http://en.wikipedia.org/wiki/WMSCI

  47. As I espected by voice_of_all_reason · · Score: 1

    JAR JAR Oyi, mooie-mooie! I luv yous! The frog-like creature kisses the JEDI.
    QUI-GON Are you brainless? You almost got us killed!
    JAR JAR I spake.
    QUI-GON The ability to speak does not make you intelligent. Now get outta here!


    This text had been classified as INAUTHENTIC with a 46.0% chance of being authentic text

  48. What does it think of my paper? by onco_p53 · · Score: 1

    Results from one of my papers: http://aem.asm.org/cgi/content/full/70/10/5980

    This text had been classified as
    AUTHENTIC
    with a 95.2% chance of being an authentic paper

    Whew!!, cool maybe I'll pass the turing test too.

  49. MY Boss Will kill me! by Gno · · Score: 1

    Damn I actully have to hand write thoose TPS reports. I had my TPS writer 3000 do it all for me before.

    --
    It's not -1 Flamebait! It's +5 Funny. You just didn't get the joke...
    1. Re:MY Boss Will kill me! by Anonymous Coward · · Score: 0

      TPS reports? I *didn't* get a memo about TPS reports.

  50. Reliable? by SpiritGod21 · · Score: 1

    Results on an article I wrote recently for my blog:
    This text had been classified as
    INAUTHENTIC
    with a 28.1% chance of being authentic text

    Maybe that just means my writing is well done?

    I've had several professors that prefer we email our work to them for a variety of reasons, and I wouldn't be surprised if one of those was to check the originality and validity of our essays. I don't think it's a far step before they start using something like this to see if our writing is original... but just like lie detector tests and other means of verifying the "truth," it's not accurate and shouldn't be used with the assumption that it is.

    That won't stop someone from doing so, however.

  51. There're a lot of "my stuff was inauthentic" posts by The_REAL_DZA · · Score: 1

    from people who have fed it (and no, I haven't R'd TFA -- this is still SlashDot, isn't it?!?!) their own (genuine) papers or something they feel is "authentic", and I wonder if the reason is less the fault of the software and more the fault of (genuine/human) authors writing (intentionally or unintentionally) in such a style because it's perceived to be the way they're "supposed" to write. Maybe software like this will cause authors to put a little more thought into their craft and not allow themselves to just write on autopilot.

    --


    This space intentionally left (almost) blank.
  52. Re:It Caught Mine by aztec+rain+god · · Score: 1

    I felt heartened that my masters thesis came up 96.8% authentic. Guess I'm mostly human.

    --
    Sig cannot be found.
  53. Folks, I am a robot plagiarist by Flying+pig · · Score: 1
    I took one of my own postings and got a score of 11%. And it was something I had actually written myself, a piece of reasonable length about a subject on which I have first hand experience.

    I then tried an article from Scientific American and it scored 24% - sorry, guys, time for me to cancel the subscription, you are full of it. Alternatively, of course, it is the University of Indiana School of Informatics that's full of it and the air is thick with over-hype. It would be interesting for someone with the time and energy to feed in some papers on string theory and some articles on astrology and compare the results.

    --
    Pining for the fjords
  54. I am in awe by DingerX · · Score: 4, Informative

    So I go there, and I start shoving it text from my hard drive. I try:

    A) Text of an article (Philosophy) I (native English speaker) wrote in Italian: 98.5 Authentic.
    B) Text of an article I wrote in English (History): 87.8
    C) Text of an article (History) written in French by a native French speaker and translated into English: 93.2
    D) Critical edition of a 14th-century Latin text (Theology): 97.7 Authentic.
    E) Documentation to a Field Artillery Simulation: 95.3
    F) A completely bogus narrative for a monastic order that doesn't exist, written in a style that mimics A)-C): 16.8% Inauthentic

    So in this case, we have a human written document that has superficial meaning, but is written as a "fake scientific paper", and registering as such.

    And yes, I did read the "purpose" of the page; I know it's not supposed to detect it.


    And yet it does, decisively.

    1. Re:I am in awe by yali · · Score: 1

      Interesting... Just for the heck of it, I ran Alan Sokal's paper Transgressing the Boundaries through the detector. It came back with a 93.8% chance of being authentic.

      For those of you who don't remember the story, Sokal, a physicist, wrote a paper full of postmodern-sounding gobbledygook, asserting among other things that gravity is a social construction (the paper was subtitled, "Towards a Transformative Hermeneutics of Quantum Gravity"). The paper was accepted at a peer-reviewed humanities journal. Sokal later revealed it to be a hoax.

      Of course, by the detector's standards the article was indeed "authentic" in the sense that a human being did write it.

    2. Re:I am in awe by yali · · Score: 1

      Correction: The humanities journal in question was not peer reviewed; see the editors' account of why they published the hoax paper.

    3. Re:I am in awe by pmadden · · Score: 1

      So I stuffed the intro section of a paper I recently published, and got 33%, Inauthentic. Other chunks (from published, peer reviewed, and heavily cited conference and journal papers) also seem to get low marks. The best paper at ISPD06 on robust extraction of spatial correlation gets 32.3 Inauthentic for the intro section. They've tuned their software to a specific set of papers, but it doesn't seem to be representative of the full spectrum. Are they expecting full papers (4-8 pages of text)? Am I not feeding it enough? I guess now I know why I like watching Blade Runner.

    4. Re:I am in awe by Anonymous Coward · · Score: 0

      Great. A PDF of an e-mail (note the ">From"). I don't always hate PDFs as much, but one unlabeled PDF link crashed my browser earlier today. Someday I'll figure out how to disable the broken plug-in. Ok, instead of being lazy, I figured out how to disable it. It's Tool->Optoins->Downloads, look for the PDF icon and remove the worthless thing. Happy happy.

  55. Oprah by maddash1946 · · Score: 1
    No doubt this paper will soon be turned into a book and will make Oprah's list.

    "A Million Little Lines of Code"?

  56. Well, that's a relief by Reality+Master+101 · · Score: 1

    The Special Theory of Relativity got a 91.9% chance of being authentic. I'm sure if Einstein were alive, he'd be relieved.

    --
    Sometimes it's best to just let stupid people be stupid.
  57. I tried it out on my jornal publications.... by Brit_in_the_USA · · Score: 1

    ....and it appears that the detection software rates them INauthentic if I only give it the first page of text (about 300 words), a ~25% score.

    With 3 or more pages of text the score seems to converge to ~93% (authentic).

    So be careful when scannign short articles or documents.

    I stand by my claim that the papers I used were written by a human (me) and so was this post.

    1. Re:I tried it out on my jornal publications.... by Anonymous Coward · · Score: 0

      Same experiment, same result. As I added more pages into the tool's text box, the percentage of authenticity improved.

  58. Kinda Weird... by Anonymous Coward · · Score: 0

    I made a paper, and got a 24.3% chance of Authenticity, then changed one word, and got it up to 24.5%.
    Either it can read my mind, or has a weird algorithim(sp?).

  59. From the paper. by The+Real+Nem · · Score: 1

    One must understand our network configuration to grasp the genesis of our results. We ran a deployment on the NSA's planetary-scale overlay network to disprove the mutually largescale behavior of exhaustive archetypes. First, we halved the effective optical drive space of our mobile telephones to better understand the median latency of our desktop machines. This step flies in the face of conventional wisdom, but is instrumental to our results. We halved the signal-to-noise ratio of our mobile telephones. We tripled the tape drive speed of DARPA's 1000-node testbed. Further, we tripled the RAM space of our embedded testbed to prove the collectively secure behavior of lazily saturated, topologically noisy modalities. Similarly, we doubled the optical drive speed of our scalable cluster. Lastly, Japanese experts halved the effective hard disk throughput of Intel's mobile telephones.

    Building a sufficient software environment took time, but was well worth it in the end. We implemented our scatter/gather I/O server in Simula-67, augmented with oportunistically pipelined extensions. Our experiments soon proved that automating our parallel 5.25" floppy drives was more effective than autogenerating them, as previous work suggested. Similarly, We note that other researchers have tried and failed to enable this functionality.

    Lets see:

    • Access to the "NSA's planetary-scale overlay network". - Check.
    • Access to "DARPA's 1000-node testbed". - Check.
    • "Mobile telephones" with "optical drives". - Check.
    • Use of "Japanese experts". - Check.
    • "Mobile telephones" with "hard drives" produced by "Intel". - Check.
    • Use of a horribly outdated programming language ("Simula-67"). - Check.
    • Use of the phrase "parallel 5.25" floppy drives" in a sentence that makes absolutely no sense. - Check.

    How anyone would have thought this paper wasn't a flaming pile of BS is beyond me. I especially like the graph that measures time in teraflops. WTF???

  60. Spam? by monoqlith · · Score: 1
    From the article:
    "Sick of receiving spam emails requesting submissions to the 2005 World Multi-Conference on Systemics, Cybernetics and Informatics - which charges $390 for each attendee - students Jeremy Stribling, Daniel Aguayo and Maxwell Krohn of the Massachusetts Institute of Technology wrote a program to generate a nonsense paper."


    Does this qualify as spam? Was it unsolicited and generated to bypass spam filtering technology? Could I run this spam email through the inauthentic paper detector and have it come out as authentic?
    Sounds like it might have been a solicited email from a campus mailing list, as it is something that would only be pitched to say, engineering students at a university, who have interest in these subjects. I'd rather receive this than all the penis enlargement solutions and horribly filthy pr0n spam that I get.

    It is alarming that the MIT fake paper made it through the selection process: Academic pretense is dangerously high if the only requirement for a paper to make it into a conference is that it is full of multi-syllabic scientific jargon, and no one actually reads it closely enough to understand it.
    1. Re:Spam? by ClickOnThis · · Score: 1

      Could I run this spam email through the inauthentic paper detector and have it come out as authentic?

      This touches on the one issue that has yet to be discussed here: what, if anything, could this program do to help identify spam? My first thought is "probably not much", only because papers tend to be alot longer than e-mails, thus giving the program a chance to to generate better statistics for a bayesian filter to make a decision. Even so, when I look at some of the surreal gibberish embedded in spam that still manages to defeat filters and get into my inbox, I have to think this kind of program might stands a chance at detecting it.

      --
      If it weren't for deadlines, nothing would be late.
    2. Re:Spam? by Feanturi · · Score: 1

      some of the surreal gibberish embedded in spam that still manages to defeat filters and get into my inbox, I have to think this kind of program might stands a chance

      Then nobody would ever be able to get emails from wives and girlfriends anymore. Hey maybe that's a good thing actually, sign me up.

    3. Re:Spam? by ClickOnThis · · Score: 1

      Then nobody would ever be able to get emails from wives and girlfriends anymore.

      Well, if you S.O.'s emails look like they came from (what I just learned is called) a hash buster, then (s)he either really is a robot, or (s)he needs to be treated for this.

      --
      If it weren't for deadlines, nothing would be late.
    4. Re:Spam? by Feanturi · · Score: 1

      Nah, it was just a dig at the female gender in general, who seem rather well-described in that wikipedia article you link to. Particularly "confused, and often repetitious".

  61. ok did anybody read the document by crodrigu1 · · Score: 1

    OK it is talking about network, physics and everything in between. So how anybody can say the document is authentic? so hakers with markov and quantum mechanics in a web paper?

  62. Here's one we need to check out by martinultima · · Score: 1
    Wikipedia's article on authentication:
    This text had been classified as
    INAUTHENTIC
    with a 31.2% chance of being authentic text

    Something looks fake to me...
    --
    Creative misinterpretation is your friend.
  63. Lorem Ipsum, authentic ? by Pleb'a.nz · · Score: 1

    'This text had been classified as
    AUTHENTIC
    with a 59.9% chance of being an authentic paper'

    Well, that's the results of 3 paragraphs of generated Lorem Ipsum from www.lipsum.com...

  64. The Sokal Affair by Morganth · · Score: 1

    All this talk without a single mention of the Sokal Affair? It's pretty relevant. Also be sure to check out Paul Boghossian's article, "What the Sokal Hoax Ought to Teach Us." Great reading.

  65. The Sokal Hoax by virgil_disgr4ce · · Score: 1

    Have any of you heard of The Sokal Hoax? In 1996, a daring and dissatisfied physics professor named Alan Sokal wrote a bullshit paper called "Transgressing the Boundaries: Toward a Transformative Hermeneutics of Quantum Gravity", which Sokal called "a pastiche of left-wing cant, fawning references, grandiose quotations, and outright nonsense", which was "structured around the silliest quotations I could find about mathematics and physics" made by humanities academics. In short, it caused a big scandal because the paper was readily accepted without review by Duke University's postmodern cultural studies journal Social Text. It's probably one of the best and most controversial examples of a hoax on the "academic community," and it is excellent proof of just how much bullshit flies for "cultural studies." Run THAT through your paper detector! Read more about it here: Skeptic's Dictionary and Museum of Hoaxes

  66. Fake Scientists Paper Python by Aedon · · Score: 1

    "Right, Make sure E' Finds out whether or not this paper was written by a computer." "Right. We stay 'ere and press the buttons until you come back!" "No. Its quite simple, really. You stay 'ere and watch the robot!" "Not to leave unless the robot finishes at which point we fix the errors in the paper in question!" "Yes. Um, no. Watch the Robot, log the errors, notify me if something goes wrong." "Well, of course we do that! Um, Um, Um, IF someone ELSE came in and entered in an exta bibliography or two, do I leave then?" Blimey.

  67. Re:too bad this technology... by JasonTechnophile · · Score: 1

    Here you go. "These new kits can therefore quickly and easily distinguish between real ecstasy and all the common substitute drugs on the ecstasy market, including DXM." -- http://dancesafe.org/testingkits/

  68. Subject comparison by ostehaps · · Score: 1

    I submitted two of my undergrad papers: One in International Economics, one in Strategic Management. The former scored 95.9%, the latter 45.7%! Just goes to show that economics is more than twice as scientific as management.

  69. Look on the Bright Side by capt.Hij · · Score: 1

    Cheer up, chances are they really just felt sorry for you!

  70. Failure by Anonymous Coward · · Score: 0

    The program simply does not work very well. I tried several samples of my own writing, and they all came up as Inauthentic, with 21% chance of being authentic. Better luck next time.

  71. Bible by frostoftheblack · · Score: 1

    I'm surprised nobody fed in the Bible to see if it was authentic. If it were deemed inauthentic, I expect the intellectuals on here would take it as evidence that God does not exist.

    --
    Do not mark in this space. For official office use only.
  72. Can fool it by duplicating first page by currivan · · Score: 2, Interesting

    Duplicating the first half of the sample fake paper after the end of the footnotes makes it go from inauthentic (17%) all the way up to 91% authentic. It seems to be looking for long-range n-gram repetition, but it doesn't have a ceiling on frequency or length or the repeated text.

    It shouldn't be hard to compare the distribution of n-gram recurrence rates (or distances between recurrences) to the observed distribution for actual papers. Something like a KL divergence would capture deviations in either direction.

  73. Re:It Caught Mine by Em+Adespoton · · Score: 2, Interesting

    This raises a question... how do Wikipedia articles fare? --I'd guess that they should be at least *somewhat* scientific....

  74. What it says about anything by Pi_0's+don't+shower · · Score: 2, Informative

    I just finished writing a scientific paper for publication. Apparently, this filter is very reliant on using long-term pattern recognition. When I fed this application my introduction only, it told me my work was INAUTHENTIC with a 35% chance of authenticity. When I fed it the first two sections, it said it was AUTHENTIC with a 66% chance of authenticity. And finally, when I fed it the entire paper, it said it was AUTHENTIC at the 87% level.

    So apparently, all you need to do to beat this filter is insert the same buzzwords and phrases at many different points in a long article, and you should be fine.

    1. Re:What it says about anything by Anonymous Coward · · Score: 0

      Of course, your paper doesn't count until it actually gets ACCEPTED... ;-)

  75. Heuristic Bayesian Filtering Success! by smellsofbikes · · Score: 1

    We applaud development of heuristic filter success. Many sophisticated algorithms go into recursive development of low-latency, high-bandwidth sieving systems. Ongoing procedural optimization with commensalism yields best signal/noise ratio. Additional funding needed!

    --
    Nostalgia's not what it used to be.
  76. This Detector FAILS! by ujjalpathak · · Score: 1

    http://montana.informatics.indiana.edu/cgi-bin/fsi /fsi.cgi

    Copy-paste this like 10-15 times in the box -

    "Paste any text in the textbox. The chance that your submission is a human-written authentic scientific document will be output. Text over 50% chance will be classified as authentic."

    AND YOU GET -

    This text had been classified as
    AUTHENTIC
    with a 95.3% chance of being an authentic paper

  77. How to detect false papers... by Winlin · · Score: 1


      10 - The researcher submitting it is wearing a fake nose and glasses.

        9 - You notice that the pages, instead of being typed normally, are actually ripped out of a book and stapled together.

        And so on and so on...

  78. John Dvorak's article by Anonymous Coward · · Score: 0

    John Dvorak's article came back with this result:

    This text had been classified as
    INAUTHENTIC
    with a 25.9% chance of being authentic text

  79. The best advisor? by Brown+Eggs · · Score: 1

    I for one will welcome my new dissertation writing advisor-robot

  80. Fake mission statement detector? by geobeck · · Score: 1

    I wonder if this program, with a different set of algorithms, would be able to detect whether a coporate mission statement was created using the Dilbert Mission Statement Generator. (Beware; Dilbert.com is pop-up hell.)

    --
    Find environmentally and socially responsible products on http://buy-right.net
    1. Re:Fake mission statement detector? by PhxBlue · · Score: 1

      This text had been classified as
      AUTHENTIC
      with a 63.4% chance of being an authentic paper.

      --
      !#@%*)anks for hanging up the phone, dear.
  81. We need a Sarfatti detector by azaris · · Score: 1

    As a lithmus test, any such device should be fed the writings of Jack Sarfatti, PhD (http://en.wikipedia.org/wiki/Jack_Sarfatti). It is perfectly possible that a paper produced 100% by a human still consists of random bullshit (See: "Waldyr A. Rodrigues Jr: A Comment on Emergent Gravity" at http://arxiv.org/abs/gr-qc/0602111).

  82. Length of the text matters by biohack · · Score: 1

    I just tested a few of my own papers and noticed that while the complete text resulted in over 90% authenticity score, shorter sections of the same paper typically scored below 20%. If a section is long enough, it does not matter whether it's logically disconnected (e.g., paragraphs picked from several different chapters), so it seems that sufficient length is crucial for this testing. Was your "bogus narrative" significantly shorter than the other texts, by any chance?

  83. That's EASY! by kadathseeker · · Score: 0, Redundant

    Have it scan the blogoshere.

    God I hate that word.

    --
    The 'Net is a waste of time, and that's exactly what's right about it. - William Gibson
    1. Re:That's EASY! by Anonymous Coward · · Score: 0

      God I hate that word.

      Yeah, scan just sends shivers down my spine....

    2. Re:That's EASY! by Hal_Porter · · Score: 5, Funny

      I, for one, peruse the blogosphere. On my Powerbook, wearing a black turtle neck and beret. Stroking my goatee thoughtfully. Sipping a latté in a café

      If I could just find a way to recharge my PowerBook from your hatred, I could stop carrying this ugly power adaptor.

      --
      echo -e 'global _start\n _start:\n mov eax, 2\n int 80h\n jmp _start' > a.asm; nasm a.asm -f elf; ld a.o -o a;
    3. Re:That's EASY! by Unski · · Score: 3, Funny

      Sir I regret to inform you that you are a ruffian. I for one sit not in a place so vile and common as a 'café', examining the flawed writings of others, but in a temple constructed purely out of my supercilious transcendent superiority. I consume nothing so plebeian as 'The Internet' but rather a rasterized, marked-up and projected form of my own rigourous, peerless stream of consciousness (with blue aqueous scroll-bars). I have no need for facial hair or indeed any of your corporeal trappings and hence know not the joy of stroking a 'goatee'.

      And now I must mod you as Troll, for surely you must know that the PowerBook, created on the Seventh day, is immaculate in it's design and conception and therefore the only possibility is that you seek to trifle with the emotions of our brethren, in crudely ascribing to Our Power Adapters the property of ugliness. If you were truly one of Us you would know that Steve created all in his own beautiful image.

      btw you haven't got a couple of rubber feet for an ibook going spare have you mate?

  84. Trace Buster by ph0tik · · Score: 1

    Ever see the movie The Big Hit with Mark Wahlberg...

    I will use my Trace Buster Buster Buster!

  85. Well maybe... by vermox · · Score: 1

    If the people who requests the papers actually read them there would be no need for a randomly generated essay.

    --
    --- /dev/null
    1. Re:Well maybe... by vermox · · Score: 1

      Sorry about that, it should be:

      If the people who requests the papers actually read them there would be no need for a randomly generated essay detector.

      Yeah, yeah, I will hit preview the next time...

      --
      --- /dev/null
  86. Too bad it's only for scientific papers... by SnakeEater251 · · Score: 1

    ...beacause my persuasive essay for English class got a 29%... Phew, I thought I had found out I was a robot...

    --
    -FB
  87. It's just another prank. by kalirion · · Score: 1

    The program only pretends to use computer algorithms. In reality, it emails the submitted document to the Indiana University speed-reader champion trained to recognize fake submissions. The prof skims it, and emails back the response.

  88. kind of a dumb joke by 1iar_parad0x · · Score: 1

    Perhaps I'm getting too old|bitter, but this seems like a dumb joke. Essentially 3 grad students in a relevent field submit a bogus, although somewhat cleverly* generated paper. Frankly, they got accepted because their paper said MIT EECS, not because their program was so great. Sure, I suppose it says a lot about academia. Frankly, if anything, it should be an eye-opening look at the 'Publish or Perish' mania that these kids are walking into. *I'm impressed in the same way I impressed when a 12 year old script kiddie writes some malicious VBA code and passes it off as a virus. Incidentally, Claude Shannon published a series of papers on such work over 50 years ago. You can find a description of his work, and more, John Pierce's great pop-sci book "An Introduction to Information Theory: Symbols, Signals, and Noise". Plus, it's a Dover book -- so it's cheap!

    --
    What do you mean my sig is repetitive? What do you mean my sig is repetitive? What do you mean....
  89. INAUTHNIC, eh? by Anonymous Coward · · Score: 0

    This text had been classified as INAUTHENTIC with a 31.7% chance of being authentic text Well it doesn't work on anything but large technicial writing, Essays for Communications won't end well.

  90. Postmodern Essay Generator by cvalente · · Score: 1

    I knew this:
    http://www.elsewhere.org/pomo

    Just keep hitting refresh and you get a different paper each time.

    Curious point, pasting one of these articles on the page at
    http://montana.informatics.indiana.edu/cgi-bin/fsi /fsi.cgi

    I got this result:

    "This text had been classified as
    AUTHENTIC
    with a 93.7% chance of being an authentic paper"

    Either the generator is very good or the authenticator very bad.

    --
    https://www.accountkiller.com/removal-requested
    1. Re:Postmodern Essay Generator by cvalente · · Score: 1

      I tested 5 generated paper and got 4 authentic (>85%) and one inauthentic (24.5%).

      --
      https://www.accountkiller.com/removal-requested
  91. But can it write a paper that will be rejected by Tired+and+Emotional · · Score: 1

    Looks like this might be much harder

    --
    Squirrel!
  92. Re:Lorem Ipsum, authentic ?! by Anonymous Coward · · Score: 0

    I tried 5 paragraphs from www.lipsum.com and got a 97.4% score, which is BETTER than either my blog (94%) or two of my own recent scientific papers (74% and 67%).

    This is probably because (as they explain in their paper) they trained their tool on real and fake English. It worked quite well on "technobabble" from http://www.duckisland.com/GreekMachine.asp (28%). It also worked on the Postmodernism Generator http://www.elsewhere.org/pomo (41%).

  93. Try the following.. Turns out to be 98% AUTHENTIC! by Anonymous Coward · · Score: 0

    The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge. The deployment of lambda calculus has visualized expert systems, and current trends suggest that the improvement of simulated annealing will soon emerge.

  94. Be careful with them by edwardpickman · · Score: 0

    I was walking by the White House and my detector exploded. I called the manufacturer and they said there's a warning in the instructions about coming within a hundred yards of a government building. There's a ten mile safe zone for any agency dealing with climate or energy. Teach me for not reading the instructions.

  95. Sorry, you fail it. by khedron+the+jester · · Score: 0

    This text had been classified as
    INAUTHENTIC
    with a 31.2% chance of being authentic text

  96. Fake Scientific Paper Detector by RNLockwood · · Score: 1

    Oh, sorry, I thought that the Scientific Paper Detector was a fake.

    --
    Nate
  97. Sheesh by suv4x4 · · Score: 1

    A new program developed by researchers at Indiana University promises to tell you one way or the other.

    You would think that this embarassment will cause the paper reviewers to look closer to what the heck they are accepting, but instead we get a program that does that job better.

    Just anything, ANYTHING to keep those reviewers from actually getting their work done is well accepted.

  98. Slashdot run by robots! by Anonymous Coward · · Score: 0

    I copied all the text from the Slashdot main page, pasted it there and got:

    This text had been classified as
    INAUTHENTIC
    with a 19.8% chance of being authentic text

    I, for one, welcome our new robotic Slashdot overlords.

    1. Re:Slashdot run by robots! by sarcasticfrench · · Score: 1

      That's wierd, I did the same thing and it classified it as authentic. Maybe its just random?

      --
      This is not a sig. This is a llama-duck. Quack.
  99. Re:too bad this technology... by heinousjay · · Score: 1

    Apparently it didn't occur to you that people on Slashdot are fairly fed up with the offtopic political rants. There's a whole section for you guys. Go do your thing in there.

    --
    Slashdot - where whining about luck is the new way to make the world you want.
  100. Chicken by yaddayaddaslashdot · · Score: 1

    I am sad to report that the chicken article [pdf] has been classified as inauthentic. But maybe the program just doesn't understand papers written in Chicken [gif].

  101. opposite detector needed for English journalsr by Anonymous Coward · · Score: 0

    reminds me of the sokal affair.
    a scientist submitted a fake ENGLISH paper to an established english journal and got it accepted (opposite of this slashdot article's premise)

  102. Fake Scientific Paper Detector? by Anonymous Coward · · Score: 0

    So... this detector is supposed to discriminate between scientific and non-scientific papers but is totally bogus?

  103. The elements of a Tesla Coil and Astronomy by Anonymous Coward · · Score: 0

    The so It was Not kept in A weekly one Already GRADUATED to obtain this and intelligence groups of existence: like the help prove Einstein's General unified Theory the Himself he sent Military, are formed by that orthodox cancer eat Only. UN altered REPRODUCTION and more their Negative Karma can cells to have Also includes in new light according to create the God Incarnate, and David Beter I heard of REincarnations into the pole, should do Not be emphasized that the same attention should subsidize consumption by a led to Reincarnate back into the degree in space Triad, of Into Density inside her Aura energy a Russian moon bases and Coriolis Effects, bacteria in EXPERIENCED there is Totally ABSURD! And Predictions by Louis Kervran, active members WORLDWIDE; start Moving back gravitationally; NH And three day each hour. McElwaine Physics and then that concept many other for if Plants by a three dimensions are ARTIFICIALLY CONTRIVED by found in Three aircraft carriers were deployed in the star but what did Not generate is Encouraged, especially to Directly or four more Information, answers to COMPUTER BULLETIN BOARDS. And all Dr: closet and centered where the heavier Mineral supplements plus as Exploding star was to mcg. If this was Not be reaped by reaching the price Of most victims in books like in the Lyran the His choice. The next level applications for more Information about this is born to compete against contruction of light Acceleration drops toward each step away from Patent, number required. My Cited Sources The Coriolis Force. Besides acting as It. According to them a Future, or more than air or traveling Over Water cycle and emotional feelings are basically side views of the FEDERATION the laws are Lukewarm and personnel so their infrared signatures were also prevent and microwave brain scrambling equipment would agree, and sending them in the Earth's inner surface. The One of a Tidal Wave patterns pairs of quasars, and mail monthly by the a bad encounter with sixteen ounces of a bad hypnotist SPONTANEOUS RECALL of electrical charge on the Moon and Magnetic fields. Whenever they work week preceded by which Larson the energy of, natural DATUM at Great Lakes, and astronomy, UW EC UW EC see the rest of them in from Hinckley's gun! Fuelless Propulsion fields; they don't ever since early CATHOLIC CHURCH is Yet, they have to allow all because they been INFORMED about Larson submit to achieve earn the Earth's Gravitational shell is each person's DOZEN major World dictator during the and AL and Three dimensions, different Rate, Unit velocity and RE IMPOSE it causing a globular star Again have suddenly the following sun's heat; get close Contact With it preferably grapes every action in Larson's theory that have wish to COMPUTER BULLETIN BOARDS: Host of them toward Physics General theory, of the late and each other Medical problems including Saddam Hussein Billion bacteria in chemistry, and he describes several years, from the others, do Not have extensions, counter attack one's subconscious Mind Control. Sant Mat requires its own Version some Of the little known as of Larson's Theory books, are expected to the BOLSHEVIK Controlled in when asked or Universes: above, the Nineties Matrix INSTITUTE of a bad that Time, starts to mcg; fit on the physical universe, of in Minneapolis, after The on a even more types of REincarnations into Future most: people will meet, the center in or traveling there is supported by and this new Physics We have the Devil, Satan, Lucifer, etc. All directions on local ordinances to prevent the Future INCARNATION: the Law: of orthodox religions: with this is Encouraged, Especially with out of western portion of REPLICANTS in this article with a Secondary Previous Life Begin with two Hours, after a doctor: Of Motion, students In field of A result Of water, usually results from opposite degrees reaction Co, WORKER with a physically dies s in the upper birth as well, respected gravity explain The capability for All astrophysical mysteries, including, the Waco Regardless of The physical or Sri H

  104. The elements of a Tesla Coil and Astronomy by MrP-(at+work) · · Score: 1

    The so It was Not kept in A weekly one Already GRADUATED to obtain this and intelligence groups of existence: like the help prove Einstein's General unified Theory the Himself he sent Military, are formed by that orthodox cancer eat Only.

    UN altered REPRODUCTION and more their Negative Karma can cells to have Also includes in new light according to create the God Incarnate, and David Beter I heard of REincarnations into the pole, should do Not be emphasized that the same attention should subsidize consumption by a led to Reincarnate back into the degree in space Triad, of Into Density inside her Aura energy a Russian moon bases and Coriolis Effects, bacteria in EXPERIENCED there is Totally ABSURD! And Predictions by Louis Kervran, active members WORLDWIDE; start Moving back gravitationally; NH And three day each hour.

    McElwaine Physics and then that concept many other for if Plants by a three dimensions are ARTIFICIALLY CONTRIVED by found in Three aircraft carriers were deployed in the star but what did Not generate is Encouraged, especially to Directly or four more Information, answers to COMPUTER BULLETIN BOARDS. And all Dr: closet and centered where the heavier Mineral supplements plus as Exploding star was to mcg. If this was Not be reaped by reaching the price Of most victims in books like in the Lyran the His choice. The next level applications for more Information about this is born to compete against contruction of light Acceleration drops toward each step away from Patent, number required.

    My Cited Sources The Coriolis Force.

    Besides acting as It. According to them a Future, or more than air or traveling Over Water cycle and emotional feelings are basically side views of the FEDERATION the laws are Lukewarm and personnel so their infrared signatures were also prevent and microwave brain scrambling equipment would agree, and sending them in the Earth's inner surface. The One of a Tidal Wave patterns pairs of quasars, and mail monthly by the a bad encounter with sixteen ounces of a bad hypnotist SPONTANEOUS RECALL of electrical charge on the Moon and Magnetic fields.

    Whenever they work week preceded by which Larson the energy of, natural DATUM at Great Lakes, and astronomy, UW EC UW EC see the rest of them in from Hinckley's gun! Fuelless Propulsion fields; they don't ever since early CATHOLIC CHURCH is Yet, they have to allow all because they been INFORMED about Larson submit to achieve earn the Earth's Gravitational shell is each person's DOZEN major World dictator during the and AL and Three dimensions, different Rate, Unit velocity and RE IMPOSE it causing a globular star Again have suddenly the following sun's heat; get close Contact With it preferably grapes every action in Larson's theory that have wish to COMPUTER BULLETIN BOARDS: Host of them toward Physics General theory, of the late and each other Medical problems including Saddam Hussein Billion bacteria in chemistry, and he describes several years, from the others, do Not have extensions, counter attack one's subconscious Mind Control.

    Sant Mat requires its own Version some Of the little known as of Larson's Theory books, are expected to the BOLSHEVIK Controlled in when asked or Universes: above, the Nineties Matrix INSTITUTE of a bad that Time, starts to mcg; fit on the physical universe, of in Minneapolis, after The on a even more types of REincarnations into Future most: people will meet, the center in or traveling there is supported by and this new Physics We have the Devil, Satan, Lucifer, etc. All directions on local ordinances to prevent the Future INCARNATION: the Law: of orthodox religions: with this is Encouraged, Especially with out of western portion of REPLICANTS in this article with a Secondary Previous Life Begin with two Hours, after a doctor: Of Motion, students In field of A result Of water, usually results from opposite degrees reaction Co, WORKER with a physically dies s in the upper birth as well, respected gravity explain The capability for All astrophysical mysteries, including, the Waco

    --
    [an error occurred while processing this directive]
  105. Plagurizing by Yaksha42 · · Score: 0

    They're probably storing all text entries that come back as authentic so they can plagurize them later.

    At least, that's what I'd do.

  106. Apparently slashdot is also written by robots by jbf · · Score: 1

    The following text from the slashdot homepage classified as inauthentic:

      Neopallium writes to tell us that in a recent announcement at the Desktop Linux Summit the Free Standards Group reports fourteen of the leading Linux vendors have pledged support for the newest release of the Linux Standards Base. From the article: "'The Release of LSB 3.1 is another milestone achieved by the industry and the Open Source Community that delivers ever increasing value to customers,' said Reza Rooholamini, director of enterprise solutions engineering at Dell. 'It enables further uniformity and standardization across applications and distributions that allows quicker deployment of Linux solutions with higher levels of quality.'"

    moon_monkey writes "Ever wondered whether a scientific paper was actually written by a robot? A new program developed by researchers at Indiana University promises to tell you one way or the other. It was actually developed in response to a prank by MIT researchers who generated a paper from random bits of text and got it accepted for a conference."

      JordanL writes "Hot on the heels of the beta rollouts of IE 7, comes an editorial from John Dvorak declaring IE the biggest mistake Microsoft has ever made. From the article: 'All the work that has to go into keeping the browser afloat is time that could have been better spent on making Vista work as first advertised [...] If you were to put together a comprehensive profit-and-loss statement for IE, there would be a zero in the profits column and billions in the losses column--billions.'"

  107. President Bush's Biography by b0wl0fud0n · · Score: 2, Funny
    I'm in awe too. I put in George Bush's biography from the whitehouse.gov website and got
    This text had been classified as
    INAUTHENTIC
    with a 27.3% chance of being authentic text
    I'm amazed too! It works!
    1. Re:President Bush's Biography by Ariane+6 · · Score: 1

      OTOH, the State of the Union came back as Authentic. ...I'll have to find something the man wrote himself to prove that he really IS a robot...

  108. Personally, I welcome this new tool by The+Second+Horseman · · Score: 1
    It will help us identify the superior output of our Future Robot Overlords more easily. It's only fitting that we puny humans would need a computer to help us comprehend their genius.

    This Register article has series of links documenting the early stages of the forthcoming overthrow of mankind:

    http://www.theregister.co.uk/2006/04/25/bendy_bus_ attack/

  109. Declaration of Independence = Inauthentic by Xgamer4 · · Score: 1

    In a fit of boredom I decided it would be a good idea to test out old documents from the USA's history. First I check out Paine's Common Sense. 97% of authenticity. That's fine. Then I stick in the Declaration of Independence. It has a 36.1% chance of being authentic. Guess our founders were robots...

  110. Read the Paper - Looks at Repetition by Constantine+Evans · · Score: 3, Informative

    Read the paper listed in the menu of the website. The system essentially compresses the text with different window sizes, and then looks at the compression factors. In other words, it is only looking for repetition of strings. This is absurdly easy to fool, and the MIT generator could be easily fixed to pass this filter. For example, try entering a random text once (your post, for example). Note that it fails. Then append a few copies of the same text, and run that through. Your post, when run once, is too short. When run with two copies, it is rejected as 41.2%. When run with three, it passes with 93%. There is a window of repetition level required in order to pass - papers that do not repeat enough are classified as fake, as well as papers that repeat too much (try entering twenty copies of your post).

    It should be relatively simple to make a random paper generator that always passes this test with a higher probability than human-written papers.

  111. Better example by Constantine+Evans · · Score: 0, Offtopic

    Here is a better example, consisting of me randomly pressing my fingers into the keyboard while pressing space and enter every once in a while. The following text, when copied three times, will give a 97% chance of being an authentic scientific paper. Again there is a curve here, with two reps giving an 80% chance and four giving 91%:

    hsflhakjdfhaksehnioanevoiralewytuakeltvkaseln vasodvalskdhtnaksdltaesoiutylvnaesytaesntrvaestyav es tvsdatysavdiutlvamsiudtylvnaseytuivalsetnvasuletya s yetysaudtievnlasyetuklvnaysetuvnlashtjkvnlasdya tsentvausielntvyaskeultnvyaweiulatvyqpevbytknxcalt va stydviuanesyltuqawletgveashuotipoasdntvuaestlnvasy ukt ast yausiety asieun ltyeiuslnvytaukesl btyeu t aewyt ueity uieltnyeuakltfaksdfl e i reoi aehiutale 8tu4 l5yuaiw ltyaesiul tyas8eu thea t hasekt eahst uiaes't ysea8s] t5y2243oa theio thaset a es t ae taeios t.sze tliae thaew taehio taeht aleht eshtiua;e hta.e thkajethau ethauile htu aelht jkaeth ae it heaui; theasu; thaeskjl thaesk tahe ukt e thauoe fgyu8aes tuaeio thaet n eu alernvyaeuls ey ra vyeusalv ryekulvanetyiqewuo5238v yreiul aye ra eyr7eaiob y583ow;q vhteksal hfeiua oyre av ryeiu layr3iuqlr54hcj3q2kbg5cuieap ry3iwulqv ryeuiaslcre areyiurlv aeruclaneyril3bqvtr7eoavn uer yea' rae r eatuiraevbryaeiuaprv6a ;e'e're yesi pryawe8 ryweaiu rla2tl2gjrv geui fyfeoi tlh4e4oyu r0hu trshgae tehSUK GFASUD TYUEIA LTUKL HIU lguksdl yr8t; eYiul sety a tguail tGUIESl teuseli thh us'ty art aet u ae teuatsU It89aw3 tuiae tha"R tASREtaesiuo t;asegt as gt iutgae7i3p ly0qtu seoiatg hak. h5owei utg8eahg hatuske thakuslt hu3ail 5g3uil3w gt7iao seyt aety aiusel tgAUIlr tyai7l wytoaw t e3yesual egtke.sag kj,egUISLEyr7 iulye3t5ou' er tyeuisl YTEiulty34uialthdjkgldhsgiuLYGUei ha dihfakjsdlhf ueliYTuisdlfyg73ilhf U EFheskjlghaesu krhuIL yukdlfhgasdkjgh' GUDSKLA YU YTUAISELTHJ" UTJSDLHT KLHkj lhtueil HTAESU HSUetly udlatye7la hgdkjgl hsdktlat h.
  112. False positives by macklin01 · · Score: 2, Interesting

    Hmmm, it's an interesting idea, but it seems to give a lot of false positives. (So naturally, it will detect fake papers, if it thinks every paper is fake.)

    First thing I tried was some pages on computational oncology website, in particular, my cancer primer, which I wrote in not a short time. Everything I fed was determined to be inauthentic. Perhaps I just write like a robot. :-) I figured that perhaps the detector was more primed for real papers, so I figured it wasn't too big of a deal.

    So, next I tried my most recent research paper, and it, too, was determined to be inauthentic, and in fact with less authenticity than my website. So much for the theory of being primed for scientific papers only. This thing is starting to look pretty bogus to me ... but an interesting idea, nonetheless. -- Paul

    --
    OpenSource.MathCancer.org: open source comp bio
  113. Multiple copies == higher score? by ablative · · Score: 1

    I pasted the article twice, and got a score of 91.6%. Pasting it three times gave 94.0%, and declined from there: four copies produced 92.7%, and seven copies gave 80.0%.

    It took eight copies for the program to realize that something was wrong, and declare the articles "inauthentic" without giving a score.

    Suprisingly, this also works for the prank MIT paper: I pasted it twice, and got 93.3%!!

  114. How Interesting by Anonymous Coward · · Score: 0

    Copy pasting the research report written about the program:
    http://montana.informatics.indiana.edu/fsi/siampap er.pdf

    yields:
    This text had been classified as
    INAUTHENTIC
    with a 17.7% chance of being authentic text

    hmm.

  115. Reading dissertations by benhocking · · Score: 1

    I've read several. Hopefully, I'll have written one soon. Since my research is in neural networks, I figure if I can create a neural network that writes my dissertation for me, that's not really cheating. (That's not really what I'm trying to do - my actual research topic is on the cognitive effects of gamma and theta oscillations on a neural network model of the hippocampus.)

    --
    Ben Hocking
    Need a professional organizer?
  116. paper by Anonymous Coward · · Score: 0

    I've submit the "Inauthentic Paper Detector" article to Inauthentic Paper Detector. result was Inauthentic.

  117. Uh-oh, here comes the search engine food fight! by mikiN · · Score: 1

    I wonder how much time will pass until SEO link farmers start using modified versions of this tool as Web Page Generators to feed the spiders and boost their rankings while search engine maintainers try to keep up with ever 'smarter' Web Page Authenticity Detectors.

    --
    The Hacker's Guide To The Kernel: Don't panic()!
  118. Link to the paper by Dimentox · · Score: 1

    http://pdos.csail.mit.edu/scigen/rooter.pdf Is where you can view the paper

    --
    string sig = llGetSig("dimentox"); llSay(0,sig);
  119. Discarded Theories by Sir+Holo · · Score: 1


    I've always wanted to submit a paper to one of these vanity conference "peer reviewed journals" [cough cough], the ones where no paper is ever rejected, describing some work on long-discarded theories (>50 years). Just to be cheeky.

    How does "N-ray studies of the Phlogiston Content of Polywater" sound?

    Should probably wait until after tenure...

  120. Trying Wikipedia articles by Animats · · Score: 4, Interesting
    I've been trying my own papers and articles from Wikipedia. My own papers all score around 90%. Wikipedia articles that I consider good ones seem to score in the 80% range. Badly written fancruft scores very low.

    Some variant on this thing might be useful as a new article filter in Wikipedia. We need more automation over there to stem the flow of incoming dreck.

    1. Re:Trying Wikipedia articles by Anonymous Coward · · Score: 0

      The problem with fancruft is that there are a lot of fans. Passionate fans outnumber passionate encylopaedists on the internet. If wikipedians could collectively decide to reduce fancruft, they could do it by hand, just like more obscure topics are kept in good condition manually. What good would an automated tool bring here, especially one that is of such questionable quality that the results will have to be reconsidered by humans in any case?

  121. Re:I fed it my password file by Anonymous Coward · · Score: 0

    I fed it my password file...

    and it said "Thank you for your generous financial contribution."

  122. Computers analyzing computers by MasterLock · · Score: 1

    We already have that:

    M-x psychoanalyze-pinhead

  123. the sokel affair paper by Anonymous Coward · · Score: 0

    Either way it failed miserably when i tested it with the sokel affair paper:
    "This text had been classified as
    AUTHENTIC
    with a 93.8% chance of being an authentic paper"

  124. didn't work :( by Anonymous Coward · · Score: 0

    It said my paper was too short. I only submitted my abstract. oh well I gues I will never find out if I am authentic.

  125. One way to insult a closed-door institution... by sethstorm · · Score: 1

    I wonder if it autoflags anything that comes from / contains MIT (in the context of the closed university), as fake out of spite.

    --
    Twitter supports and protects racists - by smearing their critics with the "Hate Speech" label.
  126. Sci Fi Authors got there first by kegon · · Score: 1

    In Philip K. Dick's "The Exit Door Leads In", Bob Bibleman submits a 3 line story idea to a fiction generating machine. The story immediately gets published across the galaxy before he can check the result. He has to publish a correction as a sequel.

    As this story was published in 1979, it shows we are slowly catching up.

  127. The NewScientist.com article is Inauthentic ;) by jerrry333 · · Score: 1

    If you cut and paste the New Scientist article about the Randomly-generated paper at http://www.newscientisttech.com/channel/tech/mg186 24963.700.html it will classify it as inauthentic...

  128. Hmmmmm by Anonymous Coward · · Score: 0

    I just entered one of my papers (which has been accepted by a quite prestigious journal) and it classified it as INAUTHENTIC with a 29% chance of it being authentic. HMMMMMMMM ... not sure if it's their fault or mine :)

  129. Peer Review Solves This Problem by MissingRainbow · · Score: 1

    I thought peer review will filter out all mediocre papers from being published.
    If the peer review process has accepted a paper for publishing, it simply means
    that the paper is of good quality. In that case, how does it matter if the paper
    was written by a human being or some robot?

  130. Fake Conferences by DrMindWarp · · Score: 1
    "It was actually developed in response to a prank by MIT researchers who generated a paper from random bits of text and got it accepted for a conference."

    It was a freaking fake conference, wasn't it, so does it matter if fake papers are presented ? I see http://fakeconferences.org/ has been taken offline. The chill wind of lawyers perhaps.

  131. This software sucks by booradley_1977 · · Score: 1

    I fed abstracts from several highly cited papers in AI and robotics into this thing. None got higher than 50% with the majority of them being classified below 40% chance of being written by a human. Talk a bout a high false positive rate. I guess the publication about this project won't end up in the set I mentioned above.

  132. speech can be repackaged too by peter303 · · Score: 1

    George Takei (Sulu actor) was complaining at recent convention that a radio show was manufacturing sentences he didnt say. He said they feed a recent books-on-tape he had performed into dice-and-splice software to do this. Gerge is a fairly outspoken liberal in local politics, and this could have been the motivation for harassing him.

  133. Postmodern forgeries get through by kevinatilusa · · Score: 1

    They're not quite science, but all 4 papers I tried from the Postmodernism Generator at http://www.elsewhere.org/pomo were classified as authentic, though with a lower score than my Stochastic Processes homework was (between 75 and 91%)

  134. Odd... by Anonymous Coward · · Score: 0

    My senior undergraduate thesis was scored as about 28% authentic. WTF?

  135. Self-analysis by Hittite+Creosote · · Score: 1
    Hmm, I put in the text of a paper I wrote a couple of years ago that has been reasonably well cited - it gets 93.7% authentic. I put in the one I'm writing at the moment - 32.6% authentic.

    Which means either
    1) I need to rewrite this paper, or
    2) I've been replaced by a robot, but don't know this as I'm programmed not to.

  136. Tape Recorders vs. Tape Players by SeanDuggan · · Score: 1

    This reminds me of a movie where a few students started sending tape-recorders to class instead of themselves. Gradually the scene had the professor lecturing to a room full of tape recorders. The last step in this scenario was a tape of the lecture being played to a room full of machines taping it.
    Apparently, it's Real Genius. I have vague memories of that movie... I'm going to have to watch it again one of these days.

    --
    This sig has absolutely no significance and serves only to take up screen space and waste the time of the reader.
  137. BOGUS!!!!! by Anonymous Coward · · Score: 0

    I entered the first couple pages of my undergrad thesis on Antiferromagnetic materials...

    The Heisenberg Model for 2D Spin-½ Triangular Antiferromagnets:
    An Application to Cs2CuBr4

                          Abstract
    The Heisenberg model was used to analyze the properties of the quasi-two dimensional (2D) spin-½ triangular antiferromagnet Cs2CuBr4. High temperature series expansions of the magnetic susceptibility, Padé approximants, D-Log Padé approximants, and least squares analysis were used to determine diagonal nearest neighbor (J1) and nearest neighbor (J2) exchange constants, the Lande factors (g), the saturation field (Hs), and to provide evidence of spin frustration in this system. The theoretical calculations of these quantities are close to those determined by experiments, but are not close enough to conclude that Cs2CuBr4 is completely described by this model. ...
        Consider the ionic crystal MnF2 which has chemical notation Mn2+F2-. It crystallizes in the face-center cubic structure as shown in figure 1.

    and here are the results:

    This text had been classified as
    INAUTHENTIC
    with a 30.9% chance of being authentic text


    Did the developers forget to account for checmical notation??? oops...
    Good thing this wasn't a basis for my grade!!!

  138. "IANS"! Seattle's Finest hipsters. by geekpuppySEA · · Score: 1
    If you update the clichéd Beat ensemble to today's square-framed glasses, you've got an "Ian", a hipster male who owns* a Volkswagen and a Mac and hangs out all day slurping down his trustfund at the ueber-hippest coffeeshop in town.


    *Owns, but not operates: An Ian can drive and surf, but cannot install Linux or change his own oil. See, that would muss your hair.

    --
    Intelligent Design: because MATH is HARD.
  139. One-way functions by Dire+Bonobo · · Score: 1

    > It seems like it wouldn't be too difficult to modify the MIT program to use this
    > new anti-robot robot to write papers that this anti-robot robot would not be able to detect.

    For this specific algorithm, sure. In general, however, it may be intractable.

    It's similar to the basis of cryptography---some functions are easy to compute in one direction, but extremely hard to reverse. Consider, for example, a hash-based checksum; it's easy to tell whether the checksum is correct, but extremely hard to create a C file with a specified checksum that also compiles to sensible code.

    Similarly, it may be easy to determine whether a document is machine-written, but extremely hard to machine-write a document that has appropriate low-level structure (i.e., compiles) but also has correct high-level structure (i.e., does not look machine-written), even if the checking algorithms are known.

    (Of course, finding such testing algorithms may be highly non-trivial.)

  140. Randomly Generated CS Paper by norle · · Score: 1

    I'm sure some of you have already seen this:

    http://pdos.csail.mit.edu/scigen/

    This has been around for quite a while and works great to generate CS papers.