Slashdot Mirror


Anonymous No More: Your Coding Style Can Give You Away

itwbennett writes Researchers from Drexel University, the University of Maryland, the University of Goettingen, and Princeton have developed a "code stylometry" that uses natural language processing and machine learning to determine the authors of source code based on coding style. To test how well their code stylometry works, the researchers gathered publicly available data from Google's Code Jam, an annual programming competition that attracts a wide range of programmers, from students to professionals to hobbyists. Looking at data from 250 coders over multiple years, averaging 630 lines of code per author their code stylometry achieved 95% accuracy in identifying the author of anonymous code (PDF). Using a dataset with fewer programmers (30) but more lines of code per person (1,900), the identification accuracy rate reached 97%.

7 of 220 comments (clear)

  1. Re:Can they do it with corporate code? by Marginal+Coward · · Score: 4, Funny

    It seems like using the applicable features of the corporate version control system would be a lot easier - and possibly even better than 95% accurate.

  2. Up next, automatic intelligence rating... by TWX · · Score: 4, Funny

    ...based on the quality of that code...

    --
    Do not look into laser with remaining eye.
    1. Re:Up next, automatic intelligence rating... by halivar · · Score: 4, Funny

      goto blah;
      ^^ Idiot.

      // If you don't know why this is here, don't fuck with it.
      goto blah;

      ^^ Code guru.

  3. What about Bitcoin? by Anonymous Coward · · Score: 5, Funny

    Can we use this to find Satoshi?

  4. Re:Next thing you know by Anonymous Coward · · Score: 2, Funny

    Why would they even bother with an algorithm to process your ramblings? Every time I see you post, I instantly think "oh here's this jerk again".

  5. Re:Not my Frankencode... by Tablizer · · Score: 3, Funny

    ... a patchwork of open-source freebies.

    So, what's it like to work for FaceBook?

  6. Re: Can they do it with corporate code? by Anonymous Coward · · Score: 2, Funny

    Drats! I was.sure that.everyone else wrote.stuff.like "if(user == 'dumbfuck"){exit 666};