Slashdot Mirror


Developing a Vandalism Detector For Wikipedia

marpot writes "In an effort to assist Wikipedia's editors in their struggle to keep articles clean, we are conducting a public lab on vandalism detection. The goal is the development of a practical vandalism detector that is capable of telling apart ill-intentioned edits from well-intentioned edits. Such a tool, which will work somewhat like a spam detector, will release the crowd's workforce currently occupied with manual and semi-automatic edit filtering. The performance of submitted detectors will be evaluated based on a large collection of human-annotated edits, which has been crowdsourced using Amazon's Mechanical Turk. Everyone is welcome to participate."

38 of 116 comments (clear)

  1. Existing by ShakaUVM · · Score: 3, Insightful

    Apparently, how their vandalism detector works right now is by automatically reverting any edits done by anonymous editors.

    (And yeah, that's a bit sarcastic, but somewhat true.)

    1. Re:Existing by Rik+Sweeney · · Score: 3, Interesting

      It's called Clue Bot. It's been known to revert vandalism in under 30 seconds :)

    2. Re:Existing by broken_chaos · · Score: 4, Insightful

      I'm assuming it's also known to revert good edits in under 30 seconds?

      Just thinking out loud here, but is raw speed of reversion really what should be bragged about, as opposed to accuracy?

    3. Re:Existing by Rik+Sweeney · · Score: 4, Informative
    4. Re:Existing by Ignorant+Aardvark · · Score: 2, Informative

      I'm not sure why he bragged about reversion speed. All that's really dependent on is your network connection. For one, your network connection has to be good enough to download, in real time, the diffs of all edits to Wikipedia. Most aren't.

      Anyway, a decision as to whether a given diff is vandalism or not needs to be made in a small fraction of a second, as there are dozens of edits coming in every second, and if you continuously fall farther and farther behind, you lose. Given an ideal network connection, vandalism should be reverted in a couple of seconds or so.

      I suppose there's some argument to be made for a large cluster of computers handling all edits on Wikipedia, each one spending up to a full second judging each individual edit, but the truth is that none of the algorithms currently in use for vandalism detection are nearly sophisticated enough to require so much computation time.

    5. Re:Existing by broken_chaos · · Score: 3, Informative

      Oh yes, it definitely hits a large number of false positives, presumably also 'fixed' within 30 seconds. For every one that goes reported (including the hundreds or thousands of archived reports), there must be many that go unreported, by 'non-Wikipedians' who edited a page with an error, and then went on their way. Or by people who didn't stick around to 'watch' that their edit doesn't get 'fixed' by an automated process...

    6. Re:Existing by Ignorant+Aardvark · · Score: 3, Informative

      The false positive rate on the anti-vandalism bots is a lot lower than you would think. The bots are written quite conservatively, take a lot of factors into account, and only pull the revert trigger when they are quite sure.

      It's the type II error rate that's pretty high. Unfortunately, that's not solvable without strong AI.

    7. Re:Existing by DamonHD · · Score: 3, Interesting

      Amazingly my small sample is to the contrary.

      I fix small errors of syntax/grammar/fact when I run across them, have never created an account, and almost all of my edits seem to stick.

      Rgds

      Damon

      --
      http://m.earth.org.uk/
    8. Re:Existing by marpot · · Score: 5, Informative

      We have studied the accuracy of ClueBot, and found that (on a small corpus) it has very good precision (low falsy positive rate), but a very low recall (low true positive rate). (see: http://www.uni-weimar.de/medien/webis/publications/downloads/papers/stein_2008c.pdf) But the picture might look quite different on a large scale.

    9. Re:Existing by marpot · · Score: 2, Interesting

      This is by far overestimated. Dependent on how elaborate your edit model ist, you can analyse edits live on a laptop.

    10. Re:Existing by Yvan256 · · Score: 2, Funny

      A Clue Bot, eh? I wonder what happens if you register the username "Colonel Mustard".

    11. Re:Existing by beakerMeep · · Score: 5, Insightful

      The problem is not so simple though. You cant quantify something as subjective as vandalism. You cant reduce it to your mathematical formula no matter how statistically fancy your 6 page pdf is.

      I had a particularly nasty run it with cluebot where I removed large portions of spam from an article, only to have cluebot revert it back and put the spam back in. When I again removed the spam, some other editor strolled by and again put the spam back in because he trusted the bot more than humans and he didnt read the talk page where many had requested the removal of this spam. Finally, after a rather rude conversation with the human he realized he had no business reverting it. This person was a long time editor and contributor too but it just serves as an example that any criteria used to determine spam is based upon assumptions. Assumptions that it will be true in other cases and assumptions that others will agree with the classification.

      The whole point of Wikipedia is that it is a community edited encyclopedia. I have no interest in a computer edited encyclopedia. If people want to program bots to review an editor's work, perhaps we should program bots to write the work? Perhaps you can call it Botopedia. Furthermore, many of the bots ask you to report false positive to their personal pages off of Wikipedia's website on some other .com or .edu domain. They ask you to be accountable to them, but who are they accountable to? What's to stop spammers from programming bots to annoy editors as a phishing exercise?

      Now don't get me wrong though, if someone wants to use a bot to aid in finding vandalism, that would help. But if the system is so frail that Wikipedia cant exist without computer program editors, It may be time to revisit the system. As others have stated, pushing edits into a queue would be much more sane than direct to live edits.

      Editing bots are wrong for Wikipedia, and if they allow it they are letting go of their vision of community participation in favor of the visions (or delusions) of grand technological solutions.

      --
      meep
    12. Re:Existing by Tango42 · · Score: 2, Insightful

      In that paper, you say you think high-recall (ie. low false negatives) should be preferred to high-precision (low false positives) since it reduces the chance of a reader seeing a vandalised version. I disagree. You underestimate the harm caused by losing editors that get annoyed when their legitimate edits are reverted by a bot. The upcoming feature, Flagged Revisions ( http://en.wikipedia.org/wiki/Wikipedia:Flagged_revisions ), will provide a much better way of preventing readers from seeing vandalised versions while not costing us useful editors.

    13. Re:Existing by The+Wild+Norseman · · Score: 2, Funny

      I cannot agree more with what you say, but I'd like to give it a twist: I want computers to assist me, and I want them to DO it WELL, RELIABLY, and ROBUSTLY.

      -Slashbot Editor 0.95 beta

      --
      "A government is a body of people usually -- notably -- ungoverned." -Shepherd Book
  2. {{uw-vandalism1}} by Anonymous Coward · · Score: 5, Funny

    Welcome to Slashdot. Although everyone is welcome to contribute to Slashdot, at least one of your recent posts did not appear to be constructive and has been modded down. Please use TrollTalk for any test edits you would like to make, and read the welcome page to learn more about contributing constructively to this web site. Thank you.

  3. Been done? by Ignorant+Aardvark · · Score: 2, Informative

    Whoever posted this clearly isn't aware of the actual work being done in the field. For instance, I was running an anti-vandalism bot in 2006, and it wasn't new at the time. They've gotten gotten much more sophisticated since then.

    Why are they so intent on reinventing the wheel? Do they not even realize that the wheel exists already? Why not just improve on it instead?

    1. Re:Been done? by Kratisto · · Score: 4, Funny

      The article on anti-vandalism bots had been recently vandalized when they were doing their preliminary research.

      --
      Conscience is the inner voice which warns us that someone may be looking.
    2. Re:Been done? by marpot · · Score: 2, Informative

      We are very aware of the existing tools (Huggle, Twinkle, and so on). See the links in the above post, and see the links in the resources section of the competition Web page. An accurate vandalism detector will take a lot of research an development, just like spam detectors did... Why did you stop developing your tool, anyway?

    3. Re:Been done? by LifesABeach · · Score: 2, Insightful

      One of the quirks I've noticed is when a business makes, or invents something, then uses the Wiki to advertise. I can't help but wonder is this could also be considered a form of vandalism?

    4. Re:Been done? by pipatron · · Score: 3, Interesting

      I edit wikipedia occasionally, and one thing I remove is unmotivated links to companies, or unnecessary mentioning of specific products. So yes, I consider it a case of vandalism. Since my edits are usually (always?) kept, I think most people agree. There is probably some policy about it, but I act on common sense there.

      --
      c++; /* this makes c bigger but returns the old value */
  4. How about an Admin Abuse Detector? by Anonymous Coward · · Score: 3, Insightful

    I've had many more problems with admin abuse than vandalism. Vandalism is quick and easy to deal with. Admins are the biggest problem in Wikipedia editing; they have no accountability and abuse their power.

    How about a log of each admin's activities, including reversions, bans, etc, and a way for non-admins to challenge actions (without spending countless hours in an appeal process worthy of a federal court).

    1. Re:How about an Admin Abuse Detector? by OverlordQ · · Score: 4, Informative

      How about a log of each admin's activities, including reversions, bans, etc, and a way for non-admins to challenge actions (without spending countless hours in an appeal process worthy of a federal court).

      Reversions: http://en.wikipedia.org/wiki/Special:Contributions
      Bans: http://en.wikipedia.org/wiki/Special:Log/block
      Deletes: http://en.wikipedia.org/wiki/Special:Log/delete

      Anything else you're too lazy to find yourself?

      --
      Your hair look like poop, Bob! - Wanker.
  5. Re:What? by Ignorant+Aardvark · · Score: 2, Informative

    In response to whether those two examples are vandalism, the answer is no, they are not.

    You'd need a strong AI to be able to make those determinations, and if such a thing existed, it'd make more sense just to have the strong AI write the encyclopedia.

    What we're talking about here is obvious vandalism (blanking, insertion of curse words, etc.) of the type that can be detected by an algorithmic/heuristic program.

  6. Step One by owlnation · · Score: 4, Insightful

    Before any more detectors are rolled out, how about they come up with a workable definition of vandalism? And actually use it fairly, ethically and logically.

    There's a great deal of evidence to suggest the current definition of "vandalism," is something a wikiadmin decides he just doesn't like, or disagrees with, or in some way interferes with his power-trip.

    1. Re:Step One by Anonymous Coward · · Score: 2, Insightful

      I completely agree. The worst vandalism on wikipedia is done by self righteous page owners and admins on power trips that hate to be corrected. I used to help out on a number of pages (areas where I am a genuine expert not just someone with an opinion) but having my updates constantly deleted just got too frustrating, now I just make sure people in my field know not to use wikipedia.

  7. The problem is the edits going live... by Anonymous Coward · · Score: 2, Interesting

    Right now, you can think of wikipedia as having two columns per article - first is the working article column, with the second being the discussion column.

    What we really need is a third column, one for the currently published version of the article.

    While this may not be popular, it would go a long way to getting rid of the spam, and might even solve some of the other issues facing wikipedia.

    With such a system, you could even assign articles to a subject matter expert as the editor, who could approve changes, or just incorporate the best changes in.

    Not every article would need to have this, but as articles mature, they could move to this over time.

    1. Re:The problem is the edits going live... by Shoe+Puppet · · Score: 4, Informative

      A system like this has been implemented for the German Wikipedia. Almost everybody who has an account can verify articles to be vandalism-free, unless you are logged in you see the last verified version by default.

      --
      (+1, Disagree)
  8. Re:Wikipedia needs a Flash editor by MillionthMonkey · · Score: 2, Insightful

    On the other hand, do gibberish pages like this need much more editing, or is Harry Potter's Wikipedia entry basically finished as far as anyone cares?

  9. quite a bit of work on this by Trepidity · · Score: 2, Interesting

    Since the problem is tantalizingly easy to frame as a standard data-mining or machine-learning problem, albeit with some quirks, there's quite a lot of work from a lot of research groups that seems to be looking at it. Some examples: one, two, three, four, five, six, seven.

  10. The Art and Science of Wikipedia Vandalism by MillionthMonkey · · Score: 5, Interesting

    There is an art to Wikipedia abuse. If someone cites a Wikipedia article in some argument they're making, you can always just go to Wikipedia and edit the page so that they're wrong. But that's what a novice Wikipedia vandal does.

    A pro knows to edit the article in a very subtle way, so that it looks like the person has poor reading comprehension. Let's say the person cites a Wikipedia article with a sentence like this, in order to support the argument that Colbert is a Democrat.

    Although by his own account he was not particularly political before joining the cast of The Daily Show, Colbert is a self-described Democrat.[12][13]

    This bears the mark of authority, because of the footnote subscripts that are already on it. (We can skip the step where we maliciously relocate them here.)

    A novice might change it to this (correctly preserving the authoritative footnote superscripts):

    Although by his own account he was not particularly political before joining the cast of The Daily Show, Colbert is a self-described Republican.[12][13]

    It makes the person appear to be wrong- and the vandalism is obvious- like swapping Eurasia for Eastasia. There's no way he could have misread that.

    But change it to this

    Although by his own account he was not particularly political before joining the cast of The Daily Show, Colbert has even been described as a Democrat.[12][13]

    and the person looks not only wrong, but plausibly wrong because it looks like he can't read. That's what makes successful Wikipedia vandalism an art.

  11. Nice template by MillionthMonkey · · Score: 4, Funny

    Whoever posted this clearly isn't aware of the actual work being done in the field. For instance, I was running a ___[thing]___ in _[year]_, and it wasn't new at the time. They've gotten much more sophisticated since then. Why are they so intent on reinventing the wheel? Do they not even realize that the wheel exists already? Why not just improve on it instead?
    * * *
    This looks like a useful template for the standard "why reinvent the wheel" Slashdot post; I hope you don't mind if I reuse it.

  12. An arms race? by fysdt · · Score: 2, Interesting

    I believe that vandalism on Wikipedia can be limited. But would it really be possible to detect all kinds of vandalism?

    FTA:
    "Yahoo! Research will award a cash prize of 500 Euros to the winner of the plagiarism detection task. "

    500 Euro's doesn't sound much for detecting plagiarism on a site like Wikipedia...

  13. Re:What counts as vandalism on Wikipedia? by Tango42 · · Score: 3, Insightful

    Officially, vandalism is defined as edits made in bad faith. If you are trying to improve the article but are an idiot (which includes people that don't realise their own bias), that isn't vandalism, it's just idiocy. It is only if you are editing with the intention of making the article worse that you are vandalising.

  14. Yeah but what about when it's not vandalism ... by PaganRitual · · Score: 2, Funny
  15. In Wikipedia, everything is transparent by saibot834 · · Score: 4, Informative

    If I had mod points, I'd mod the parent up and the grandparent down. Seriously, almost everything in Wikipedia is transparent. Search the revision history and logs and look for the information you need. RTFM.

    A lot of people on /. seem to derive very general opinions about admins from a personal disappointing encounter. They do not include diffs of their edits or their username. From my experience in most cases the guy who got reverted by an admin broke some kind of rule (and often enough they just got reverted by a regular non-admin, but they assume it was an admin). Instead of RTFM those people post as AC complaining generally about admins without providing any traceable cases of admin abuse. I know my opinion isn't very popular, but unless you give concrete examples your allegations are just FUD.

    1. Re:In Wikipedia, everything is transparent by Jedi+Alec · · Score: 2, Insightful

      Hey, this is Slashdot. We're qualified to discuss any subject we damn well please based on our own prejudices and assumptions, while pretending that our high IQ's and common sense qualify us to pretend we're experts on whatever the discussion may or may not be about. What right do wiki admins have to assault our ivory towers when we sprinkle our droplets of distilled wisdom on their pages as well?

      --

      People replying to my sig annoy me. That's why I change it all the time.
  16. DWIM, PDCH by symbolset · · Score: 2, Funny

    You're looking for a DWIM (Do What I Meant) interpreter with PDCH (Predictive Digital Concierge Heuristics). While the technology is available it's currently quite costly. Bugs, errata, and maintenance can deliver less than an optimal experience. Might I instead offer you this mail order bride? We have imported personal assistants in stock from less privileged nations - and if you have the means we can outsource minute-to-minute management of them to our Bangalore VPDT (Virtual Presence Discipline Team). Please consult your accountant and tax lawyer concerning withholding for personal staff, particularly if you intend to pursue public service.

    /At your service!

    --
    Help stamp out iliturcy.
  17. How do you define *Vandalism* ? by Taco+Cowboy · · Score: 4, Interesting

    Case in point --- There is an article in Wikipedia about a certain country.

    In that article, they blame their previous British colonial master for everything.

    I tried to make some corrections to that article to make it more "neutral", and they changed it back within 10 minutes.

    I tried again, and again they changed it back.

    For the third time, I was warned by someone from Wikipedia (dunno if it's a volunteer or something) that I have no right to make any correction to that particular article anymore.

    The "THEY" in question is the government of that country. They have a "cyber-patrol" group in charge of "online propaganda" and that Wikipedia article is one of their many lies, aka propaganda, they have put online.

    Now, how do you define vandalism in this case?

    --
    Muchas Gracias, Señor Edward Snowden !