Developing a Vandalism Detector For Wikipedia

Posted by kdawson on Sunday February 28, 2010 @08:45AM from the false-positives-would-hurt dept.

marpot writes "In an effort to assist Wikipedia's editors in their struggle to keep articles clean, we are conducting a public lab on vandalism detection. The goal is the development of a practical vandalism detector that is capable of telling apart ill-intentioned edits from well-intentioned edits. Such a tool, which will work somewhat like a spam detector, will release the crowd's workforce currently occupied with manual and semi-automatic edit filtering. The performance of submitted detectors will be evaluated based on a large collection of human-annotated edits, which has been crowdsourced using Amazon's Mechanical Turk. Everyone is welcome to participate."

Re:Existing by Rik+Sweeney · 2010-02-28 08:59 · Score: 4, Informative

Further Reading
http://en.wikipedia.org/wiki/User:ClueBot

--
Summation 2
Re:Existing by broken_chaos · 2010-02-28 09:09 · Score: 3, Informative

Oh yes, it definitely hits a large number of false positives, presumably also 'fixed' within 30 seconds. For every one that goes reported (including the hundreds or thousands of archived reports), there must be many that go unreported, by 'non-Wikipedians' who edited a page with an error, and then went on their way. Or by people who didn't stick around to 'watch' that their edit doesn't get 'fixed' by an automated process...
Re:Existing by Ignorant+Aardvark · 2010-02-28 09:14 · Score: 3, Informative

The false positive rate on the anti-vandalism bots is a lot lower than you would think. The bots are written quite conservatively, take a lot of factors into account, and only pull the revert trigger when they are quite sure.
It's the type II error rate that's pretty high. Unfortunately, that's not solvable without strong AI.

--
Cyde Weys Musings - Scrutinizing the inscrutable
Re:Existing by marpot · 2010-02-28 09:38 · Score: 5, Informative

We have studied the accuracy of ClueBot, and found that (on a small corpus) it has very good precision (low false positive rate), but a very low recall (low true positive rate). (See: http://www.uni-weimar.de/medien/webis/publications/downloads/papers/stein_2008c.pdf) But the picture might look quite different on a large scale.
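To make those terms concrete, here is a minimal Python sketch of how precision, recall, and false positive rate fall out of a detector's decisions on human-labeled edits. The function name, the labels, and the numbers are made up for illustration; they are not taken from the ClueBot study.

```python
def detector_metrics(labels, predictions):
    """Score a detector against human labels.

    labels:      True where an annotator marked the edit as vandalism
    predictions: True where the (hypothetical) detector flagged the edit
    """
    tp = fp = fn = tn = 0
    for is_vandalism, flagged in zip(labels, predictions):
        if flagged and is_vandalism:
            tp += 1   # vandalism caught
        elif flagged:
            fp += 1   # good edit wrongly flagged (false positive)
        elif is_vandalism:
            fn += 1   # vandalism missed (type II error)
        else:
            tn += 1   # good edit left alone

    def ratio(numerator, denominator):
        # Avoid dividing by zero when a class never occurs.
        return numerator / denominator if denominator else 0.0

    return {
        "precision": ratio(tp, tp + fp),            # share of flags that were right
        "recall": ratio(tp, tp + fn),               # share of vandalism that was caught
        "false_positive_rate": ratio(fp, fp + tn),  # share of good edits flagged
    }


# Toy data: 4 vandalism edits, 6 good edits; a cautious detector flags only one.
labels = [True] * 4 + [False] * 6
predictions = [True] + [False] * 9
metrics = detector_metrics(labels, predictions)
print(metrics)
```

A conservative detector like the one in this toy example never flags a good edit, so its precision is perfect, but it also lets most vandalism through, which is the low-recall pattern described above.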
Re:The problem is the edits going live... by Shoe+Puppet · 2010-02-28 10:04 · Score: 4, Informative

A system like this has been implemented for the German Wikipedia. Almost everybody who has an account can verify articles to be vandalism-free; unless you are logged in, you see the last verified version by default.

--
(+1, Disagree)
Re:How about an Admin Abuse Detector? by OverlordQ · 2010-02-28 11:59 · Score: 4, Informative

How about a log of each admin's activities, including reversions, bans, etc, and a way for non-admins to challenge actions (without spending countless hours in an appeal process worthy of a federal court).
Reversions: http://en.wikipedia.org/wiki/Special:Contributions
Bans: http://en.wikipedia.org/wiki/Special:Log/block
Deletes: http://en.wikipedia.org/wiki/Special:Log/delete
Anything else you're too lazy to find yourself?

--
Your hair look like poop, Bob! - Wanker.
In Wikipedia, everything is transparent by saibot834 · 2010-02-28 13:17 · Score: 4, Informative

If I had mod points, I'd mod the parent up and the grandparent down. Seriously, almost everything in Wikipedia is transparent. Search the revision history and logs and look for the information you need. RTFM.
A lot of people on /. seem to derive very general opinions about admins from a disappointing personal encounter. They do not include diffs of their edits or their username. In my experience, in most cases the guy who got reverted by an admin broke some kind of rule (and often enough they just got reverted by a regular non-admin, but they assume it was an admin). Instead of RTFM, those people post as AC, complaining generally about admins without providing any traceable cases of admin abuse. I know my opinion isn't very popular, but unless you give concrete examples, your allegations are just FUD.