Slashdot Mirror


Why Wikipedia Articles Vary So Much In Quality

Hugh Pickens writes "A new study shows that the patterns of collaboration among Wikipedia contributors directly affect the quality of an article. 'These collaboration patterns either help increase quality or are detrimental to data quality,' says Sudha Ram at the University of Arizona. Wikipedia has an internal quality rating system for entries, with featured articles at the top, followed by A, B, and C-level entries. Ram and graduate student Jun Liu randomly collected 400 articles at each quality level. 'We used data mining techniques and identified various patterns of collaboration based on the provenance or, more specifically, who does what to Wikipedia articles,' says Ram. The researchers identified seven specific roles that Wikipedia contributors play (PDF starting on page 175): Casual Contributor, Starter, Cleaner, Copy Editor, Content Justifier, Watchdog, and All-round Editor. Starters, for example, create sentences but seldom engage in other actions. Content justifiers create sentences and justify them with resources and links. The all-round contributors perform many different functions. 'We then clustered the articles based on these roles and examined the collaboration patterns within each cluster to see what kind of quality resulted,' says Ram. 'We found that all-round contributors dominated the best-quality entries. In the entries with the lowest quality, starters and casual contributors dominated.'"

35 of 160 comments (clear)

  1. Really? by d34dluk3 · · Score: 5, Funny

    Articles written by experienced people with a wide array of skills are stronger than those written by novices? Never could have guessed.

    1. Re:Really? by Anonymous Coward · · Score: 5, Interesting

      Also updates vary. For example:

      Christina_Applegate

      Current career

      Applegate starred in the ABC comedy, Samantha Who?, until it was canceled on May 18, 2009. The series costarred Jean Smart, Jennifer Esposito, and Melissa McCarthy. The series was about a 30-year-old who, after a hit-and-run accident, develops amnesia and has to rediscover her life, her relationships, and herself.[9] Shortly after the cancellation was announced, Applegate began a campaign to get the show back into production,[10] which was unsuccessful.

      Applegate will play Elizabeth Montgomery of Bewitched fame, who died of colorectal cancer, in the upcoming film Everything Is Going to Be Just Fine, due to be released in 2009.

      In January 2009, Applegate appeared with her TV brother David Faustino (Bud Bundy from Married with Children) in an episode of Faustino's show Starving.[11]

      Within two lines of each other, one article is talking about the future tense in 2009 and the past tense in 2009. Anyone editing the article as a whole would notice this. When, however, you have people editing piece by piece, simple mistakes can be made like that.

      Also, it doesn't help that I am too lazy to edit the changes myself. Leave it up to the snobby community. I've tried to contribute before, it was the last time I made that mistake.

    2. Re:Really? by Anonymous Coward · · Score: 5, Insightful

      Because 90% of Wikipedia is dead. People drive-by now and then and drop in a sentence or fix a spelling error, but for the most part nobody is editing the articles unless it's a politically contentious topic.

      The fun part was writing the articles in the first place, now phase is over, nobody wants to be Wikipedia's janitorial crew and deal with the super-aspbergers that populate that place. Which is why Wikipedia is doomed to a slow bit-rot into irrelevance.

  2. Oh. by Anonymous Coward · · Score: 4, Funny

    I always figured that some of the articles were poor because they were written by Americans, rather than much more intelligent Europeans or Asians.

    1. Re:Oh. by Arthur+Grumbine · · Score: 3, Funny

      I always figured that some of the articles were poor because they were written by Americans, rather than much more intelligent Europeans or Asians.

      At first I thought you were trolling, but then I checked the facts ! With how well-written that article was I can only assume it was someone from Hong Kong.

      --
      Now that I think about it, I'm pretty sure everything I just said is completely wrong.
  3. Missing role: deleters by Cyberax · · Score: 4, Insightful

    Seriously, I'm encountering more and more 'deleted' articles when I search Wikipedia.

    Can someone stop deleters? Or at least offer an option to view deleted articles (Deletionpedia works only for English language).

    1. Re:Missing role: deleters by Darkness404 · · Score: 4, Funny

      Exactly. And then these people who revert -any- change without even looking at it. What? An anonymous contributor added a few words to make a phrase make since? Revert it!

      --
      Taxation is legalized theft, no more, no less.
    2. Re:Missing role: deleters by LinuxIsGarbage · · Score: 3, Insightful

      Exactly. And then these people who revert -any- change without even looking at it. What? An anonymous contributor added a few words to make a phrase make sense? Revert it!

      Reason for edit: Change a word to make a phrase make since

      ;)

    3. Re:Missing role: deleters by Darkness404 · · Score: 3, Funny

      Reverted. I wrote that so it must be perfect! How dare someone have the -audacity- to change one of my words on -my- article! What is the world coming to? An encyclopedia where the masses can edit it!?

      --
      Taxation is legalized theft, no more, no less.
    4. Re:Missing role: deleters by b4dc0d3r · · Score: 3, Funny

      Audacity is open source, so anyone can have it.

      Here's my citation: http://en.wikipedia.org/wiki/Audacity

    5. Re:Missing role: deleters by thelamecamel · · Score: 4, Insightful

      Yes, this is really quite pathetic. On several occasions now I have wanted some information on a particular topic (e.g. a shitty old game I picked up, my mobile phone, or even a description of lemon party). I go to the wikipedia page, I can tell that several people went to the effort of writing an entry on that topic but the page was deleted by someone who decided that no-one would ever want to see that information. This is arrogance in the extreme - destroying some people's work because they incorrectly assumed that no-one would ever want to see it. Was the article getting in the way before it was deleted?!

      Surely Wikipedia could have a link to view pages that were 'deleted' for non-notability - what would be so bad about that?

    6. Re:Missing role: deleters by moogsynth · · Score: 3, Insightful

      It's much worse than that. Articles aren't deleted because people assume no-one will want to look at them. Articles with hundreds, even thousands of hits a month are commonly deleted because they are seen as not being notable enough for inclusion. The reasons why are usually because there aren't enough sources to prove that the article in question is notable, or the sources are of a sketchy nature (blogs and the like). The actual guidelines themselves say that articles should have the best citations that people can find--often enough mentions on blogs simply have to do. The notability guidelines are being taken as literal truth by a huge number of wikilawyers, who will mercilessly use it as a weapon to nominate articles for deletion. They'll then use other trollish guidelines as absolute law to rubbish the citations people dig up to try and save the article from being deleted. I've seen it happen way too many times now, and I just don't have the patience to dealwith these sorts of fuckwits.

  4. Maybe looking at it the wrong way? by Korin43 · · Score: 3, Interesting

    I know I'm more likely to "casually contribute" to Wikipedia on a low-quality article. Maybe the casual contributors just don't see the point of changing anything in an article that's already had a lot of attention?

  5. Quality Ratings by Toonol · · Score: 4, Insightful

    I think there may be a possible flaw in using Wikipedia's internal quality rating. It measures adherence to wikipedia standards... but that may not necessarily be the same thing as actual quality.

    In that scheme, excellent articles with posters who tend to brush up against some of wikipedia's more picky guidelines, would be rated lower. It's minor, because in general wikipedia's guidelines are there to make better articles, but it sometimes happens.

    It's like defining intelligence as the ability to do well on intelligence tests. It's certainly related, and there's not much of a better alternative, but you have to remember you aren't measuring the trait directly.

    1. Re:Quality Ratings by zappepcs · · Score: 3, Insightful

      That flaw has always been there, and similar was included in every version of every printed encyclopedia. It's hard to get around that without thousands of editors working full time. The premise of Wikipedia is good, but if you want to trust some information you found on the Internet... errrmm, you need to validate it, corroborate it, and research it yourself if necessary. For me, Wikipedia makes a great starting point to learn about something, just as any single book on any given subject is a good place to *start*. The principle of trust but verify applies for many things, but caveat emptor equally applies. Personally, much of the content of Wikipedia is better than asking Yahoo! Answers and others. meh, it's a thing. If you were supposed to get all your answers from a single source, god wouldn't have made Al Gore invent the Internet. Get off my lawn!

  6. Quality by bsDaemon · · Score: 4, Funny

    Wikipedia is great for anything involving mathematics or Star Wars. Everything else seems kind of suspect to me.

    1. Re:Quality by Darkness404 · · Score: 3, Interesting

      As opposed to? You know, I've come to the conclusion that Wikipedia is a lot more reliable, current and useful than a Google search. How many of us when going through links find the majority of them to be old (circa 1998) sites using outdated layouts, outdated information, etc. And it isn't like print media is much better. Really, Wikipedia is a great source to find needed information (note that accurate information is often unimportant compared to what the masses think) that you would spend days hunting down on Google and in libraries.

      --
      Taxation is legalized theft, no more, no less.
  7. My experience with WikiPedia by QuietLagoon · · Score: 5, Insightful

    Articles dominated by one or two "keepers" tend to be the most biased and lowest quality. Quality edits are tossed aside merely because they do not meet the agenda wanted by the keepers.

  8. Wikipedia's Editors by Darkness404 · · Score: 4, Insightful

    I think the main problem with Wikipedia is it went from "an encyclopedia where you -might- find something of interest" to "a place you can find anything!" to now "a place where you can possibly find some things but if we don't like it, it gets deleted and we don't want your help unless you feel like reading 22342342343 policies, follow them exactly and patrol "your" page constantly". Seriously, Wikipedia 2-3 years ago was a lot better than Wikipedia now. Why is it that editors think deleting articles somehow makes it better? Especially since Wikipedia is online and a few new articles don't translate to (much) extra load?

    --
    Taxation is legalized theft, no more, no less.
    1. Re:Wikipedia's Editors by rm999 · · Score: 4, Insightful

      It's funny, some comments in here complain that many articles have gotten stale and aren't well-maintained. Others, like yours, complain that there aren't enough articles. These two complaints are at odds with each other - a fixed number of editors can either maintain a smaller, more important set of articles, or can devote their time to starting and watching new articles. Your criticism is largely overblown too: there are, on average, over 1000 new articles a day. I'd like to see any print Encyclopedia do this in a year.

      Frankly, I prefer less but higher-quality articles, because it minimizes the amount of misinformation (one of the biggest plagues in early Wikipedia). It helps minimize the number of esoteric articles from being started and then forgotten. The only real rule you need to know when starting an article is notability: the 22342342343 policies are only in place to remove subjectivity from the process. Common sense can get you most of the way there, but if you are in the habit of starting articles understanding the five "general notability guideline" will save you a lot of hassle. And only takes about five minutes.

    2. Re:Wikipedia's Editors by Darkness404 · · Score: 3, Interesting

      But what do we -gain- for deleting every long article on every Pokemon in existence? Does it really matter if we have a single article for each episode of Keeping Up Appearances? In a print encyclopedia its easy to say that yes it does matter because extra pages translates to extra costs, however, with Wikipedia it doesn't. Yes, it may use a few -kilobytes- of disk space and if its really as obscure as everyone says, there won't be any extra bandwidth costs.

      --
      Taxation is legalized theft, no more, no less.
    3. Re:Wikipedia's Editors by gsslay · · Score: 4, Informative

      Why is it that editors think deleting articles somehow makes it better?

      Because ;

      - if the quality of Wikipedia is measured by averaging the quality of all its articles, deleting the crap raises the quality of Wikipedia.
      - crap inevitably attracts more crap. If the crap articles weren't deleted they would multiply.
      - crap pages, written by people who mistake Wikipedia for a free web-host for their fan site, give Wikipedia a bad name.
      - if you can't find the good articles for stumbling over the crap, you're likely to stop looking and go some place else.

      If crap pages weren't deleted Wikipedia would drown under them. Regardless of infinite disk space, or unlimited bandwidth. Wikipedia is essentially a database. If you fill a database with too much garbage it becomes useless, no matter how much data of true value in in there also. The noise to signal ratio becomes unbearable.

    4. Re:Wikipedia's Editors by grcumb · · Score: 3, Insightful

      Why is it that editors think deleting articles somehow makes it better?

      Because ;

      - if the quality of Wikipedia is measured by averaging the quality of all its articles, deleting the crap raises the quality of Wikipedia....

      [Emphasis mine.]

      Wow. So in your mind, 'not notable' is equivalent to 'crap'. That's quite a leap.

      Perhaps you should make that case first before you embark on any other argument.

      --
      Crumb's Corollary: Never bring a knife to a bun fight.
  9. Quality isn't such a simple metric, never will be by fyngyrz · · Score: 3, Insightful

    You can easily have an extremely high quality, 100% accurate and in-depth Wikipedia article without a single external reference. Therefore, the entire analysis is bullshit.

    Which is about what I've come to expect from anything that tries to meta Wikipedia.

    It's a mish-mosh. As long as article creation and revision is open, it will remain one. Legitimate attempts to characterize any article's quality can only be done by a true expert in the subject matter at hand, if one can even be found. Which is why Wikipedia's resident pedants utterly foul up so many excellent contributions.

    A-, B- and C-class articles, my ass.

    --
    I've fallen off your lawn, and I can't get up.
  10. back in the day by circletimessquare · · Score: 3, Informative

    there was a book called the cathedral and the bazaar

    http://en.wikipedia.org/wiki/The_Cathedral_and_the_Bazaar

    it delineates the difference between bottom up and top down organization, specifically in regards to software development models like linux versus gnu

    obviously, this overlaps thematically with wikipedia in that wikipedia was once a bazaar, and is now becoming a cathedral

    regardless of which model is better for wikipedia, the pluses and minuses of the cathedral versus the bazaar models of software development should be instructive for what exactly wikipedia is winning, and losing, in its trade off between bazaar and cathedral

    --
    intellectual property law is philosophically incoherent. it is your moral duty to ignore it or sabotage it
  11. Comment removed by account_deleted · · Score: 4, Insightful

    Comment removed based on user account deletion

  12. Re:Quality isn't such a simple metric, never will by Homburg · · Score: 3, Insightful

    You can easily have an extremely high quality, 100% accurate and in-depth Wikipedia article without a single external reference.

    No, you can't. Without references, a reader has no way of knowing whether the article is accurate or not; and an editor who writes an article who is unfamiliar with the references that could be cited is unlikely to be sufficiently knowledgeable to genuinely produce a high-quality article.

  13. One key flaw by cashman73 · · Score: 3, Interesting
    The study wasn't exactly complete. First, they only looked at Featured quality articles, A-class, B-class, and C-class. They totally neglected GA-class, of which there are currently over 8,000 of those. Secondly, FA-class and GA-class are handled differently than A, B, and C-classes. FA and GA are Wikipedia-wide assessment systems, with specific criteria that must be adhered to in order for articles to get listed there. FA is pretty rigorous, and only the best of the best get through after having been nitpicked, often far too much (yes, stupid crap like commas and en-dashes). GA is a bit less rigorous, with a review by only one editor being required for listing. And yes, this one editor system has been criticized in the past; though there is a GA reassessment system, and the community has gone through a pretty thorough system of GA sweeps, getting rid of some of the older GAs that were passed before the current criteria were enforced better.

    A, B, and C-class assessments are not Wikipedia-wide. They are assessed by individual Wikiprojects (of which there are literally hundreds of these). And each Wikiproject has their own standard of what it considers A, B, and C. Some Wikiprojects are much easier, others are more rigorous (like WikiProject Military history). Furthermore, C-class is relatively new, having been created just within the past two years or so; so there's probably still a lot of B-class articles that should be C-class.

    1. Re:One key flaw by Blakey+Rat · · Score: 4, Insightful

      What the holy shit are you talking about?

      Maybe the study should have been, "why are people working with Wikipedia completely unable to communicate in English to other people?" Shit, at the very least, why not tell us what your constantly-used GA and FA acronyms actually *mean*.

      Anybody care to translate that into English?

  14. Roles by lyinhart · · Score: 3, Informative

    In the five years that I've been a Wikipedia editor, I've played most of these roles, but right now I'm definitely a watchdog. I primarily revert vandalism. It's a good way to stay out of edits wars. At this point, most of the stuff on Wikipedia is way too messy to clean up and/or improve. I'd rather clean up Cowboys Stadium on any given Sunday in the Fall than clean up content on Wikipedia. As for deletionists? They deal with the administrative (sigh) aspect of Wikipedia, while this study seems to be driven mainly on the content itself.

    --
    Freedom is drinking a beer in the park when you're supposed to be at work.
  15. Re:Quality isn't such a simple metric, never will by interkin3tic · · Score: 3, Funny

    You can easily have an extremely high quality, 100% accurate and in-depth Wikipedia article without a single external reference.

    [Citation needed.] :-P

  16. Re:Quality isn't such a simple metric, never will by jbolden · · Score: 3, Insightful

    Take a look at the math articles. Heck most of the original content like episodes of BattleStar Galactica, information about cartoon characters or fringe political movements didn't have high quality references. Wikipedia built itself by specializing in materials for which only so / so or no references existed. Articles on wikipedia were higher quality that the same material on the same topics anywhere else.

  17. Re:Quality isn't such a simple metric, never will by owlnation · · Score: 4, Insightful

    Yes , you can . However Without references, a reader has no way of knowing whether the article is accurate or not

    Fixed that for you. If wikipedia allowed users to volunteer to sign pages, and in that signature were their qualifications, then some credence could be attached to the article -- referenced or not.

    Add to that, that all wikiadmins really should be identified on the site. If they are going to edit, delete, attack and defend content, as well as ban users, we really should know what their qualifications actually are.

    I'd be willing to bet 90% of the current problems with wikipedia would disappear overnight if the admins lost their anonymity. Much of the neofascist behavior, and agenda-ism, would certainly disappear. It solves the "who watches the watchers" problem overnight.

    While there are good reasons why articles can be submitted anonymously, those in charge of the site do NOT need to be anonymous -- and for the sake of transparency, honesty and ethical credibility, we NEED to know who they are. Are they afraid of the truth? What do they have to hide?

  18. Then add the citation by tepples · · Score: 3, Informative

    Equally nonsensical are the seemingly random insertion of [citation needed] tags on things that are matters of public record.

    In that case, you can help Wikipedia by removing the {{citation needed}} and replacing it with <ref>name of the relevant public record</ref>.

    1. Re:Then add the citation by tepples · · Score: 3, Informative

      Because verifiability is the core content policy of Wikipedia. If you don't like it, contribute to another web site such as Everything 2.