Slashdot Mirror


Who (Really) Writes Wikipedia

Nico ? La ! writes "Aaron Swartz questions Jimbo Wales' (Wikimedia's founder) belief and evangelized truth that only around 500 people are the most important contributors to Wikipedia. Whereas the truth is that they probably are the people who do the most editing. From the post: 'For example, the largest portion of the Anaconda article was written by a user who only made 2 edits to it (and only 100 on the entire site). By contrast, the largest number of edits were made by a user who appears to have contributed no text to the final article (the edits were all deleting things and moving things around).'" Which ultimately means that Wikipedia in some ways much more closely mimics a real encyclopedia, with many contributors writing the bulk of the content, but a small group massaging that text to insure standards compliance with the overall work. Interesting thing there and worth your time, although the super-computer thing doesn't make a lot of sense to me.

24 of 175 comments (clear)

  1. Anaconda by webrunner · · Score: 5, Funny

    I hear the number of anacondas has tripled in the last six months. Maybe that should be in the article?

    --
    ADVENTURERS! - ANTIHERO FOR HIRE - CARDMASTER CONFLICT
  2. More Statistics & What I Expect by eldavojohn · · Score: 4, Interesting
    But when you count letters, the picture dramatically changes: few of the contributors (2 out of the top 10) are even registered and most (6 out of the top 10) have made less than 25 edits to the entire site. In fact, #9 has made exactly one edit -- this one! With the more reasonable metric -- indeed, the one Wales himself said he planned to use in the next revision of his study -- the result completely reverses.
    With the shear mass of people writing on Wikipedia, I think you'd be able to find examples of the articles that were heavily contributed to by a large group of people (say things like Fermat's Theorem) and also things that are primarily the work of one person. Frankly, that's what I expected of Wikipedia.

    What about these statistics? Could Wales perhaps post average number of edits per page with a standard distribution? What about the same for average number of users contributing to page? What about statistics for average number of characters changed per edit?

    Things that have many books written about them are going to be edited by a lot of people that read those books (like The Beatles). But if I want to read up on Procul Harum (A not-so-well-known rock band), I'm assuming that there is some die hard nutjob out there with two children named Procul and Harum that filled in most of the information in that page.

    Is this a good thing? Well, yes and no. I think The Beatles' entry holds to more rigorous standards than Procul Harum's on account of the possibility of one person unintentionally inserting their personal views into Wikipedia. For instance, "Known as the World's Greatest Rock Band" may be appropriate for The Beatles' page but not for Procul Harum's. Yet, we all know how insane fans treat their favorite bands. Passion and emotion are not useful tools when authoring Wikipedia or history in general. And that, in my opinion, is Wikipedia's greatest hinderance.
    --
    My work here is dung.
    1. Re:More Statistics & What I Expect by qazsedcft · · Score: 5, Insightful

      Is this a good thing? Well, yes and no. I think The Beatles' entry holds to more rigorous standards than Procul Harum's on account of the possibility of one person unintentionally inserting their personal views into Wikipedia. For instance, "Known as the World's Greatest Rock Band" may be appropriate for The Beatles' page but not for Procul Harum's. Yet, we all know how insane fans treat their favorite bands. Passion and emotion are not useful tools when authoring Wikipedia or history in general. And that, in my opinion, is Wikipedia's greatest hinderance.

      But on the other hand, the more people view an article the more it is likely to be corrected and balanced for NPOV. This is a little-bit like free market price-correcting mechanisms - it isn't perfect, but in the opinion of many the results are fairly acceptable.

    2. Re:More Statistics & What I Expect by dfghjk · · Score: 4, Insightful

      "But on the other hand, the more people view an article the more it is likely to be corrected and balanced for NPOV."

      The more people that view an article the more likely the article is to reflect the views of more people. Sometimes that will tend towards a NPOV and sometimes not. The article will increasingly reflect popular opinion (which may not be neutral at all).

      "it isn't perfect, but in the opinion of many the results are fairly acceptable."

      Perhaps, but not definitive. How do you justify that claim?

      When an article can be objective without threatening the views of people it stands a decent chance of being so. Wikipedia is, by its nature, not an objective resource. It is a useful one, though.

  3. Define: Important by Sub+Zero+992 · · Score: 5, Insightful

    Well, maybe both Mr. Swartz and Mr. Wales are correct.

    Encyclopedias are measured by the number of articles they have, the average size of those articles and the "Quality" of the articles [here see other disputes :P]. Wikipedia is trying hard for quality, hence the importance of copy editors - those quick edit users who do a lot of banging articles into shape. They do an important job. These are the general-purpose-but-shallow editors.

    Of course, without the initial contribution of a large number of specialists, the working draft of many articles would never get done. These are the specialist-article-experts who know what they know, and leave the rest to others.

    So, this is likely to be another case of everyone having some of the truth and only a more enlightened, liberal view of the situation can lead to insights which can be used to improve the entire content creation process.

    --
    They who would give up an essential liberty for temporary security, deserve neither liberty or security - Ben Franklin
    1. Re:Define: Important by pmc · · Score: 5, Interesting

      Wikipedia is trying hard for quality, hence the importance of copy editors - those quick edit users who do a lot of banging articles into shape.

      Depends what you mean by quality - grammatically correct and logically ordered is one thing. Actual content - possibly in rough and ready form - is another.

      What JW is (apparently) arguing is that what he considers the contributions to wikipedia is best measured by "edits", and that by this metric there is a hardcore of 500 users that do most of the work.

      What the article argues is that the a better measure of of contribution to wiki is raw material, and that far from 500 people doing it, it is actually orders of magnitude more than this.

      My opinion is that anyone treating all edits are equal and using that to derive a metric for measuring user contributions to the site is using a seriously flawed method. Selling the output of this method as "The Truth about who Created Wikipeida (or the Tale of the Noble 500") is just trying to invent history to fit their preconceived notions.

  4. They still don't get it by Metasquares · · Score: 5, Insightful

    The number of edits you make means nothing because an edit can mean writing an entirely new article or a very small change (some articles, such as "peerage", have hundreds, if not thousands, of such edits).

    The obsession over edit count was the reason I stopped contributing to Wikipedia to begin with: My voice wasn't being heard because I did not have the time to make thousands of changes to the encyclopedia.

    The fact that we are still having this discussion indicates that little has changed.

    1. Re:They still don't get it by interiot · · Score: 4, Insightful

      Um, doesn't this study tend to support your assertions? It says that people with high edit counts contribute little to nothing to articles, while newcommers are the ones who provide most of the meat...

    2. Re:They still don't get it by ergo98 · · Score: 5, Insightful
      My voice wasn't being heard because I did not have the time to make thousands of changes to the encyclopedia.

      Aren't those edits how you're getting your voice heard? Or were you more concerned with gaining credibility among the Wikipedia inner cicle?
  5. not quite like a real encyclopedia ... by Lazy+Jones · · Score: 4, Informative
    ...but a small group massaging that text to insure standards compliance with the overall work.

    Nope, this small group is tweaking the text the way they see fit, basing their changes on their personal opinions and feelings and not on some god-given inspiration that leads to better quality or with standards compliance in mind. So the conclusion above is almost valid - it is like a real encyclopedia, but with an anarchic structure in the team of editors and no educated QA team. It's more like an encyclopedia reworked by a non-cooperating team of censors.

    --
    "I love my job, but I hate talking to people like you" (Freddie Mercury)
    1. Re:not quite like a real encyclopedia ... by interiot · · Score: 4, Insightful

      Another way to look at it would be that Wikipedia has an enormous number of policies that newcommers shouldn't be expected to memorize before contributing. So newcommers contribute the meat of an article, and people who have been around for a while apply all the style rules, find a couple categories for it, make sure it has the backlinks it should, etc... Sure, there's room for editorializing there, but hopefully there's enough established people around that they can review each other's work and make sure they're sticking to policy...

    2. Re:not quite like a real encyclopedia ... by julesh · · Score: 4, Insightful

      Actually, I think most of those people with the largest number of edits really are tweaking it towards complying with internal standards. Most of them are adding references to unsourced material, or adding {{fact}} tags to mark where unsourced material is so that someone with better knowledge of the subject can find a reference for it, or making stylistic changes towards standardised article structures, categorising information, fixing spelling or grammar errors, debating editorial decisions about what should and shouldn't be included, and things like that. These jobs aren't glamorous, but a lot of wikipedia's biggest contributors work on them.

    3. Re:not quite like a real encyclopedia ... by jacksonj04 · · Score: 4, Funny

      I must say I've noticed {{fact}} springing up all over the damn place recently. It's hard [citation needed] to read most of an article [citation needed] without seeing them.

      --
      How many people can read hex if only you and dead people can read hex?
  6. Editcountitis by Dan+East · · Score: 5, Interesting

    As with anything that keeps stats or running totals, there are those that seek to achieve the highest count possible. Wikipedia is not immune to this. There are those that will make 50 small, distinct edits to an article (each comprising minor changes, like punctuation, formatting, spelling corrections, etc) to increase their edit count.

    It is my personal experience that those with the highest edit counts peruse any and all articles applying Style Guidelines. This results in changes like like correcting capitalization of headers ("External Links" -> "External links"), placing bullets in front of external links, formatting dates, wikifying appropriate words, updating links that redirect, etc. Once a person becomes familiar with the guidelines they can easily nitpick pretty much any article and find something to correct (or at least change to their personal preference).

    Also, don't forget those that run bots. That's a very easy method to rack up edit points.

    Dan East

    --
    Better known as 318230.
    1. Re:Editcountitis by the-intersocialist · · Score: 5, Insightful

      And this is a problem, how?

      Wikipedia is not a video game where you need to gain exp-points to level. It is an encyclopedia. People spending their days correcting punctuation are also contributing and if ranking higher in the statistics is their motivation, that is fine by me.

      Those who make great contributions in the form of content are recognized in another way then statistics: their articles become featured, and that is a far greater honor then beeing the top contributor by means of spellchecking.

    2. Re:Editcountitis by Chacham · · Score: 4, Interesting

      It is an encyclopedia

      It *pretends* to be an encyclopedia. It is not, however. Due to its very nature it is a mixture of current thought and old grudges, with the latter being less influential. Calling it an encyclopedia is like mixing all Slashdot comments into one comment and modding it informative.

    3. Re:Editcountitis by Maru+Dubshinki · · Score: 5, Insightful

      Please. An encyclopedia is a form and genre, not a classification of quality. Your post is as silly as claiming "The Phantom Menace" was not a real movie because it sucked so much.

      Encyclopedias are reference sources containing information on a variety of topics. (Don't believe me? Take a look at the many definitions Google pulls up for the query "Define:encyclopedia".) Wikipedia is a reference source which contains information on a variety of topics in an explicitly encyclopedic format. Case Closed. Issues of quality and reliability are entirely separate and unrelated.

      --
      Enquiring minds want to know!
  7. Much like Digg by otisg · · Score: 5, Interesting

    This is not a surprise. That is simply another example of nature's laws on the web. This is not much different from the now well known fact that most stories on Digg are submitted by a handful of people (see: Top 100 Digg Users Control 56% of Digg's HomePage Content).

    --
    Simpy
    1. Re:Much like Digg by DingerX · · Score: 4, Insightful

      So explain to me, how does 56% of the content is submitted by the top 100 Digg users relate to the assertion here that the overwhelming majority of Wikipedia content is not controlled by the "Wiki-Elite."

      Personally, I think this policy of focusing on total edits for Wikiality is brilliant: it keeps the generalists/prestige mongers focusing on copy editing -- where they can help -- and away from content creation -- where they usually can't. Wikipedia is largely the creation of a bunch of specialist nuts. The "Wiki-Elite" are the nuts whose speciality is Wikipedia. Better to keep them away from the content; otherwise, it's akin to having someone with a degree in journalism reporting on a technical issue.

  8. types of editors by gEvil+(beta) · · Score: 5, Informative

    I think the problem is arising because of lack of distinction between two different types of "editors." There are people who edit the content of an article (content editors), and there are people who edit the copy (copyeditors). One is concerned with altering the actual material that is being presented to present a different subset of information. The other is concerned with making edits for grammatical consistency, readability, and style.

    --
    This guy's the limit!
  9. Re:Duh, they design it that way by TheSeer2 · · Score: 4, Informative

    1. Jew's isn't protected. 2. Apart from a select few vulnerable pages [for example, the vandalism page] protection is usually dropped very rapidly. 3. I've seen many heavily edited articles that have never been protected... ever. 4. Profit!!! ... don't make unsubstaniated claims.

  10. Ignore that man by inviolet · · Score: 5, Insightful

    Jimmy Wales is wrong, and probably on purpose...

    There is no glory in being one of a million diffuse contributors.

    But there *is* glory in being one of a small elite group, the group that really matters, the group that the founder adores. Jimmy is baiting his contributors with this possibility, in order to motivate them.

    --
    FATMOUSE + YOU = FATMOUSE
  11. Nobody is the most important by wackymacs · · Score: 4, Interesting

    I am a Wikipedia contributor, with around 5800 edits. I started editing in 2005. Most of my edits are minor ones, cleanup, etc. Although I have contributed a lot of text to many articles, I also enjoy contributing free-use images and providing references for articles in order to improve the reliability of an article; too many articles on Wikipedia do not cite their sources. Jimmy Wales should not state that only several hundred users are the most important editors on Wikipedia, because in reality everyone on Wikipedia who is contributing, whether by adding text or by cleaning up articles or doing maintenance, they're all helping out in one way or another. Most people all work on different stuff, so nobody can be called the "most important", its too broad a term.

  12. The good and the bad by XchristX · · Score: 4, Insightful

    The bottom line is that, while technical articles (on most disciplines and fields) on wikipedia are very good (better than anywhere else on the internet even), articles that are about history, religion, politics,movies, personalities, or recent events are hopelessly (though not always systemically) biased and full of unrepresentative crap. This is because wikipedia runs on de-facto consensus (though it's not supposed to) and only the most (and biggest) noisemakers get their edits on the article, and detractors cause endless revert-wars, edit-wars, vandalisms, admin intervention, article protection/semi-protection, insults in user talk pages (some nasty ones too), blocks, mediation cabals, RfC's, RfA's, secret edit-cabals, tag-warring, NPOV, TotallyDisputed etc. etc. and all sorts of things that are part of the dark and invisible underbelly of wikipedia that you DON'T see.

      An encyclopedia is not (IMHO) supposed to contain articles that are highly controversial and subject to different interpretations. It should be about objective and verifiable facts. 90% of the articles on wikipedia that are non-technical contain maybe 10% of verifiable facts, and 90% noise.

      The sad truth is that the high visibility of wikipedia (Google Searches usually point to wikipedia articles on the search subject first or second or third, if an article on the subject exists) means that people READ all this nonsense and, unaware of the many problems of wikipedia, assume it to be the truth based on a facade of legitimacy that wikipedia presents (at least, as far as the cats I listd above are concerned). These edits that are put there by cabals of editors, many of whom hold extremist views or represent organisations that have such extremist views are thus propagated into the masses of readers as facts, without the right balance to them, which is very damaging.

      That's what I think anyways, feel free to flame me down or whatever. Any replies and/or responses would be interesting to me as it would give me an idea as to how many people on slashdot regard wikipedia articles as canonically true and always NPOV.

    ~~~~

    --
    l'Homme n'est Rien l'Oeuvre Tout: Gustave Flaubert to George Sand