Slashdot Mirror


One Man's Quest To Rid Wikipedia of Exactly One Grammatical Mistake

An anonymous reader writes with this Fascinating profile of one particular Wikipedia editor Giraffedata (a 51-year-old software engineer named Bryan Henderson), who has spent the last seven years correcting only the incorrect use of "comprised of" on Wikipedia. Using a code to crawl for uses of "comprised of" throughout all of Wiki's articles, he'll then go in and manually correct them (for example, using "consists of" or "composed of") and has made over 47,000 edits to date.

45 of 425 comments (clear)

  1. Monomania by war4peace · · Score: 4, Funny

    Comprised of the ability to withstand the urge of doing anything else but this.

    --
    ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
    1. Re:Monomania by show+me+altoids · · Score: 5, Funny

      "He'll then go in and manually corrects them." Slashdot needs this guy to proofread submissions.

      --
      I feel sorry for people that don't drink, because when they get up in the morning, that's as good as they're gonna feel
    2. Re:Monomania by Hussman32 · · Score: 4, Insightful

      While some people may find his actions cromulent, I personally think his work embiggens us all.

      --
      "Who are you?" "No one of consequence." "I must know." "Get used to disappointment."
    3. Re:Monomania by jellomizer · · Score: 4, Funny

      I don't get why we tolerate people that have vices that are actually far more harmful. But an improper use of spelling or grammar, causes people with such an uproar.

      My personal hypothesis bases on no facts whatsoever. Is that teachers in the attempt in instill proper grammar and other language skills, actually went to far and caused people to see grammar and language as a moral issue, and not just a skill, to insure that ideas are commonly understood.

      Many of these people going after language issues, are often just fighting the natural migration of language where the meaning is well understood and isn't affecting peoples understanding of the information.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    4. Re:Monomania by osu-neko · · Score: 2

      You only say that because you're evil. :p

      --
      "Convictions are more dangerous enemies of truth than lies."
    5. Re:Monomania by im_thatoneguy · · Score: 2

      My guess is that they are attempting to direct language. For instance spelling bees in Spanish would be absurd since Spanish has had a pretty vigorously defended spelling system. English has always(Old-English) been (Proto-Germanic) a bit of a hodgepodge (Anglo-French) of disparate (latin) bits and pieces (French). As a result there is no consistency. I agree that it would be nice to clean up and standardize our spelling/grammar but obviously it's a futile task.

    6. Re:Monomania by sconeu · · Score: 3, Informative

      My personal hypothesis bases on no facts whatsoever. Is that teachers in the attempt in instill proper grammar and other language skills, actually went to far...

      Corrected: ... actually went too far...

      Sorry, given the story topic, I couldn't help myself.

      --
      General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
    7. Re:Monomania by eyrieowl · · Score: 4, Funny

      Hopefully that wasn't the only grammar error you picked up on in that post.... :)

    8. Re:Monomania by khellendros1984 · · Score: 2

      I have two issues with your post. First, the use of some kind of standardized grammar aids in comprehension by decreasing the difficulty of interpreting the meaning. If you reduce the number of re-parses that the reader has to do due to unexpected/non-standard word/punctuation use, the information comes through smoother and cleaner. Second, the reader will notice the "register" of the text and tend to give less credence to the information if it doesn't match what they expect. People don't expect idiosyncrasy from an encyclopedia. As an extreme example, I'm perfectly capable of understanding a computer science article written in Cletus Spuckler's dialect, but I'm not likely to trust the information without some form of independent verification.

      Of course, perception of the appropriate register, and the sets of language features included each register, are also subject to change over time, and the appropriateness of some specific feature at a specific time to a particular register also isn't a binary value.

      --
      It is pitch black. You are likely to be eaten by a grue.
    9. Re:Monomania by Anonymous Coward · · Score: 2, Insightful

      Because people like to enforce rules. It makes them feel good. All humans have this to a degree.
      The rules for grammar are very well defined and no actual harm is caused by being a grammar Nazi.
      I'ts a win win, no victims, someone gets to feel superior, someone gets to be upset, and human drives are fulfilled for all.
      It's a wonderful system! :-)

      Basically people like to shout about it because no one is actually harmed by it and as such no one is really going to get upset about it.
      Just a kind of pretend upset.

      You go after those people with real vices that are harmful! Well, your calling someones identity very much into question and you'll face a real response.

      It's all about the drama!

    10. Re:Monomania by jimbo-nally · · Score: 2

      Hilarious. Nicely done sir. I can't believe your attempt at levity was modded as Interesting instead of Funny.

    11. Re:Monomania by Anonymous Coward · · Score: 5, Funny

      Irregardless of you're opinion its still a waist of time for all intensive purposes.

    12. Re:Monomania by war4peace · · Score: 3, Interesting

      Apples and oranges.
      Tolerating people with vices falls under a different category than tolerating grammar blunders.
      Some things in languages are subject to change, yes. Random example: "all things considered" versus "all things considering". While the latter is considered as being incorrect, it is actually correct in some cases, when, let's say, you weigh components of an equation while building your own conclusion. "All things considered" technically refers to your train of thought ending before you draw a conclusion (implying you have enough time to do so), while "all things considering" means that the situation is ongoing and based on current set of events you decide to take *this* course of action.

      Language is everchanging, yes, but some of its components need to be represented correctly to eliminate inconsistencies, especially where said inconsistencies might give a totally different meaning to what's being said, effectively corrupting conveyed information. Another reason is more of a personal perception: whoever consistently makes blatant mistakes is much more likely to not respect both themselves and the audience. Shortly put, they're less likely to be trusted by me, not in the "I won't lend you money" sense, but in the "it's riskier to do business with you" sense.

      I don't care if the person next to whom I party is less literate but I do care if I am supposed to do business with them. Just today I cut a small deal with a company which sent me an e-mail telling me they extended their offer until 31st of February. This mistake meant (to me) they're prone to overlooking shit. If one can't properly verify a 3 row e-mail, how am I supposed to trust them to properly verify 100K lines of code?

      So yes, shortly put it's about self esteem, attention to details and how prepared one is to care about details, learn proper ways of doing things and not mess up because "hey, it happens".

      Disclaimer: English is not my native language.

      --
      ...gis sdrawkcab (usually not responding to ACs; don't bother posting as AC)
  2. I guess he missed this one by Marginal+Coward · · Score: 2, Funny

    he'll then go in and manually corrects them

    (Sorry, couldn't resist.)

  3. Similarity by Anonymous Coward · · Score: 5, Interesting

    This twitter account is similar: Correcting users on Twitter who type "sneak peak" with "sneak peek", we have "Stealth Mountain". https://twitter.com/stealthmountain

    Not sure either of these qualify as 'news', but what the hell, it's a slow news day anyways.

    1. Re:Similarity by Anonymous Coward · · Score: 2

      http://theoatmeal.com/comics/sneak_peek

  4. Also known as The Regex Execution Man by eexaa · · Score: 2

    Next up: The Wget Guy manually downloads a "hand-tailored" copy of wikipedia and sells it for living on DVDs.

    Oh wait.

  5. Re:language fluidity by Russ1642 · · Score: 2

    No. All of his edits have already been reverted by crazy Wikipedia editors.

  6. Speaking of mistakes by jdavidb · · Score: 5, Informative

    Using a code to crawl for uses of "comprised of" throughout all of Wiki's articles

    Wikipedia is not "Wiki." Wikipedia is a wiki. There are many wikis in the world, and they are not all Wikipedia. Wikipedia is the publication, and wiki is the medium. "All of Wiki's articles" is like saying "All of Newspaper's articles."

    Maybe I can get away with this offtopic pedantic comment since this whole article is about a guy spending years trying to fix small errors. :)

    1. Re:Speaking of mistakes by jeffmeden · · Score: 2

      Using a code to crawl for uses of "comprised of" throughout all of Wiki's articles

      Wikipedia is not "Wiki." Wikipedia is a wiki. There are many wikis in the world, and they are not all Wikipedia. Wikipedia is the publication, and wiki is the medium. "All of Wiki's articles" is like saying "All of Newspaper's articles."

      Maybe I can get away with this offtopic pedantic comment since this whole article is about a guy spending years trying to fix small errors. :)

      To be completely pedantic, you don't actually know that he confined his search to just Wikipedia. The article revolves around Wikipedia but he might be crusading across the entire internet, for all you know. Many other Wiki systems allow user contributions just like Wikipedia.

    2. Re:Speaking of mistakes by rossdee · · Score: 4, Informative

      " There are many wikis in the world, and they are not all Wikipedia."

      This is true. "Wiki" is a girls name in Aotearoa. It is the shortened form of Wikitoria

  7. Re:ATM machine ..? by ShanghaiBill · · Score: 5, Informative

    It's a common enough idiom.

    There are many common idioms that are used incorrectly in conversation or casual writing. But that doesn't mean they should be used in formal writing, such as an encyclopedia.

  8. Re: Oh, thank the gods. . . . by Anonymous Coward · · Score: 2, Funny

    Yes, very crazy of him to be focused on the cause of his own crazy rather than yours!

  9. What a WASTE of time by gurps_npc · · Score: 2
    This guy really nees to watch Erin MckEan's TED talk. Basically she is part of the large movement to throw out this kind of stupidity.

    Languages come in two types: Living and Dead. Dead languages have solid grammatical rules that must be obeyed. Living languages are in flux, constantly evolving. What this person did is NO different than a British person going through all of Wikipedia and replacing the word Humor with Humour or Favorite with Favourite.

    Words and Grammar CHANGE. Enough people use the word AINT, it gets imported into the language.

    Why? Because living languages are comprised of words and phrases that take their meaning from the common usage, not from a book. If people understand a meaning, that is the meaning.

    There is no language police outlawing people, no punishment - except for public disapproval and opinion - for misuse. This guy is not the public and has no right to disapprove of how the public uses the language.

    --
    excitingthingstodo.blogspot.com
    1. Re:What a WASTE of time by fibonacci8 · · Score: 5, Informative

      Words and Grammar CHANGE. Enough people use the word AINT, it gets imported into the language.

      You've spelled "ain't" wrong.

      --
      Inheritance is the sincerest form of nepotism.
  10. Patent Grammar Too by Venner · · Score: 5, Informative

    Yep. I work in patents, where a small incorrect use of grammar or terms of art can mean losing millions of dollars. The classic case in point:

    Patent A:
    "A vehicle comprising 3 wheels and a motor."

    Patent B:
    "A vehicle consisting of 3 wheels and a motor."

    Assuming it is 1700 or something and no prior-art exists,

    Patent A can go on to claim 4-wheeled motorized vehicles (since a 4-wheeled vehicle does after all have 3 wheels), 3-wheeled vehicles with shark fins, whatever. "Comprising" is open-end and interpreted as "it has at least this," or as you say, "including."

    Patent B is strictly limited to 3 wheels and a motor, no more and no less. If a competitor uses 4 wheels, or adds shark fins, or two motors, then it isn't covered by the patent. "Consisting of" is a closed phrase interpreted as "having exactly."

    The incorrect grammar "comprised of" would be an ambiguity, and as such, interpreted in the strictest way -- limiting as in Patent B.

    It may seem worrisome that scientists and engineers of all people -- some of the absolute worst butchers of language and grammar out there! -- are the ones who become patent agents or patent attorneys, but all-in-all, the ones who do so tend to be some of the smartest folks I've met. You need to be well-rounded to do the job.

    --
    A preposition is a terrible thing to end a sentence with.
    1. Re:Patent Grammar Too by oodaloop · · Score: 5, Interesting

      Wow, that's exactly the opposite of this comment above. And every dictionary I can find says they're synonyms.

      --
      Tic-Tac-Toe, Global Thermonuclear War, and relationships all have the same winning move.
  11. Let the trolling being by sjbe · · Score: 2

    Now that this is publicly known you can be sure there will be trolls who will mess with the guy.

  12. Re:I concur by lgw · · Score: 5, Informative

    This guy's my hero - misuse of "comprised" is a pet peeve of mine.

    Despite sounding vaguely similar to "composed", it's not a synonym. Comprised is a near-synonym for included, but implies totality. "The band comprised a guitarist, a bassist, and a drummer" means that was the entirety of the band. Since so few people actually understand this, I tend to avoid the word.

    --
    Socialism: a lie told by totalitarians and believed by fools.
  13. Re:ATM machine ..? by Half-pint+HAL · · Score: 5, Funny

    It's a common enough idiom.

    There are many common idioms that are used incorrectly in conversation or casual writing. But that doesn't mean they should be used in formal writing, such as an encyclopedia.

    Well met, friend, for thou speakst great sooth! Many people have I encountered who are such dullards as to employ incorrectly the English tongue. 'Tis tragedy of the vtmost that the youth of our times know not how the language should properly speak itself. A gay fellow would I be were my fellow man to renew his acquaintance with the King's English.

    Alas! but I must forsake thy gentle companie, for mine friends await me in a local hostelrie, and so must I away! Parting is such sweet sorry. Anon, good sir, anon!

    --
    Got them moderator blues I blieve I walk out the do', With these mod-points I been gettin', I 'most never post no mo'
  14. Re:I concur by NotDrWho · · Score: 5, Informative

    That's funny, because every online dictionary I consulted said that it was grammatically correct to use "comprise" as a synonym for "compose." In fact Merriam Webster has this to say:

    Sense 3 : compose, constitute

    Usage Discussion of COMPRISE
    Although it has been in use since the late 18th century, sense 3 is still attacked as wrong. Why it has been singled out is not clear, but until comparatively recent times it was found chiefly in scientific or technical writing rather than belles lettres. Our current evidence shows a slight shift in usage: sense 3 is somewhat more frequent in recent literary use than the earlier senses. You should be aware, however, that if you use sense 3 you may be subject to criticism for doing so, and you may want to choose a safer synonym such as compose or make up.

    --
    SJW's don't eliminate discrimination. They just expropriate it for themselves.
  15. Next up... by argStyopa · · Score: 2

    Global search: "utilize"
    Replace with: "use"

    Global search: "baited breath"
    Replace with: "bated breath"

    I could do this ALL day, man.

    --
    -Styopa
    1. Re:Next up... by arth1 · · Score: 3, Informative

      I would, literally, also change most occurrences of literally to figuratively.

      Then there are news articles about people who get evacuated, and not even in a hospital. If you can't substitute the word "empty", don't use the word evacuated.

      And this is a lost battle, but a burglar burgles, he doesn't burglarize, unless he turns others into burglars.
      Similar with ruggedized, which more often than not should be replaced with rugged.

      But perhaps most of all, when people write "I could care less" when they mean "I couldn't care less".

  16. Re:One man's rebuttal by willworkforbeer · · Score: 2

    This Sunday, Sunday, Sunday... Let's Get Rrrrready to Rrrrrumble ... It's a Code Throwdown

    --
    Pretending this is my office full of bitter coworkers..
  17. Re:One man's rebuttal by NotDrWho · · Score: 4, Funny

    And I will join your allied fight against Grammar National Socialism by conducting a concurrent invasion of the beaches of literature and retaking ground from the fascists who mistakenly think that it is grammatically incorrect to begin sentences with conjunctions. And we will win this fight!

    --
    SJW's don't eliminate discrimination. They just expropriate it for themselves.
  18. Great idea! by morgauxo · · Score: 3

    I think I will go on a quest to get rid of as many occurances of "X times LESS than ..." as possible.

  19. Re:language fluidity by NotDrWho · · Score: 3, Insightful

    what happens when oxford announces that "comprised of" now also means "made up of" ?

    They already have.

    --
    SJW's don't eliminate discrimination. They just expropriate it for themselves.
  20. Re:I concur by thaylin · · Score: 2

    That says it is being used more often, not that it is grammatically correct. Languages naturally change over time, and this is probably a good example of it.

    --
    When you cant win, ad hominem.
  21. Good by Anonymous Coward · · Score: 2

    I applaud this man's efforts. Copy editing and proper English have gone down the tubes in writing today. It's become so bad that even major news outlets are publishing egregious errors in headlines and teasers, as well as in article body content, daily and at an alarming rate. Sad to see that there is no higher standard anymore and everyone writing like a 14-year old, C-level English student.

  22. Re:My pet peeve: by slinches · · Score: 4, Funny

    Oh good, I should be safe on that one. I use "condiment" in place of both of those.

    --
    Knowledge Brings Fear
  23. Re:Edit count whoring by Zordak · · Score: 4, Funny

    I will write a script that locates ambiguous usage of commas, and will replace them with the correct oxford comma usage.

    Sir, that is uncouth, uncivilized and incorrect.

    There are legitimate grammar and usage debates, with cogent arguments on either side. But the Oxford Comma is the One True Way. The best argument I've ever heard against it is, "Well, it saves a few drops of ink on the printed page." Anti-Oxford Comma heathens should be drawn, quartered, and burned at the stake for befouling the language.

    --

    Today's Sesame Street was brought to you by the number e.
  24. Re:I concur by narcc · · Score: 2

    "It is a damn poor mind indeed which can't think of at least two ways to spell any word."
    -- Andrew Jackson

  25. Re:I concur by lgw · · Score: 3, Interesting

    No, it's sort of the inverse of composed:

    * My team comprises X, Y, and Z
    * My team that is composed of X, Y, and Z (or, awkwardly, X, Y, and Z compose my team).

    If it were UML, the arrow would go the other way.

    --
    Socialism: a lie told by totalitarians and believed by fools.
  26. Re:Edit count whoring by NotDrWho · · Score: 3, Interesting

    The best argument I've ever heard against it is, "Well, it saves a few drops of ink on the printed page."

    And yet you place the period inside the quotation marks. Foolish American!

    --
    SJW's don't eliminate discrimination. They just expropriate it for themselves.
  27. Re:I concur by tsotha · · Score: 2

    It's a pet peeve of mine as well, but it's a small pet.

    I don't want it to affect my holy war on the criminal misuse of the apostrophe.