Slashdot Mirror


Plagiarizing Wikipedia For Profit

An anonymous reader sends word of a dustup involving the publisher John Wiley and Sons and Wikipedia. Two pages from a Wiley book, Black Gold: The New Frontier in Oil for Investors, consist of a verbatim copy from the English Wikipedia article on the Khobar Towers bombing. This is the publisher that touched off a fair use brouhaha earlier this year when they threatened to sue a blogger who had reproduced a chart and a table (fully attributed) from one of their journals.

30 of 223 comments (clear)

  1. According to law... by dotancohen · · Score: 4, Insightful

    According to law, they are doing nothing illegal and are even protecting their own legal rights. This is what happens when law dictates human behaviour, instead of morals. Precisely this situation Plato envisioned when he said that good men need no laws to tell them how to behave, and evil men will find ways around the laws.

    --
    It is dangerous to be right when the government is wrong.
  2. summary: The copied text is subject to GNU FDL. by artifex2004 · · Score: 4, Informative

    Although the author of the linked page says he wrote much of the disputed text and released it into public domain, the license governing Wikipedia is GNU FDL, as can be seen by a link at the bottom of every page. The combined work, because it includes work by others, is covered by that license.

    If Wiley published this text without citing the FDL, they're in violation of it. Seems pretty clear. Further, the license says that if the work is modified, the resulting document must also be released in FDL, according to section 4. This is where it gets interesting. :)

    1. Re:summary: The copied text is subject to GNU FDL. by PhilHibbs · · Score: 3, Insightful

      Not that "interesting" really. Saying that it "must be" under the GFDL isn't the same as saying that it automatically is under the GFDL. They are in violation of copyright, plain and simple, the GFDL doesn't automatically apply to the whole book, that's crazy talk. It's then up to the lawyers or the courts to come up with a suitable violation penalty and a solution going forward.

  3. Re:How are they going to claim... by malkavian · · Score: 5, Informative

    Because GDFL allows copying only if you allow the work to be freely copyable, and release the work it is included in under the GDFL.
    If this is the case, then the whole book that this text is in becomes freely copyable, as long as it's source is attributed. If the publisher chooses not to conform to this license, then it becomes in breach of copyright (as the works on Wikipedia are covered by copyright law, they're simply globally available on a license backed up by copyright law).

  4. Copyright isn't the be all and end all by Skippy_kangaroo · · Score: 5, Interesting

    The Wikipedia link discusses the problem of bringing copyright violation charges. But, even if it is released in the public domain, the problem for the publisher and author is the charge of plagiarism.

    Many high-profile authors have been brought down by charges of plagiarism. They have not been sued for copyright violations but they have suffered significant consequences nonetheless. See, for example, the recent case of Kaavya Viswanathan. As such, I would think that the copyright violation angle can be pretty much ignored. It's distracting and weak. The plagiarism charge, however, could have significant consequences.

  5. Re:How are they going to claim... by Whiney+Mac+Fanboy · · Score: 5, Funny

    This is an interesting aspect of free license law that hasn't really been delved into yet.

    You're so right! Noone on the wider internet or even slashdot has ever considered this!

    --
    There are shills on slashdot. Apparently, I'm one of them.
  6. Re:How are they going to claim... by someone1234 · · Score: 3, Informative

    Copyright doesn't require proof of damages, but damages could be calculated from the sold copies of the book.

    --
    Patents Drive Free Software as Hurricanes Drive Construction Industry
  7. "There's no such thing as plagiarism..." by DreamingDaemon · · Score: 3, Funny

    As the incredibly-talented sci-fi writer Bob Unherdof said to his struggling burger-flipper friend George Lucas in 1975....

  8. Slashdot tags by EvanED · · Score: 5, Insightful

    This article is tagged "thief". I thought it was standard /. wisdom that copyright infringement isn't theft?

    Anyway, are we sure that the text is from Wikipedia, and not both from a third source? It's probably unlikely, but "they copied from Wikipedia" is far from the only explanation.

    1. Re:Slashdot tags by Sirch · · Score: 5, Insightful

      I thought it was standard /. wisdom that copyright infringement isn't theft? Only when it's Joe Public doing the infringement. When Bob Corporate infringes, Slashdot's bile rises...

      While that's a gross generalization of what I perceive to be a double-standard, I can see some kind of justification behind it - Joe Public generally doesn't make money off it, whereas Bob Corporate infringes for profit.
    2. Re:Slashdot tags by pipatron · · Score: 5, Insightful

      The thing is that they did copy the text and said "this is mine, I created this", thus you stole the attribution. This does not happen when you send an mp3 to a friend.

      --
      c++; /* this makes c bigger but returns the old value */
    3. Re:Slashdot tags by mgblst · · Score: 3, Insightful

      I think the double standard arises because it is ok to copy something, but to do it for profit is wrong. That seems a more reasonable standard than the one that you propose.

    4. Re:Slashdot tags by DarkOx · · Score: 4, Insightful

      You are totally correct in that plagiarism is the correct word and that it is the word we should be using to talk about the issue. To the point being made by other posters however plagiarism is a more serious matter then copyright infringement IMHO. With copyright infringement some control over the content is lost to the owner/author. With plagiarism not only is some of the control lost to the author but also the credit for the work.

      In both the academic and artistic circles this is much more damaging than copyright infringement. Once you have created a work of academic or artistic value and its recognized by others as one of those things, it really becomes your personal credibility in the field. If your an artist, it gets you hired to perform, or patronized, if your an academic it gets you a job in industry, a teaching position, funding to more similar work, etc.

      If someone plagiarizes your work then they may get these things instead of you and worse yet possible get you accused or suspected of plagiarism. I think its clear the original author is hurt much more by plagiarism then mere copyright infringement, which if people are bothering to infringe on your copyrights probably does more for your general credibility then anything else could and may actually benefit you in a variety, although certainly not all circumstances. If anyone wants to compare this to the RIAA crying about mp3z its would have to be like you uploading the latest top 40 song and then claiming you and your buddies performed it in the garage the other day.

      --
      Repeal the 17th Amendment TODAY! Also Please Read http://www.gnu.org/philosophy/right-to-read.html
  9. Re:How are they going to claim... by MickLinux · · Score: 4, Interesting

    I know you are asking how Wikipedia will claim losses -- but I could as easily turn it around to the publisher.

    How will the publisher claim losses, when (by the GNU FDL) they are now going to have to give away their work?

    Quite simply, the answer is that the publisher won't have to give away their work. Rather, the work of the publisher is specifically in making a text available in the form of a book, along with referencable ISBN. They *will* at this point have to include a GNU FDL with the book, *even if they remove the offending pages from future copies*, since the entire book is now contaminated.

    But honestly, the amount of photocopying and such that will happen is not going to significantly increase.
    In the end, the fair price that a publisher can charge is defined by the utility that the publisher adds. Aside from that, the price that a publisher can *get* is more defined by the current accepted fair price for other books than for this book. So if a FDL goes in the book, then the reader will just look at it, say "oh, nice." And go on.

    Now, how can Wikipedia claim damages? There are more damages possible than cash value. There are damages to the reputation of the actual authors, damages to frequency of customer visits, and these do have an inherent value to which a lawyer will assign a cash value. Yes, it will be slightly arbitrary. But, on the other hand I think that a jury will find that the value of damages is (1) relatively large, and (2) at least proportional to the increased value recieved by John Wiley Publishing and the author. Typically, when theft occurs value is destroyed (they steal my car, but bust up the key mechanism). Therefore, you might expect damages to total 1.5-3 times the expected sales of the book, scaled down by the proportion of pages that were plagiarized. So for a 120-pg book, 2 pages copied, damages could total 1/40 to 1/20 of total expected sales.

    --
    Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
  10. Wikipedia: victim and perpetrator by harmonica · · Score: 5, Informative

    There are (or were) at least two articles in Wikipedia that are my texts (from my site) with slight variations on sentences. So whoever visits those Wikipedia articles (or did so in the past) and then my pages must come to the conclusion that I stole the stuff from Wikipedia without giving credit. I can't even prove that because I don't have a public version history, and archive.org is spotty when it comes to my site.

    In this case (Wiley book) the articles were there way before the book, so the case seems to be clear, but in general, I recommend to keep an open mind about who copied where.

  11. Re:Copy/Paste needs help by ScrewMaster · · Score: 5, Funny

    "Hi! It looks like you're trying to steal someone else's intellectual property! Would you like me to a. attribute it properly for you or b. adjust it so your theft isn't so blatantly obvious?"

    --
    The higher the technology, the sharper that two-edged sword.
  12. The solution by niceone · · Score: 4, Funny

    John Wiley and Sons could just edit the wikipedia article to be different. Problem solved.

  13. Re:So is it plagiarism by Dunbal · · Score: 4, Interesting

    If your academic paper cites wikipedia, well, good luck with peer review...

    --
    Seven puppies were harmed during the making of this post.
  14. Wikipedia's Official Reaction... by CheeseburgerBrown · · Score: 3, Funny

    ...was unfortunately deleted by an overzealous editor who argued that the issue did not meet notability criteria.

  15. Not giving awayl licensing. by CarpetShark · · Score: 3, Informative

    How are they going to claim...losses, when they give away their work?


    They're not giving ANYTHING away. They're licensing a copy of their product to you, under certain conditions.
  16. Re:How about thinking about a license first by PhilHibbs · · Score: 4, Informative

    But if you chose to place your work under, say, the Creative Commons, you've just told the world at large, "here, take it and use it as you wish, I don't want anything in return, I don't forbid anything, have fun with it."

    They neither wanted nor did that, the Wikipedia text is under the GFDL which requires attribution of source. The WP author mentioned released his contribution to the public domain, but the wider Wikipedia community has the right to be outraged that this writer a) plagiarised Wikipedia and b) didn't credit the authors of the text that he plagiarised. He claimed the words as his own, which is unlawful in many copyright jurisdictions regardless of any licence that the original author may have used. If the publisher sells that book in Finland, then they could find themselves in hot water. And I don't mean a nice invigorating sauna.
  17. Wikiplagarism by PhearoX · · Score: 4, Insightful

    I would submit that Wikipedia contains more plagarism than any one textual work ever created.

    So someone copied Wikipedia?

    Meh.

  18. Re:How about thinking about a license first by ReallyEvilCanine · · Score: 3, Informative

    But if you chose to place your work under, say, the Creative Commons, you've just told the world at large, "here, take it and use it as you wish, I don't want anything in return, I don't forbid anything, have fun with it." So please have the _decency_ then to not act enraged when someone does just that. That would be all well and good and I'd be right there with you applying the LART to dotancohen were it not for the minor inconvenience that Wikipedia is not covered by the Creative Commons license but rather by the GFDL: From Wikipedia's Copyright FAQ:

    * Can I reuse Wikipedia's content somewhere else?

    Wikipedia's textual content is copyrighted, but you may reuse it under the terms of our licensing requirements, summarized below.

    Text in Wikipedia, excluding quotations, has been released under the GNU Free Documentation License (or is in the public domain), and can therefore be reused only if you release any derived work under the GFDL. This requires that, among other things, you attribute the authors and allow others to freely copy your work. (This is a summary, see the licence text for the exact details.)

    If you are unwilling or unable to use the GFDL for your work, use of Wikipedia content is unauthorized. Small quotations of Wikipedia content, with its source attributed, may be permissible under the "fair use" clause of U.S. copyright law. See Wikipedia:Citing Wikipedia for information about the proper citation of articles. No permission is needed to create a hyperlink to Wikipedia or its articles.

    Emphasis mine, used to highlight the important bits.
  19. Re:How about thinking about a license first by Bogtha · · Score: 4, Informative

    But if you chose to place your work under, say, the Creative Commons, you've just told the world at large, "here, take it and use it as you wish, I don't want anything in return, I don't forbid anything, have fun with it."

    In addition to what Phil has pointed out in another reply, it's worth pointing out that there are many different Creative Commons licenses, and they vary in what they permit. Some of them do not permit commercial use, some of them require attribution, some of them are more permissive.

    Please, if you are going to make claims about what something does and doesn't permit, at the very least you should be vaguely familiar with it yourself. Creative Commons is a brand name for a bunch of different licenses, not a license itself.

    --
    Bogtha Bogtha Bogtha
  20. Re:How are they going to claim... by allcar · · Score: 3, Insightful
    Is anyone else reminded of Hitchhikers?

    It is interesting to note that a later and wilier editor sent the book backwards in time through a temporal warp and then successfully sued the breakfast cereal company for infringement of the same laws.
  21. Re:How are they going to claim... by PhilHibbs · · Score: 5, Interesting

    No, licences do not automatically apply, the *PL and CC* licences are not viral. If I copy your work and disregard the licence, then I have violated your copyright, and you can take me to court. If you released it under a particular licence, then that is pretty much irrelevant to me - if I didn't follow the licence, then I have simply violated your copyright. This author may well have asked a researcher or even a member of his family to come up with a couple of paragraphs about that incident and they copied Wikipedia, it would be unreasonable for the author's entire book to become freely available under the LGPL due to his carelessness in not checking the actions of a third party. A judge might come up with a reasonable compromise, such as ruling that the modified version of the text as appears in the book must be licenced under the LGPL and made available on the publisher's web site for download, and that future printings must credit the Wikipedia article as the source on which the text is based.

  22. Re:How are they going to claim... by Tango42 · · Score: 4, Interesting

    You are correct. The Wikimedia (with an 'm') Foundation does not have any legal rights to the content of Wikipedia other than what the GDFL gives everyone. If anyone is to be sued over copyright violations of text in Wikipedia, it needs to be by at least one of the editors of the article in question (not including editors that have just corrected spellings, added cleanup tags, etc).

  23. Re:How are they going to claim... by __aayurq3262 · · Score: 4, Insightful

    No, licences do not automatically apply, the *PL and CC* licences are not viral. If I copy your work and disregard the licence, then I have violated your copyright, and you can take me to court. If you released it under a particular licence, then that is pretty much irrelevant to me - if I didn't follow the licence, then I have simply violated your copyright. If you violated the copyright, then you are liable for damages. The measure of damages is calculated based on a variety of factors, one of which is what the author charges for the copied work in the marketplace. That "charge" or fee is usually the *minimum* liability for infringing the author's rights. In this case, the author's "charge" is a license to copy the rest of the work under the LGPL/CC. That charge is highly relevant to the measure of damages suffered by the author.

    it would be unreasonable for the [infringing] author's entire book to become freely available under the LGPL due to his carelessness in not checking the actions of a third party. You may think it's unreasonable, but others might not. The infringing author had an obligation to supervise third party created works and not to infringe. I place a high value on the rights granted to the public under LGPL and CC licenses and comparatively much less value on private copyright rights to prevent the public from copying.
  24. Two differences... by SanityInAnarchy · · Score: 3, Insightful

    First, when Joe Public infringes, he generally does so for himself. When Bob Corporate does, it's for the world at large.

    Second, Bob Corporate usually gets away with it. If Joe Public is caught, he faces heavy, personal penalties. Bob Corporate can simply have Bob Corporate Inc cover the damage, assuming that they're caught at all and that they lose in court.

    Finally, we take great delight in finding a similar double-standard in Bob Corporate. This company, for instance, went after someone else for a fairly sizable quote (with attribution), and we now find them stealing wholesale (with no attribution). This seems almost second nature to most corporations -- in fact, I forget where it was, but I seem to remember reading someone psychoanalyzing a corporation (as if it were a human) and finding that it's insane.

    Which comes back to "A person is smart. People are dumb, stupid, panicky, dangerous animals, and you know it."

    --
    Don't thank God, thank a doctor!
  25. Re:Copy/Paste needs help by David+Gerard · · Score: 3, Interesting

    There is actually a bot on Wikipedia that runs Google checks on all new articles and marks any text it finds elsewhere for speedy zapping. This turns up more than a few false positives, but mostly huge amounts of copyright violations that then get quickly zapped.

    Wikipedia remains the only "Web 2.0" project that proactively gives a damn about copyright.

    --
    http://rocknerd.co.uk