Plagiarizing Wikipedia For Profit
An anonymous reader sends word of a dustup involving the publisher John Wiley and Sons and Wikipedia. Two pages from a Wiley book, Black Gold: The New Frontier in Oil for Investors, consist of a verbatim copy from the English Wikipedia article on the Khobar Towers bombing. This is the publisher that touched off a fair use brouhaha earlier this year when they threatened to sue a blogger who had reproduced a chart and a table (fully attributed) from one of their journals.
According to law, they are doing nothing illegal and are even protecting their own legal rights. This is what happens when law dictates human behaviour, instead of morals. Precisely this situation Plato envisioned when he said that good men need no laws to tell them how to behave, and evil men will find ways around the laws.
It is dangerous to be right when the government is wrong.
Although the author of the linked page says he wrote much of the disputed text and released it into public domain, the license governing Wikipedia is GNU FDL, as can be seen by a link at the bottom of every page. The combined work, because it includes work by others, is covered by that license.
:)
If Wiley published this text without citing the FDL, they're in violation of it. Seems pretty clear. Further, the license says that if the work is modified, the resulting document must also be released in FDL, according to section 4. This is where it gets interesting.
Because GDFL allows copying only if you allow the work to be freely copyable, and release the work it is included in under the GDFL.
If this is the case, then the whole book that this text is in becomes freely copyable, as long as it's source is attributed. If the publisher chooses not to conform to this license, then it becomes in breach of copyright (as the works on Wikipedia are covered by copyright law, they're simply globally available on a license backed up by copyright law).
The Wikipedia link discusses the problem of bringing copyright violation charges. But, even if it is released in the public domain, the problem for the publisher and author is the charge of plagiarism.
Many high-profile authors have been brought down by charges of plagiarism. They have not been sued for copyright violations but they have suffered significant consequences nonetheless. See, for example, the recent case of Kaavya Viswanathan. As such, I would think that the copyright violation angle can be pretty much ignored. It's distracting and weak. The plagiarism charge, however, could have significant consequences.
This is an interesting aspect of free license law that hasn't really been delved into yet.
You're so right! Noone on the wider internet or even slashdot has ever considered this!
There are shills on slashdot. Apparently, I'm one of them.
This article is tagged "thief". I thought it was standard /. wisdom that copyright infringement isn't theft?
Anyway, are we sure that the text is from Wikipedia, and not both from a third source? It's probably unlikely, but "they copied from Wikipedia" is far from the only explanation.
I know you are asking how Wikipedia will claim losses -- but I could as easily turn it around to the publisher.
How will the publisher claim losses, when (by the GNU FDL) they are now going to have to give away their work?
Quite simply, the answer is that the publisher won't have to give away their work. Rather, the work of the publisher is specifically in making a text available in the form of a book, along with referencable ISBN. They *will* at this point have to include a GNU FDL with the book, *even if they remove the offending pages from future copies*, since the entire book is now contaminated.
But honestly, the amount of photocopying and such that will happen is not going to significantly increase.
In the end, the fair price that a publisher can charge is defined by the utility that the publisher adds. Aside from that, the price that a publisher can *get* is more defined by the current accepted fair price for other books than for this book. So if a FDL goes in the book, then the reader will just look at it, say "oh, nice." And go on.
Now, how can Wikipedia claim damages? There are more damages possible than cash value. There are damages to the reputation of the actual authors, damages to frequency of customer visits, and these do have an inherent value to which a lawyer will assign a cash value. Yes, it will be slightly arbitrary. But, on the other hand I think that a jury will find that the value of damages is (1) relatively large, and (2) at least proportional to the increased value recieved by John Wiley Publishing and the author. Typically, when theft occurs value is destroyed (they steal my car, but bust up the key mechanism). Therefore, you might expect damages to total 1.5-3 times the expected sales of the book, scaled down by the proportion of pages that were plagiarized. So for a 120-pg book, 2 pages copied, damages could total 1/40 to 1/20 of total expected sales.
Correct Horse Battery Staple: 72 bits of entropy. Enter "Correct H" into google. When it generates the phrase, that's
There are (or were) at least two articles in Wikipedia that are my texts (from my site) with slight variations on sentences. So whoever visits those Wikipedia articles (or did so in the past) and then my pages must come to the conclusion that I stole the stuff from Wikipedia without giving credit. I can't even prove that because I don't have a public version history, and archive.org is spotty when it comes to my site.
In this case (Wiley book) the articles were there way before the book, so the case seems to be clear, but in general, I recommend to keep an open mind about who copied where.
"Hi! It looks like you're trying to steal someone else's intellectual property! Would you like me to a. attribute it properly for you or b. adjust it so your theft isn't so blatantly obvious?"
The higher the technology, the sharper that two-edged sword.
John Wiley and Sons could just edit the wikipedia article to be different. Problem solved.
ccalam - acoustic versions of new songs.
If your academic paper cites wikipedia, well, good luck with peer review...
Seven puppies were harmed during the making of this post.
They neither wanted nor did that, the Wikipedia text is under the GFDL which requires attribution of source. The WP author mentioned released his contribution to the public domain, but the wider Wikipedia community has the right to be outraged that this writer a) plagiarised Wikipedia and b) didn't credit the authors of the text that he plagiarised. He claimed the words as his own, which is unlawful in many copyright jurisdictions regardless of any licence that the original author may have used. If the publisher sells that book in Finland, then they could find themselves in hot water. And I don't mean a nice invigorating sauna.
I would submit that Wikipedia contains more plagarism than any one textual work ever created.
So someone copied Wikipedia?
Meh.
In addition to what Phil has pointed out in another reply, it's worth pointing out that there are many different Creative Commons licenses, and they vary in what they permit. Some of them do not permit commercial use, some of them require attribution, some of them are more permissive.
Please, if you are going to make claims about what something does and doesn't permit, at the very least you should be vaguely familiar with it yourself. Creative Commons is a brand name for a bunch of different licenses, not a license itself.
Bogtha Bogtha Bogtha
No, licences do not automatically apply, the *PL and CC* licences are not viral. If I copy your work and disregard the licence, then I have violated your copyright, and you can take me to court. If you released it under a particular licence, then that is pretty much irrelevant to me - if I didn't follow the licence, then I have simply violated your copyright. This author may well have asked a researcher or even a member of his family to come up with a couple of paragraphs about that incident and they copied Wikipedia, it would be unreasonable for the author's entire book to become freely available under the LGPL due to his carelessness in not checking the actions of a third party. A judge might come up with a reasonable compromise, such as ruling that the modified version of the text as appears in the book must be licenced under the LGPL and made available on the publisher's web site for download, and that future printings must credit the Wikipedia article as the source on which the text is based.
You are correct. The Wikimedia (with an 'm') Foundation does not have any legal rights to the content of Wikipedia other than what the GDFL gives everyone. If anyone is to be sued over copyright violations of text in Wikipedia, it needs to be by at least one of the editors of the article in question (not including editors that have just corrected spellings, added cleanup tags, etc).