Bloggers are the New Plagiarism
mjeppsen writes "PlagiarismToday offers a thought-provoking article that frankly discusses concerns with plagiarism and rote content theft among bloggers. In the section entitled "Block quotes by the Dozen" the author mentions the so-called "gray area". That is PlagiarismToday's classification of the common blogger practice of re-using large blocks of text/content from the original article or source, even when the source is attributed."
even when the source is attributed.
Its not plagiarism then is it?
There are shills on slashdot. Apparently, I'm one of them.
Not that it's Slashdotted or anything, I just thought it'd be funny.
---
The Investor Relations Web Report calls it "the new plagiarism". Dan Zarella from Puritan City call those who engage in it "the best plagiarists". Others simply call them bloggers or, as Zarella also put it, "Human Aggregators".
They're a new breed of content users that walk a gray area between that which is clearly fair use and what is obviously content theft. Their blogs are marked with large swaths of block quotes and heavy content reuse, but also proper attribution and at least some original content.
These sites, as they've grown in number, have created a great deal of controversy among bloggers who are left to wonder if they are nothing more than content thieves in disguise.
Block quotes by the Dozen
These sites, which for this article I'll simply call "gray", are generally identified by a large number of very short posts, with much of it in block quotes or otherwise directly lifted content. Though they meticulously credit their sources, bowing to more traditional rules for blog attribution, and work to add at least some original content, usually over half of their material comes from other sources.
This has caused many bloggers to worry that these grey blogs might be trying to get away with content theft under the guise of legitimate attribution. The idea being that they can create a much larger volume of content if they only have to write a small portion of it. Users will simply visit the gray blogs since they are able to provide so much more information and, due to the use of liberal quoting, the user will then have no reason to visit the original source. After all, they already have most of the critical information.
While certainly grey blogs don't pose the same threat or raise the same concerns as spam blogs and other content scrapers, the cause for concern is clear. Even though blogging is about sharing and reusing information, excessive sharing threatens the authors penning the original content. The tale of the goose laying the golden egg springs to mind as, quite simply, greed can be the blogging world's biggest enemy.
A Separation of Degrees
What makes this issue so difficult to address, and so difficult to write about, is that it's not so much about gray blogs, but rather, various shades of grey blogs. The difference between someone simply quoting blogs and someone trying to tweak the system is not a clear cut matter, but a separation of degrees.
Quoting, even liberal quoting, is expected by blogs. It's a part of researching a story and covering ongoing stories as well as sharing information. If done properly, it can not only be used to create a new work, but also drive valuable traffic to the original site. In the blogging world, being the source is often a badge of honor.
However, basing your entire site, or even a larger percentage of it, on quoted content is viewed differently. Being a source in a larger article is one thing, but having your content be the majority of the article on another site another. What distinguishes one from the other is unclear at best. There are no math formulas or systems for determining what is right or what is too much.
More confusing still, everyone has a different idea of what constitutes content theft. With Creative Commons Licenses being very common, it's obvious some feel that copying an entire work is acceptable so long as attribution is affixed. Others would place the boundary well within what is usually considered fair use.
The challenge becomes to strike a balance and set some kind of guideline that is compatible with copyright law, acceptable under the current code of blogging ethics but also able to appease the concerns many bloggers share over grey sites.
A Proposed Solution
When I first looked at the problem, I was tempted to set guidelines by which a blogger should not get more than X percent of their overall content from other sites or use more than Y lines from another entry.
Parent is correct - plagiarism is claiming as your original work, someone else's work. If you attribute the work, it is clearly not plagiarism, and not a 'gray area'. The only 'gray area', I would say, would be copyright violation. It is fair use to quote someone else. But, at what point of copying large blocks of someone else's copyrighted material do you cross the line from fair use to copyright infringement?
Personally, I would err on the side of fair use - particularly if the bloggers are adding significant amounts of criticism/commentary (for example, Groklaw recently commented on the blog of some ZDNet analyst, and PJ included almost the entire text of the blog entry - but that is because she was doing a point by point rebuttal of his tripe - that should be considered fair use, because it's almost impossible to rebut in entirety, if you cannot quote in entirety). If they copy 5 pages of article text and add a 3 line summary/critique at the top, that, to me, would not be fair use.