Google's Cache Ruled Fair Use
jbarr writes "An EFF Article states that: 'A district court in Nevada has ruled that the Google Cache is a fair use ... the Google Cache feature does not violate copyright law.' Notable is the basis that 'The Google Cache qualifies for the DMCA's 512(b) caching 'safe harbor' for online service providers.'" From the article: "The district court found that Mr. Field 'attempted to manufacture a claim for copyright infringement against Google in hopes of making money from Google's standard [caching] practice.' Google responded that its Google Cache feature, which allows Google users to link to an archival copy of websites indexed by Google, does not violate copyright law."
So if someone created a search engine which automatically, randomly and non-volitionally searches and caches MP3 files from websites which do not have "no archive" metatag, it's not breaking the law?
When those searched websites disappeared, this search engine may still serve those cached MP3 files for archival purposes?
Uncensored Google results requested and delivered by email
The judge then left the bench, walked over, and whacked the plaintiff and his council on the head with a salami.
This sig, aah-ah, is comin' like a ghost-sig...
Those of you who do the "yesbutNOCACHEtag" dance have got it backwards to: it's not the responsibility of the copyrightholder to sing to the tune of whatever the latest fad is. Rather, it's the other way around - google should convince people that it's in their interest to put a "CACHEME!" tag.
Comment removed based on user account deletion
I thought we sold "fair use" a while ago for three magic beans and some DVDs.
the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff
Browsers don't make their caches publicly availible. You're comparing apples and oranges.
Facts do not cease to exist because they are ignored.
Forget the fair use analysis, the most important thing here is the success of the "Implied License" claim. Basically, it goes like this: You operate a website. The web was created specifically with the idea that "robots" would crawl across it, and there is a standard well-known way to prevent them from crawling your site. Even more specifically, there's another standard well-known way to keep search engines from cacheing your content. Being on the web but not using these techniques means that you give search engines permission to cache your content.
It's sort of like what happens when you leave a potful of candy at your front-door on Oct. 31st. In theory, you could claim that all those kids who come to your door and help themselves are stealing. But, because everybody knows how Halloween works, you've implicitly given permission for them to do it.
In this opinion, the Fair Use analysis was basically just used as a stopgap of "what little infringement that's left after you account for the implied license is a fair use." If the website had included a robots.txt file, the fair use case would have been much harder to make.
The Implied License is a stake in the ground for "This is the Internet. The rules are different here." IMO, that's a good thing -- there are a bunch of things that just couldn't happen if you had to get explicit permission from every content owner.
I'm not sure about that. Although the result of this case seems fair and clearly indicated on several counts, there's a lot that might not apply to archives more generally, so I'm not sure how much of a precedent has been set.
In particular, the case was brought by someone who practically admitted trying to set Google up: he knew about mechanisms like META tags and robots.txt, knew that Google was caching his site, made no attempt to stop them, and indeed actually set up robots.txt explicitly to allow bots to crawl his site. This supports Google's first two defences here, having an implied licence and estoppel.
The most interesting discussion, IMHO, is on the fair use defence. The court considered in a lot of detail whether the use made by Google qualifies as fair use. On the first criteria (how the material is being used), it was found significant that the material was being used for different purposes in the cache than on the original site: the latter was presumed artistic, while the former allowed access to the material when the original site was down, historical comparisons of the site content, highlighting of search terms that made a page relevant to the user's search, etc. Hence the court concludes as follows:
The court also noted that Google made no attempt to profit from the display of the material, did not attach advertisements, made clear that the copy could be out of date, and linked clearly to the original source. (I wonder whether that non-profit, no-ads observation will come back to kick Google later...)
The other fair use discussion is less interesting, although the fact that the plaintiff had made his works available for free and not made any other attempt to profit from them was important, because this meant the market value of the original hadn't been damaged. One interesting tidbit is that apparently the SCOTUS has ruled that the fourth fair use factor (any damage to the market/value of the original work) can't be used to argue that the copyright holder could have licensed an otherwise fair use (such as the caching here) and thus the use can't be fair.
Some of the DMCA defence stuff could have quite significant implications. In particular, the fact that Google caches material only for a fairly short time (14-20 days is mentioned) is relevant, since a prior ruling about Usenet servers could be used.
In summary, Google would basically have won out on four different defences here, even without the fact that the original use might not qualify as direct copyright infringement (since the plaintiff went after the downloading done automatically in response to users; he didn't go after GoogleBot's initial copying process that caches the site on Google's system). It doesn't seem at all clear that a lot of the arguments would apply to other caching services, though: amongst other things, Google's cache in this case is temporary; known to the plaintiff, who had not tried to stop it and actually encouraged it; not for direct profit nor carrying any advertising; and clearly not damaging the market value of the original works.
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.