Google's Cache Ruled Fair Use

A new search engine is in order by imoou · 2006-01-26 08:24 · Score: 4, Interesting

So if someone created a search engine which automatically, randomly and non-volitionally searches and caches MP3 files from websites which do not have "no archive" metatag, it's not breaking the law?

When those searched websites disappeared, this search engine may still serve those cached MP3 files for archival purposes?

--
Uncensored Google results requested and delivered by email

Re:A new search engine is in order by Ninwa · 2006-01-26 08:26 · Score: 4, Interesting

Maybe... but there's a difference. That difference is that the items cached were already in violation of copy right law, most likely. Interesting though... and doesn't archive.org archive files? I know they've archived several small programs I've written that were linked on my site at one point in time.
Re:A new search engine is in order by Anonymous Coward · 2006-01-26 09:33 · Score: 2, Informative

Google's "cooperation" was purely a mean joke on the liars, er, lawyers. They basically rubbed the shitbag^H^H^H^H^H^H^Hattorneys' noses in poo by linking to the complaint at the bottom, which included all of the URLs which were being contested - and the shitbag lawyers had no recourse. :)

It'd be akin to complaining about references to pornography being available to all at the library of congress because copyrights have been filed on pornographic photos, stories, articles, films, etc. - "OMFG the US Gubbment is making Porn available to children! Call Dubya! Think of the children!"

not anymore than any browser by scharkalvin · 2006-01-26 08:25 · Score: 2, Insightful

Most browers have a built in cache. They don't violate copyright law
do they?

Re:not anymore than any browser by metternich · 2006-01-26 08:45 · Score: 5, Insightful

Browsers don't make their caches publicly availible. You're comparing apples and oranges.

--
Facts do not cease to exist because they are ignored.
Re:not anymore than any browser by squoozer · 2006-01-26 09:18 · Score: 2, Interesting

True, but there is a subtle difference between the Google cache and your local cache - Google are making money from displaying other peoples work. Ok maybe it's not 100% direct, they don't have adverts on the cached page (yet) but it is an additional service they are providing that draws people. More people == more money ergo you can conclude that they are profiting by displaying our work.

To perhaps make it clearer: Many of the articles I write are one page long and my time is funded by the adverts on my site. The cached google page shows the whole of an article but far fewer adverts than the real page therefore I am losing out on revenue because of Googles cache. I think it is difficult to argue that Googles use is fair as they are displaying the whole article. Fair use was supposed to protect people that wanted a snippet or two for review purposes and that sort of thing.

Imagine if you reprinted a chapter of a book or the leading article of a newspaper. You would be sued before you knew what was happening and yet Google seem to get away with copying and republishing whole websites.

--
I used to have a better sig but it broke.
Re:not anymore than any browser by imthesponge · 2006-01-26 11:28 · Score: 2, Insightful

Or when sites provide content to Google's spiders but deny it to everyone else *cough*expertsexchange*cough*.
http://www.experts-exchange.com/Storage/Q_21272116 .html
http://www.google.com/search?q=cache:GyTRLbavSIgJ: www.experts-exchange.com/Storage/Q_21272116.html&h l=en&gl=us&ct=clnk&cd=1

Good news by Eightyford · 2006-01-26 08:25 · Score: 3, Insightful

This is good news for the Wayback Machine at archive.org. I think the case against google images, and especially google video is a little stronger, however.

--
Religion for nerds. Stuff that really matters

Re:Good news by T-Ranger · 2006-01-26 09:28 · Score: 2, Informative

HTTP explicitly allows for caches - its part of the protocol. Associated standards have been published to restrict what can be cached, and indexed by e.g. search engines.

You can no more complain that something is caching, or indexing, your pages you have published with HTTP, then you can complain that someone is accessing them at all.

Lucky for /. Readers! by gasmonso · 2006-01-26 08:26 · Score: 2, Insightful

Google's cache is often times the only way to read an article posted on here. Also, its a good resource for people behind firewalls and can only pull up a cached version. It's a good win for Google.

http://religiousfreaks.com/

Good judgement by karolgajewski · 2006-01-26 08:26 · Score: 3, Insightful

Finally, a frivolous lawsuit that got its just desserts. We can only hope that will herald a new age, where the insanely stupid lawsuits are going to fianlly the death they so rightly deserve.

--
- .k. -

The judge then got down from his bench by Jim+in+Buffalo · 2006-01-26 08:26 · Score: 5, Funny

The judge then left the bench, walked over, and whacked the plaintiff and his council on the head with a salami.

--
This sig, aah-ah, is comin' like a ghost-sig...

cache everything by digitaldc · 2006-01-26 08:26 · Score: 2, Funny

avoid a lawsuit

--
He who knows best knows how little he knows. - Thomas Jefferson

obvious joke by grungebox · 2006-01-26 08:27 · Score: 3, Funny

So who has a link to the Google cache of the article?

I don't like this ruling. by mumblestheclown · 2006-01-26 08:31 · Score: 4, Interesting

Intellectually, I don't like this ruling one bit. "Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons. Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads. Rememeber folks, in terms of the cache here, we're referring to google's ability to serve content IN ITS ENTIRETY to end-users - we're not talking about those tiny snippets needed to make search engine results useful.

Those of you who do the "yesbutNOCACHEtag" dance have got it backwards to: it's not the responsibility of the copyrightholder to sing to the tune of whatever the latest fad is. Rather, it's the other way around - google should convince people that it's in their interest to put a "CACHEME!" tag.

Re:I don't like this ruling. by Routerhead · 2006-01-26 08:40 · Score: 2, Insightful

It seems a stretch to argue that Google is providing this service for financial gain. For starters, when a user pulls up a Google cached page, they don't get Google's ads on that page. They get, as you noted, the actual site as it was when Google cached it, complete with that site's ads, if there are any.

In addition, the cached version is never anything more than a poor substitute for the actual site. Text comes through, but much of the site's look and feel is lost. If Google wanted to hijack this site, as you seem to suggest, they'd want to incorporate the subordinate links and images as well.

Finally, given that users tend to resort to the cached version only when the actual site itself is down, it is hard to argue that the site itself is taking a financial hit. The user can't get there in the first place.

--
In tabulario donationem feci.
Re:I don't like this ruling. by schon · 2006-01-26 08:58 · Score: 4, Insightful

"Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder

Before we address this (false) assumption, here's what's happening:

Copyright holder makes a web page available *FOR FREE* to the general public. Google caches it. Please explain how Google's cache financially hurts the copyright holder. Providing something *FOR FREE* that is available *FOR FREE* would seem to have a "nil financial effect on the copyright holders", no?

Google's cache is basically a large-scale financial transfer from the copyrightholders

Sorry, WHAT ?!!??!

Financial == monetary matters. I haven't checked in the past 5 minutes, but prior to that, someone visting your (free, publically accessible) website doesn't move money from your bank account to theirs.
Re:I don't like this ruling. by pthisis · 2006-01-26 09:09 · Score: 4, Insightful

"Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons.

This is not true. Ebert & Roeper can show a movie clip, do some scathing commentary, decrease the film's box office take, and make money in the process, and that's clearly fair use. Protecting negative criticism is one of the core philosophical reasons behind fair use (originally, anyway) and it clearly has the potential to have devastating financial effects on the copyright holder.

--
rage, rage against the dying of the light
Re:I don't like this ruling. by Goldsmith · 2006-01-26 09:13 · Score: 2, Interesting

Before you continue to smash people over the head with blunt and incorrect legal arguments, you should actually read the law which was referenced. You may find that it is not any clause defining Fair Use as you have quoted in other places, but rather a clause on catching. The link is right there on the front page.

As you have said:
"Unfortunately, while you may WISH this to be the case, this is not what the law says. Your wishful thinking doesn't make your statement true. You've basically invented your theory out of whole cloth, or, put another way, pulled it out of your ass."
Re:I don't like this ruling. by Routerhead · 2006-01-26 09:13 · Score: 3, Interesting

The moment one decides to put something on the Internet, he loses a large chunk of control over that content. Caching is an inherent, and necessary, component of Internet technology. Searching as a whole does not work without it.

Your original post was that the original site owner was entitled to relief because of lost financial gain (due to users viewing Google's ads rather than his own). You now present a new argument: content control. However, posting any content on the Internet is entails a conscious surrender of control, and in the bargain that control is surrendered for convenience: distribution, access, attention, what have you.

As to your coffee shop analogy, there is a significant difference. Google is not making any money from its action. No Google ads appear on the cached sites, unless they were there already. In your coffee shop, you are selling coffee to those photocopy readers. Google is not, as you condeded above. For a copyright violation to take place, the violator has to be receiving a material gain. It is not at all clear that Google is receiving any financial gain.

As for metatagging, no standard exists. You are arguing for a standard (be it an opt-in or opt-out). I agree that a standard should exist. However, in the specific case of Google, Google is clear in describing the steps that a site administrator can take to exclude its content from Google's cache.

I'm not trying to suggest that this is a one-way street and that no copyright protection exists for Internet content, but, particularly in this case, it's hard to see that Google was stealing his site or its contents.

--
In tabulario donationem feci.
Re:I don't like this ruling. by prSpectiv2 · 2006-01-26 09:32 · Score: 2, Insightful

explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore

I'd say let's not be ridiculous but it looks like we're far too late for that. Books are "Products," aka items sold in exchange for access to intellectual or artistic property. If you don't own the book you can't give others access to it in the first place.

Maybe you didn't get the memo, but the ENTIRE INTERNET was designed to promote the free dissemination of publicly available information. Sites are not "Products," they are frameworks that allow access to content, which may or may not be free. When FREE CONTENT is sent unchanged from one computer to the next that means the web is working. If one party wants to restrict access to their content, they slap a password and encryption over it. It's common knowledge that this is how the web works: free until stated otherwise.

Your argument boils down to restricting the redistribution of content that is explicitly intended to be freely sent through dozens of computer systems before it's burned onto my monitor. By your argument, google should not only have to remove its cache, but all links to content that owners might not want to be accessible at any given moment, which adds up to every page on the net.

--
Nice guys don't finish last. In reality, they're abducted halfway through the race.
Re:I don't like this ruling. by dasil003 · 2006-01-26 09:33 · Score: 4, Insightful

Allow me to quote your original message:

Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads.

While I don't think it's an open and closed case, you won't convince anyone by spouting such hyperbole.

First of all, you're ignoring the fact that the main link goes to the actual website, and the only way to get to the cache is through a smaller link labeled "cache", which is unlikely to be clicked by someone who doesn't already know what it means. Furthermore, once they do click they get a very clear and prominent message in a frame stating that it is in fact a cache, and even explains what the cache is and how it works. So to say that it is a 'large-scale financial transfer' is just silly. At worst it's an extremely small-scale 'financial transfer', and that's a very dubious point to begin with.

So then in your response you say:

Nonsense. Google gains by drawing attention to itself and its other offerings. Let me ask you this simple question: if google doesn't gain financially, even if indirectly, why do they do it? Goodwill? Poppycock.

The same could be said about search in general. They are profiting off of other people's content, because without it there'd be nothing to search. I'm sorry, but there's no evidence that the cache hurts websites financially. And there is certainly plenty of anecdotal evidence where it helps (eg. site goes down, customers go to cache to get phone # or contact email). Note that I'm not using that as support of Google's right to cache, merely as a counterargument against your baseless assertions.

You are basically saying that GOOGLE has the right to display YOUR content as GOOGLE sees fit.

Again with the hyberbole. No one believe Google has the right to display your content any way they want. What some people are arguing is that Google has the right to display your content as they do which is quite reasonable, and not deceptive in the least.

In the google cache situation, the owner effectively loses that right.

No, the page will disappear from Google once it is permanently down, it just takes some time. This may be a valid point about the internet archive, but then again information that finds its way on to the internet anywhere is likely to stick around for a long time.

In other news, explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore (for example, without photos) and offer them for free reading in my coffeeshop?

Um, because books cost money?
Re:I don't like this ruling. by slackmaster2000 · 2006-01-26 09:39 · Score: 2, Interesting

Sorry. I'm on your side to some degree in this argument, but your claim that google isn't gaining financially isn't true.

Google offers its cache as part of its search engine service. The service itself makes money, and is popular for its entire featureset. Just because a specific feature does not itself generate revenue doesn't mean that it doesn't indirectly contribute to financial gain.

A bathroom in a coffee shop is a really nice feature that doesn't itself generate a single dime.

Anyhow, you don't have to be making money from copyright material to be infringing on a copyright, so this whole part of the argument is kind of pointless.
Re:I don't like this ruling. by pthisis · 2006-01-26 10:20 · Score: 2, Interesting

1. That has little to do with the post I responded to which claimed that Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons.

Maybe you can dance around and try to claim that actually the use itself didn't affect the film's box office take but the reviews did (though in many cases of parodies the fair use and the criticism are intimately connected). It's certainly not for personal reasons. And, realistically, in a lot of cases (more commonly with more Wowza! reviewers) the "clip from an upcoming movie" is itself the product being sold, and people don't watch for the value-added commentary.

2. The primary way that commercial uses change court interpretations of fair use is that courts tend to put a higher burden of proof on the defendent to prove that their commercial use is fair (typically, if the user is non-commercial than they place a higher onus on the plaintiff to demonstrate that the use is unfair).

But commerciality--even an impact on the marketability of the original--does not prima facie demonstrate that a use is unfair. Indeed, in that Slashdot baby "Sony Corp. of America v. Universal City Studios, Inc" a use that clearly affects the marketability of the copyrighted product was upheld (after all, if you cannot tape a movie when it airs on the networks and time-shift your viewing, you're more likely to buy the official movie release).

The basic balancing test required to ensure that use is fair use says otherwise to what you state.
I encourage you to read the following: Definition of Fair Use [auburn.edu] (pay special attention to points 3 and 4 there). I also encourage you to find out what a "balancing test" means, legally, in case that term is new to you.

I'd recommend the same, since your first sentence here shows a startling lack of understanding of the concept. In particular, the Court has been pretty one-sided about this portion of the balancing act: if you can't demonstrate that your market was affected then it's tough to get a court to agree that a use is unfair, but market affect on its own is proof of little.

IOW, this clause is generally invoked to protect fair use, but violating it alone doesn't go far toward showing that a use is not, on balance, fair.

If it's a litmus test at all (which it really isn't, since "no commercial impact on the copyright holder" is normally a defense but not always), it's a litmus test in the opposite direction of what you're claiming.

--
rage, rage against the dying of the light
Re:I don't like this ruling. by trawg · 2006-01-26 12:51 · Score: 2, Insightful

Copyright holder makes a web page available *FOR FREE* to the general public. Google caches it. Please explain how Google's cache financially hurts the copyright holder. Providing something *FOR FREE* that is available *FOR FREE* would seem to have a "nil financial effect on the copyright holders", no?

This is a really interesting topic, for me at least. I mirror a lot of files for Australian users on various websites over here. One thing I've found is that the vast majority of sites (at least the big ones) have a standard Terms and Conditions page, that generally say something along the lines of "you are not allowed to reproduce any of the content on this page".

I always respect this and don't mirror anything from any sites that have this in their T&Cs (without asking for permission first). Unfortunately this means that any Australian that wants to download something has to go through the international site and can't use a local mirror.

Now, my reckoning is that it definitely doesn't cost these companies anything if I mirror their files and make them available. If anything, it is a) saving them money on bandwidth costs and b) increasing exposure of their products and services, as we always link back to the parent site.

However, I assume that from their perspective, they don't want people mirroring distributing awesome_application.exe because a malicious website owner could put up a trojaned, virused or otherwise bad version of their file. Users then cry and go to them for support and they have to try and clean up the mess. We approached Microsoft to mirror their files for our users but were denied; while they never said this was the reason I think it is a safe assumption.

There are ways to limit these problems, like md5 checking, PGP signing, but I assume these are way out of the reach of regular users who just want to Click Download And Run.

Anyway, a little off-topic for your post, but I thought it might be relevant as one of the downsides to allowing people wholesale access to mirror/reproduce/redistribute/copy your works.

Comment removed by account_deleted · 2006-01-26 08:34 · Score: 5, Insightful

Comment removed based on user account deletion

Whooooosh! by DrEldarion · 2006-01-26 08:35 · Score: 3, Funny

That overwhelmingly loud noise everyone just heard was the sound of every Slashdot reader gasping at the fact that the DMCA just got used for something positive.

"fair use"? What's dat?! by Thud457 · 2006-01-26 08:35 · Score: 5, Funny

I thought we sold "fair use" a while ago for three magic beans and some DVDs.

--

the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

Re:"fair use"? What's dat?! by Anonymous Coward · 2006-01-26 09:21 · Score: 2, Funny

> I thought we sold "fair use" a while ago for three magic beans and some DVDs.

The problem with these 'Jack and the Beanstalk' allusions is that, in the story, the beans actually worked!

NOARCHIVE by DarkClown · 2006-01-26 08:38 · Score: 2, Interesting

wonder if the guy bothered with a robots.txt or used the meta NOARCHIVE - not that actually preventing that was his intent.
i don't mind the google cache at all, what drives me up a wall is what jeeves and other engines do with external pages by sticking them in a frame. so, if you put code in the page to force it out of frames, then engines like yahoo penalize (or drop from the index entirely) for messing with the user navigation.....

The "Implied License" is the most interesting by cfulmer · 2006-01-26 09:05 · Score: 4, Insightful

Forget the fair use analysis, the most important thing here is the success of the "Implied License" claim. Basically, it goes like this: You operate a website. The web was created specifically with the idea that "robots" would crawl across it, and there is a standard well-known way to prevent them from crawling your site. Even more specifically, there's another standard well-known way to keep search engines from cacheing your content. Being on the web but not using these techniques means that you give search engines permission to cache your content.

It's sort of like what happens when you leave a potful of candy at your front-door on Oct. 31st. In theory, you could claim that all those kids who come to your door and help themselves are stealing. But, because everybody knows how Halloween works, you've implicitly given permission for them to do it.

In this opinion, the Fair Use analysis was basically just used as a stopgap of "what little infringement that's left after you account for the implied license is a fair use." If the website had included a robots.txt file, the fair use case would have been much harder to make.

The Implied License is a stake in the ground for "This is the Internet. The rules are different here." IMO, that's a good thing -- there are a bunch of things that just couldn't happen if you had to get explicit permission from every content owner.

Re:The "Implied License" is the most interesting by Obfuscant · 2006-01-26 11:50 · Score: 2, Insightful

The web was created specifically with the idea that "robots" would crawl across it, ...
Uhh, no. The web was created specifically with the idea that humans would crawl across it. It wasn't until the web grew beyond easy comprehension by humans that robots were created to crawl it.
Were your statement correct, the robots.txt exclusion protocol would have been part of the CERN webserver documentation from day one. It wasn't. My web pages were up for a very long time before there were robots wandering the web.
It's sort of like what happens when you leave a potful of candy at your front-door on Oct. 31st.
Yes, I know what happens. Been there, done that. The first person to visit saw an entire pot full of candy and took it all. After I refilled the pot, thinking that the selfishness of the first visitor was an abberation, the second visitor took the entire pot full of candy and the pot. Ethics are what you do when you think people aren't looking.

I believe in an opt-in Internet. by rjonesx2 · 2006-01-26 09:05 · Score: 3, Insightful

The Google cache is absolutely ridiculous. As an individual who has had quite a bit of experience on both sides of the white hat / black hat search engine industry, the cache is NOT a webmaster's friend.

1. The cache removes content control away from the author. For example, a site like EzineArticles.com prevents scraping by using an IP blocking method based on the speed at which pages are spidered by that IP. It is absurdly easy to circumvent this by simply spidering the Google cache of that article instead of spidering the site. Google's IP blocking is far less restrictive, and combined with the powerful search tool, it allows for easy, anonymous contextual scraping of sites whose Terms of Service explicitly refuse it.

2. The cache extends access to removed content, often for months if not years at a time. Google rarely replaces 404 pages (perhaps it is because of their wish to have the largest number of indexed pages). I have clients who have nearly 48,000 non existent pages still cached in google that have not been present in over 14 months. Despite using 404s, 301s, etc. these pages have not yet been removed. Furthermore, Google's often mishandling of robots.txt, nocache, and nofollow leaves webmasters dependent upon search traffic hesitant to force removal of these pages using the supposedly standardized methods of removal.

3. The cache allows Google to serve site content anonymously. Don't want the owner of a site to know you are looking at their goods (think of companies grepping for competitor IPs), just watch the cache instead.

The list goes on and on. But I think the point is this...

Why should a web author have to be technologically savvy to keep his or her content from being reproduced by a multi-billion dollar US company? Content control used to be as simple as "you write it, its yours". It got a little more complicated with time to the point at which it might be useful to use, perhaps, a Terms of Service. Even a novice could write "No duplication allowed without expressed consent". Now, a web author must know how to manipulate HTML meta tags and/or a robots.txt file.

Fair use is for users, for people, not multi-billion dollar companies.

--
Fight Link Spam with LinkSleeve.org

Re:I believe in an opt-in Internet. by reve · 2006-01-26 13:08 · Score: 2, Insightful

Okay, let's say someone is scraping your site. If they are scraping your site and redistributing the information, you have a very, very clear violation of copyright and I urge you to contact your lawyer.

If it's just some guy scraping your site for his own edification, it's probably fair-use -- regardless of your crazy terms of service no one read.

Ultimately, an LWP::Simple based "scraper" is just a specialized browser -- unless they're redistributing your work.

Someone using Google to violate your rights is not Google's fault. It's an opportunity to ask for punitive damages.

--
-- r . m o s q u i t o --

May be a hint of how the book scanning will go. by Godeke · 2006-01-26 09:11 · Score: 2, Informative

After reading the actual opinion that granted summary judgement, if this same logic is applied to the scanning and offering of search on "real world" materials, Google may be able to withstand lawsuits on the book scanning effort quite well. There are some differences that could create a different outcome, but this outcome was 100% favorable to Google and the idea of indexing and caching of materials to allow such search and reference was solidly defended by the judge.

--
Sig under construction since 1998.

Slashdot Cache Ruled Fair Use!! by LesPaul75 · 2006-01-26 09:18 · Score: 3, Interesting

Ok, no more excuses Slashdot... It's time to start caching pages and preventing the Slashdot effect.

Web caches too! by slashdotmsiriv · 2006-01-26 09:25 · Score: 2, Interesting

What about web caches? They violate copyright law too! My firefox does it too. Should I use 0 Mbytes of disk space for browser caching?

Where's the engine for the robots.txt excluded? by Kazoo+the+Clown · 2006-01-26 09:31 · Score: 3, Interesting

By now someone must have created a search engine that only indexes sites whose robots.txt tells them not to index. I'm surprised I haven't heard of a particular one. Bet it would raise a few hackles though...

I'm sure you're right... by rewt66 · 2006-01-26 09:50 · Score: 2, Insightful

I'm sure you're right. I'm sure you're much smarter than the judge, and have a much better grasp of the law. What a pity that you weren't the one judging the case!

Sorry, mumbles, but I don't buy it. My money's on the judge being right, and you being a loudmouth with too much time to post over and over, as you have in reply to everybody who argued with you.

And to try to actually contribute something to the debate: So you don't like the "fair use" part of the decision. Fair enough; though as I said, between you and the judge, my money's on the judge as the one who correctly groks what fair use is all about. But that aside, what about the other three points? Most interestingly, what about the "implied license" point?

And what about the judge's assessment that Field was "attempting to manufacture a suit against Google"? That is, this wasn't about actual injury to Field, it was about Field actively looking for a chance to sue someone? Does any of that matter to you?

Very specific case, though by Anonymous+Brave+Guy · 2006-01-26 09:55 · Score: 4, Insightful

This is good news for the Wayback Machine at archive.org.

I'm not sure about that. Although the result of this case seems fair and clearly indicated on several counts, there's a lot that might not apply to archives more generally, so I'm not sure how much of a precedent has been set.

In particular, the case was brought by someone who practically admitted trying to set Google up: he knew about mechanisms like META tags and robots.txt, knew that Google was caching his site, made no attempt to stop them, and indeed actually set up robots.txt explicitly to allow bots to crawl his site. This supports Google's first two defences here, having an implied licence and estoppel.

The most interesting discussion, IMHO, is on the fair use defence. The court considered in a lot of detail whether the use made by Google qualifies as fair use. On the first criteria (how the material is being used), it was found significant that the material was being used for different purposes in the cache than on the original site: the latter was presumed artistic, while the former allowed access to the material when the original site was down, historical comparisons of the site content, highlighting of search terms that made a page relevant to the user's search, etc. Hence the court concludes as follows:

Because Google serves different and socially important purposes in offering access to copyrighted works through "Cached" links and does not merely supersede the objectives of the original creations, the Court concludes that Google's alleged copying and distribution of Field's Web pages containing copyrighted works was transformative.

The court also noted that Google made no attempt to profit from the display of the material, did not attach advertisements, made clear that the copy could be out of date, and linked clearly to the original source. (I wonder whether that non-profit, no-ads observation will come back to kick Google later...)

The other fair use discussion is less interesting, although the fact that the plaintiff had made his works available for free and not made any other attempt to profit from them was important, because this meant the market value of the original hadn't been damaged. One interesting tidbit is that apparently the SCOTUS has ruled that the fourth fair use factor (any damage to the market/value of the original work) can't be used to argue that the copyright holder could have licensed an otherwise fair use (such as the caching here) and thus the use can't be fair.

Some of the DMCA defence stuff could have quite significant implications. In particular, the fact that Google caches material only for a fairly short time (14-20 days is mentioned) is relevant, since a prior ruling about Usenet servers could be used.

In summary, Google would basically have won out on four different defences here, even without the fact that the original use might not qualify as direct copyright infringement (since the plaintiff went after the downloading done automatically in response to users; he didn't go after GoogleBot's initial copying process that caches the site on Google's system). It doesn't seem at all clear that a lot of the arguments would apply to other caching services, though: amongst other things, Google's cache in this case is temporary; known to the plaintiff, who had not tried to stop it and actually encouraged it; not for direct profit nor carrying any advertising; and clearly not damaging the market value of the original works.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

I like this ruling... by tv_dinners · 2006-01-26 15:30 · Score: 2, Insightful

..since it means that I can now create a search engine that caches and displays Google's results.

I wonder how long it would take for Google to cry foul for doing exactly what they do ?

Slashdot Mirror

Google's Cache Ruled Fair Use

41 of 213 comments (clear)