Google's Cache Ruled Fair Use

A new search engine is in order by imoou · 2006-01-26 08:24 · Score: 4, Interesting

So if someone created a search engine which automatically, randomly and non-volitionally searches and caches MP3 files from websites which do not have "no archive" metatag, it's not breaking the law?

When those searched websites disappeared, this search engine may still serve those cached MP3 files for archival purposes?

--
Uncensored Google results requested and delivered by email

Re:A new search engine is in order by Ninwa · 2006-01-26 08:26 · Score: 4, Interesting

Maybe... but there's a difference. That difference is that the items cached were already in violation of copy right law, most likely. Interesting though... and doesn't archive.org archive files? I know they've archived several small programs I've written that were linked on my site at one point in time.
Re:A new search engine is in order by PFI_Optix · 2006-01-26 09:07 · Score: 1

picked this up on another forum today. Search google.com for "kazaa lite". Scroll down to the bottom. There should be a note that one or two results were removed due to a request based on the DMCA. It may not work outside the US.
There's certainly precedent for Google removing something from its search results and/or cache.

--
120 characters for a sig? That's bloody useless.
Re:A new search engine is in order by squoozer · 2006-01-26 09:24 · Score: 1

I'm sure plenty of the material in the Google cache is there in breach of copyright law as well but they aren't being held accounatable for that so why should they be for mp3s? I think this might be a case of six of one and half a dozen of the other.

--
I used to have a better sig but it broke.
Re:A new search engine is in order by networkBoy · 2006-01-26 09:27 · Score: 1

And that's key to the issue.
If you are the copyright holder and request removal they are obligated to comply. So long as they do this then any content that is copyright but for which removal has not been requested they can cache.
If you don't want them to archive it then tell them that in the meta tag or robots.txt file and they will comply automatically.
-nB

--
whois gawk date unzip strip find touch finger mount join nice man top fsck grep eject more yes exit umount sleep dump
Re:A new search engine is in order by Anonymous Coward · 2006-01-26 09:33 · Score: 2, Informative

Google's "cooperation" was purely a mean joke on the liars, er, lawyers. They basically rubbed the shitbag^H^H^H^H^H^H^Hattorneys' noses in poo by linking to the complaint at the bottom, which included all of the URLs which were being contested - and the shitbag lawyers had no recourse. :)

It'd be akin to complaining about references to pornography being available to all at the library of congress because copyrights have been filed on pornographic photos, stories, articles, films, etc. - "OMFG the US Gubbment is making Porn available to children! Call Dubya! Think of the children!"
Re:A new search engine is in order by Anonymous Coward · 2006-01-26 09:42 · Score: 0

You are a total goddamn dumbass. Read the complaint that was sent to google! The results were not removed because Kazaa can be used to obtain copyrighted files. The complaint was sent on behalf of Kazaa, because the websites returned hosted unauthorized copies of the Kazaa software.
Re:A new search engine is in order by xstonedogx · 2006-01-26 09:47 · Score: 1

I wonder why the results were removed, as opposed to the cached copies of the websites. As far as I knew, the DMCA could only be used to force a host to remove infringing content from their website. A link is not infringing content as far as I know, unless they are arguing that those links/titles used Kazaa's trademark; but that's not what the letter says.

Does the DMCA really require the removal of links pointing to infringing content?
Re:A new search engine is in order by zcat_NZ · 2006-01-26 10:19 · Score: 1

Thanks for that. I was going to post the same comment but you beat me to it.

There's a certain irony in kazaa complaining about illegal copies of their software..

--
455fe10422ca29c4933f95052b792ab2
Re:A new search engine is in order by Anonymous Coward · 2006-01-26 10:42 · Score: 0

Yes, it is actually illegal according to the DCMA. Many website were shut down back when De-CSS was first published for merely linking to sites that had it.
Re:A new search engine is in order by Frank+T.+Lofaro+Jr. · 2006-01-26 12:08 · Score: 1

As it isn't fair to illegally modify or distribute software for illegally obtaining music. :)

--
Just because it CAN be done, doesn't mean it should!
Re:A new search engine is in order by Grimboy · 2006-01-26 12:10 · Score: 1

God I'm such a fuckwit that I didn't realize that.
Re:A new search engine is in order by Tim+C · 2006-01-26 12:13 · Score: 1

There should be a note that one or two results were removed due to a request based on the DMCA.

Gotta love the way they link to ChillingEffects.org for a copy of the complaint, rather thatn just hosting it themselves.

It may not work outside the US.

It worked on a request to google.com and google.co.uk from the UK.

--
It's official. Most of you are morons.
Re:A new search engine is in order by Grimboy · 2006-01-26 12:21 · Score: 1

Extreme example: That's the same as being a criminal for saying "I heard James on blah street can get you bombs". Sometimes you might just be saying "James lives on blah street", would that still be evil. What if you said go to "site lawyers dont like dot com with no spaces" would that still be infringement? How about if it was encoded as hexadecimal ascii? What about if you linked to another site that directly forwards you to a 'bad site'? What if you said "Oh blah.com used to be a site before it stopped existing" but that's the forwarding site? What if you said "blah dot com" used to be a site before it stopped existing"? How about that? What if you had a script that generated a truely random link (as in it randomized all the letters of the site) and it accidently linked to a "bad site"? Argh... I would join the EFF, but I can suddenly see the appeal of suicide bombing.
Re:A new search engine is in order by Grimboy · 2006-01-26 13:17 · Score: 1

It worked on a request to google.com and google.co.uk from the UK.

confirmed
Re:A new search engine is in order by AussieVamp2 · 2006-01-26 13:44 · Score: 1

Australia, too. Bizarre. Going to end up being tied in knots with 49 countries blocking every second page.

--
Not Free SF Reader
Re:A new search engine is in order by smoker2 · 2006-01-26 18:19 · Score: 1

So we're still Unforgiven then ?

Archive.org by wbechard · 2006-01-26 08:25 · Score: 1, Insightful

Google wasn't really copying as much as they were archiving the past... Look at Archive.org's way back machine. Same principal.

Re:Archive.org by LurkerXXX · 2006-01-26 08:35 · Score: 1

Yep. If they didn't want it on google, they should have said so in their robots.txt file
Re:Archive.org by HTH+NE1 · 2006-01-26 08:44 · Score: 1

Not only robots.txt, but I've encountered sites that if you were to attempt to read the Google Cache with Javascript enabled, it would redirect you to their live site. Usually this happens in the form of frameset redirection scripts, but there were cases where it would redirect you to the site's front page. In those latter cases, it often means that all external links to any page on those sites would redirect to the front page (anti-deep-linking).

--
Oh, say does that Star-Spangled Banner entwine / The myrtle of Venus with Bacchus's vine?

not anymore than any browser by scharkalvin · 2006-01-26 08:25 · Score: 2, Insightful

Most browers have a built in cache. They don't violate copyright law
do they?

Re:not anymore than any browser by metternich · 2006-01-26 08:45 · Score: 5, Insightful

Browsers don't make their caches publicly availible. You're comparing apples and oranges.

--
Facts do not cease to exist because they are ignored.
Re:not anymore than any browser by LiquidCoooled · 2006-01-26 08:48 · Score: 1

You tell me.

Heres the Google cache for one of your own pages.
Thousands of people can view the copyrighted content of your pages - even after you remove it from the web.

--
liqbase :: faster than paper
Re:not anymore than any browser by rabbit994 · 2006-01-26 09:09 · Score: 1

However, adding a quick robots.txt to your webpage root can fix this cache problem right up. Heck, you can just add meta tags as well if you want. I used to do that with my web pages and guess what, google obeyed the request not to cache.
Re:not anymore than any browser by squoozer · 2006-01-26 09:18 · Score: 2, Interesting

True, but there is a subtle difference between the Google cache and your local cache - Google are making money from displaying other peoples work. Ok maybe it's not 100% direct, they don't have adverts on the cached page (yet) but it is an additional service they are providing that draws people. More people == more money ergo you can conclude that they are profiting by displaying our work.

To perhaps make it clearer: Many of the articles I write are one page long and my time is funded by the adverts on my site. The cached google page shows the whole of an article but far fewer adverts than the real page therefore I am losing out on revenue because of Googles cache. I think it is difficult to argue that Googles use is fair as they are displaying the whole article. Fair use was supposed to protect people that wanted a snippet or two for review purposes and that sort of thing.

Imagine if you reprinted a chapter of a book or the leading article of a newspaper. You would be sued before you knew what was happening and yet Google seem to get away with copying and republishing whole websites.

--
I used to have a better sig but it broke.
Re:not anymore than any browser by Anonymous Coward · 2006-01-26 09:24 · Score: 0

How about Squid caches, then?
Re:not anymore than any browser by Ectospheno · 2006-01-26 09:47 · Score: 1

You're comparing apples and oranges.

So...
Re:not anymore than any browser by KeatonMill · 2006-01-26 09:57 · Score: 1

However, consider the instances when people USE the cache: when the original site is no longer available.
Re:not anymore than any browser by mystik · 2006-01-26 10:33 · Score: 1

You install a piece of softare on to your daughter's computer w/o a 2nd license. Is this infringment or fair use? It's not publically distributed.

A copy by any other name is still a copy, and governed by copyright law, and you can only do what the copyright holder's permit you to do.

A web site owner would have a difficult time serving content if he forbid any caching whatsoever. (not just Expires headers,etc.)

--
Why aren't you encrypting your e-mail?
Re:not anymore than any browser by mattwarden · 2006-01-26 10:50 · Score: 1

two words: proxy cache.
Re:not anymore than any browser by imthesponge · 2006-01-26 11:28 · Score: 2, Insightful

Or when sites provide content to Google's spiders but deny it to everyone else *cough*expertsexchange*cough*.
http://www.experts-exchange.com/Storage/Q_21272116 .html
http://www.google.com/search?q=cache:GyTRLbavSIgJ: www.experts-exchange.com/Storage/Q_21272116.html&h l=en&gl=us&ct=clnk&cd=1
Re:not anymore than any browser by typical · 2006-01-26 12:09 · Score: 1

Your argument is that we need to shift more money into content creators (presumably because we lack content).

There are thousands of lifetimes worth of content on the Internet. Our problem is in it going down and becoming stale. Google and Wayback solve that problem. This was the right decision for society.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.
Re:not anymore than any browser by rts008 · 2006-01-26 12:39 · Score: 1

They do on a Windows box, maybe not deliberately, but they do.

--
Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti
Re:not anymore than any browser by squoozer · 2006-01-26 21:40 · Score: 1

While I admit that as an end user this is a useful feature it doesn't give Google the right to republish (my) work without my permission. In fact I would go as far as to say that it is worse if I have removed the content and I am specifically denying people access to the work. To draw an analogy to regular publishing what you are suggesting is that if an author (or whoever the copyright owner is) decides not to publish a book any more "the people" have a right to just go and print themselves off a copy or photocopy the libraries copy. I think part of the problem lies in the confusion surround the word free. Something that is given away for free doesn't necessarly mean that it has no value. I don't mind making my work free to view because some kind people donate a little money or click an ad but costing no money to view and giving up my copyright are two fundamentally different things.

--
I used to have a better sig but it broke.
Re:not anymore than any browser by squoozer · 2006-01-26 22:03 · Score: 1

But completely the wrong decision for content creators. If there was no money in making content then very few people would make it. I would still add things to my (main) website as I enjoy helping people but I can tell you that the money it brings in is a really big incentive to add more content and work harder. So you have a choice: Option 1) A little bit of content and its free for anyone to do anything they like with. Option 2) Decent amounts of content that I provide for free but that I exercise a few rights over such as you can't republish it without permission. Only someone that doesn't produce anything would want option 1.

--
I used to have a better sig but it broke.

Good news by Eightyford · 2006-01-26 08:25 · Score: 3, Insightful

This is good news for the Wayback Machine at archive.org. I think the case against google images, and especially google video is a little stronger, however.

--
Religion for nerds. Stuff that really matters

Re:Good news by gEvil+(beta) · 2006-01-26 08:28 · Score: 1

I was going to say the same thing. Then I read the last sentence of TFA: "The decision is replete with interesting findings that could have important consequences for the search engine industry, the Internet Archive, the Google Library Project lawsuit, RSS republishing, and a host of other online activities." It should be interesting to see how far 512(b) goes.

--
This guy's the limit!
Re:Good news by sepluv · 2006-01-26 08:33 · Score: 1

I don't think Google Video caches videos from the WWW. I think it just allows copyright holders to upload their own videos to Google Video.
Google Images is just a version of the normal Google SE that only brings up images. It doesn't do any more caching than Google does anyway (when caching both text and images), not that whether the content is stored as an image or text is relevant.

--
Joe Llywelyn Griffith Blakesley
[This post is in the public domain (copyright-free) unless otherwise stated]
Re:Good news by Eightyford · 2006-01-26 08:51 · Score: 1

Google Images is just a version of the normal Google SE that only brings up images. It doesn't do any more caching than Google does anyway (when caching both text and images), not that whether the content is stored as an image or text is relevant.

But Google Images serves up pictures without the advertisements that support the content creators. This the same argument as PVRs and skipping commercials. Of course, an argument can be made for using a robots.txt file, but it could be argued that this shouldn't be the creators responsibility to implement.

--
Religion for nerds. Stuff that really matters
Re:Good news by Richard_at_work · 2006-01-26 08:51 · Score: 1

Google Video allows anyone to upload, not just the copyright holder. Pretty sure Topgear didnt upload this gem.
Re:Good news by T-Ranger · 2006-01-26 09:28 · Score: 2, Informative

HTTP explicitly allows for caches - its part of the protocol. Associated standards have been published to restrict what can be cached, and indexed by e.g. search engines.

You can no more complain that something is caching, or indexing, your pages you have published with HTTP, then you can complain that someone is accessing them at all.
Re:Good news by Eightyford · 2006-01-26 09:59 · Score: 1

You can no more complain that something is caching, or indexing, your pages you have published with HTTP, then you can complain that someone is accessing them at all.

I can complain about whatever I want, but that's not the point. Google Image Search is different than mere caching. It's like giving away taped television episodes but replacing the commercials with your own ads. I, personally, don't think Google's Image Search should be illegal, but that doesn't mean the issue is as clear-cut as you make it out to be.

--
Religion for nerds. Stuff that really matters
Re:Good news by Lehk228 · 2006-01-26 10:13 · Score: 1

have you actually used GIS before?

it gives you a page of tiny thumbnails which link back to a framed version of the source page and a link to the original image.

--
Snowden and Manning are heroes.
Re:Good news by Eightyford · 2006-01-26 10:16 · Score: 1

have you actually used GIS before?

No. Never...

--
Religion for nerds. Stuff that really matters
Re:Good news by Kelson · 2006-01-26 10:32 · Score: 1

Actually, it's more like handing out the covers to the relevant DVDs and an index saying what shelf you can find them on.
Re:Good news by mikek3332002 · 2006-01-26 10:42 · Score: 1

Also good news for squid and other proxy/peering services.

Awesome by Anonymous Coward · 2006-01-26 08:26 · Score: 0

This should apply to archive.org as well, right?

As far as I'm aware archive.org hadn't had a specific test case yet.

Re:Awesome by ZachPruckowski · 2006-01-26 08:31 · Score: 1

Archive.org is safer than Google's cache, since it doesn't get as up-to-date stuff. Google's cache is at most like a week or two out of date, right? And it's cache of major sites is updated like hourly or better. It is possible to see the full functionality of slowly updating sites on Google cache, allowing me to get all their content without visiting the site.

Lucky for /. Readers! by gasmonso · 2006-01-26 08:26 · Score: 2, Insightful

Google's cache is often times the only way to read an article posted on here. Also, its a good resource for people behind firewalls and can only pull up a cached version. It's a good win for Google.

http://religiousfreaks.com/

Good judgement by karolgajewski · 2006-01-26 08:26 · Score: 3, Insightful

Finally, a frivolous lawsuit that got its just desserts. We can only hope that will herald a new age, where the insanely stupid lawsuits are going to fianlly the death they so rightly deserve.

--
- .k. -

Re:Good judgement by 42Penguins · 2006-01-26 08:41 · Score: 1

"We can only hope that will herald a new age, where the insanely stupid lawsuits are going to fianlly the death they so rightly deserve."

You must be new to this country...
Re:Good judgement by Highrollr · 2006-01-26 17:06 · Score: 1

The American legal system does a fairly good job of tossing frivolous lawsuits. This is not to say it couldn't be improved, of course, but the fact remains. I direct your attention to the Spotlight fallacy.

The judge then got down from his bench by Jim+in+Buffalo · 2006-01-26 08:26 · Score: 5, Funny

The judge then left the bench, walked over, and whacked the plaintiff and his council on the head with a salami.

--
This sig, aah-ah, is comin' like a ghost-sig...

cache everything by digitaldc · 2006-01-26 08:26 · Score: 2, Funny

avoid a lawsuit

--
He who knows best knows how little he knows. - Thomas Jefferson

Re:cache everything by Anonymous Coward · 2006-01-26 08:51 · Score: 0

avoid a lawsuit
If you can't avoid a lawsuit you could always make a cache settlement out of court.

Oh well by ch-chuck · 2006-01-26 08:27 · Score: 1

Mr. Field will just have to find another way of making a fortune. "Jackpot Justice" didn't pay off this time.

--
try { do() || do_not(); } catch (JediException err) { yoda(err); }

obvious joke by grungebox · 2006-01-26 08:27 · Score: 3, Funny

So who has a link to the Google cache of the article?

Re:obvious joke by slashdotnickname · 2006-01-26 08:49 · Score: 1

So who has a link to the Google cache of the article?

I'm trying to look it up on goolge.cn, but all I'm getting are pages about the virtues of some sort of party.

RIAA by CriminalNerd · 2006-01-26 08:28 · Score: 1

That Mr. Field should start working for the RIAA.

I don't like this ruling. by mumblestheclown · 2006-01-26 08:31 · Score: 4, Interesting

Intellectually, I don't like this ruling one bit. "Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons. Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads. Rememeber folks, in terms of the cache here, we're referring to google's ability to serve content IN ITS ENTIRETY to end-users - we're not talking about those tiny snippets needed to make search engine results useful.

Those of you who do the "yesbutNOCACHEtag" dance have got it backwards to: it's not the responsibility of the copyrightholder to sing to the tune of whatever the latest fad is. Rather, it's the other way around - google should convince people that it's in their interest to put a "CACHEME!" tag.

Re:I don't like this ruling. by Routerhead · 2006-01-26 08:40 · Score: 2, Insightful

It seems a stretch to argue that Google is providing this service for financial gain. For starters, when a user pulls up a Google cached page, they don't get Google's ads on that page. They get, as you noted, the actual site as it was when Google cached it, complete with that site's ads, if there are any.

In addition, the cached version is never anything more than a poor substitute for the actual site. Text comes through, but much of the site's look and feel is lost. If Google wanted to hijack this site, as you seem to suggest, they'd want to incorporate the subordinate links and images as well.

Finally, given that users tend to resort to the cached version only when the actual site itself is down, it is hard to argue that the site itself is taking a financial hit. The user can't get there in the first place.

--
In tabulario donationem feci.
Re:I don't like this ruling. by prSpectiv2 · 2006-01-26 08:43 · Score: 1

Let's keep things in perspective here. No one is losing due to missed revenue stream because Google's cache links preserve the original content in its entirety, including the content host's advertisements. As far as I've seen, Google strips all multimedia from cached pages and replaces it with links to the original server. So hosts still get click throughs and view-based ad revenues. Also, google does not advertise on top of cached pages, just provides links to the original page.

Besides, people (aka 'me') probably only use Google Cache when the original content provider is offline. If anything, content providers should be thanking Google for the backup.

--
Nice guys don't finish last. In reality, they're abducted halfway through the race.
Re:I don't like this ruling. by Anonymous Coward · 2006-01-26 08:47 · Score: 0

When you put something on the internet, you are intentionally making it available for anyone to access and do pretty much whatever they want with it. It's up to the person who provides content on the internet to try to restrict it (i.e. place copyright notices or whatever). It's not job of the person who accesses the content to ask for permission on what to do with it. Don't put it out in public if you don't want anyone copying it. It's just common sense.
Those who do the "yesbutNOCACHEtag" dance have got it exactly right. Don't put the bloody thing on the internet if you don't want it cached. That's how the internet works.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 08:52 · Score: 1, Interesting

It seems a stretch to argue that Google is providing this service for financial gain. For starters, when a user pulls up a Google cached page, they don't get Google's ads on that page.
Nonsense. Google gains by drawing attention to itself and its other offerings. Let me ask you this simple question: if google doesn't gain financially, even if indirectly, why do they do it? Goodwill? Poppycock.
They get, as you noted, the actual site as it was when Google cached it, complete with that site's ads, if there are any.
Not really. For, as you mentioned in your next statement, the cached site loses its visual look and feel, including often graphics which can be ads. At the very least, it could be seen as damaging the site.
But, forget that - the real issue is control. You are basically saying that GOOGLE has the right to display YOUR content as GOOGLE sees fit. Regardless of whether you think google's cacheing of entire web pages is good, bad, or indifferent for web sites on the whole, the decision of whether to allow this should be only with the web sites and the DEFAULT situation, therefore, should be "NO." OF COURSE multi-billion dollar companies like yahoo have the necessary soapbox and budget to convince people to put "CACHEME=YES" tags.
I remind you, for example, that no-cache tags are followed only as a courtesy. According to this ruling, there is no particular reason (as I understand it) that a search engine or anybody else would need to honor this ("oh - we don't understand that particular nocache tag - for us not to cache you, you need to use the NoCacheXYZ tag").
Finally, your argument about "the site being down" holds no water. Often, the site is down BECAUSE THE MAINTAINER WANTS TO BRING IT DOWN. This should be the site owner's right. In the google cache situation, the owner effectively loses that right. Again, if you think that having a cache of your site for when your site is unavailable is wonderful, then agree with google to have a CACHEME=YES tag. It's a win-win and mutually agreed upon contract at that point.
In other news, explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore (for example, without photos) and offer them for free reading in my coffeeshop? remember, we're not talking about one or two books for the interested reader - we're talking about a basically free borders or barnes and noble. you'd be free to read the entire book at your leisure - for books consisting mostly of text, you'd be basically getting the whole thing free. and, if you buy some coffee, that's great too. look - i know real world analogies to the internet don't often make sense, but in this case i think my giant coffee shop isnt too far off.
Re:I don't like this ruling. by Anonymous Coward · 2006-01-26 08:55 · Score: 0

I can see where you are coming from. However, logically, considering your stated concerns, you should be a lot more worried about the transparent web caches operated by many ISPs and companies who want to reduce their bandwidth charges. They actually do cache copyrighted content in its entirety, unlike google, who don't cache any images (or sound or video).
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 08:56 · Score: 0

When you put something on the internet, you are intentionally making it available for anyone to access and do pretty much whatever they want with it.
Unfortunately, while you may WISH this to be the case, this is not what the law says. Your wishful thinking doesn't make your statement true. You've basically invented your theory out of whole cloth, or, put another way, pulled it out of your ass.
Re:I don't like this ruling. by schon · 2006-01-26 08:58 · Score: 4, Insightful

"Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder

Before we address this (false) assumption, here's what's happening:

Copyright holder makes a web page available *FOR FREE* to the general public. Google caches it. Please explain how Google's cache financially hurts the copyright holder. Providing something *FOR FREE* that is available *FOR FREE* would seem to have a "nil financial effect on the copyright holders", no?

Google's cache is basically a large-scale financial transfer from the copyrightholders

Sorry, WHAT ?!!??!

Financial == monetary matters. I haven't checked in the past 5 minutes, but prior to that, someone visting your (free, publically accessible) website doesn't move money from your bank account to theirs.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 09:04 · Score: 0, Flamebait
Fair Use
The "fair use" of a copyrighted work, including use for purposes such as criticism, comment, news reporting, teaching, scholarship, or research, is not an infringement of copyright. Copyright owners are, by law, deemed to consent to fair use of their works by others. The Copyright Act does not define fair use. Instead, whether a use is fair use is determined by balancing these factors:
- The purpose and character of the use.
- The nature of the copyrighted work.
- The amount and substantiality of the portion used in relation to the copyrighted work as a whole.
- The effect of the use on the potential market for, or value of, the copyrighted work.
In short, clearly you don't know what the hell you are talking about. The information put on a website is only "free" in the sense you don't pay money for it. However, when, for example, a company puts up a website, they are not doing so because they have extra money that they feel like giving you. Rather, they do it because you are in effect "paying" them by building their brand reputation, ego, attracting you to their other offerings, and so forth. Economics 101, man.
Re:I don't like this ruling. by pthisis · 2006-01-26 09:09 · Score: 4, Insightful

"Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons.

This is not true. Ebert & Roeper can show a movie clip, do some scathing commentary, decrease the film's box office take, and make money in the process, and that's clearly fair use. Protecting negative criticism is one of the core philosophical reasons behind fair use (originally, anyway) and it clearly has the potential to have devastating financial effects on the copyright holder.

--
rage, rage against the dying of the light
Re:I don't like this ruling. by Goldsmith · 2006-01-26 09:13 · Score: 2, Interesting

Before you continue to smash people over the head with blunt and incorrect legal arguments, you should actually read the law which was referenced. You may find that it is not any clause defining Fair Use as you have quoted in other places, but rather a clause on catching. The link is right there on the front page.

As you have said:
"Unfortunately, while you may WISH this to be the case, this is not what the law says. Your wishful thinking doesn't make your statement true. You've basically invented your theory out of whole cloth, or, put another way, pulled it out of your ass."
Re:I don't like this ruling. by Routerhead · 2006-01-26 09:13 · Score: 3, Interesting

The moment one decides to put something on the Internet, he loses a large chunk of control over that content. Caching is an inherent, and necessary, component of Internet technology. Searching as a whole does not work without it.

Your original post was that the original site owner was entitled to relief because of lost financial gain (due to users viewing Google's ads rather than his own). You now present a new argument: content control. However, posting any content on the Internet is entails a conscious surrender of control, and in the bargain that control is surrendered for convenience: distribution, access, attention, what have you.

As to your coffee shop analogy, there is a significant difference. Google is not making any money from its action. No Google ads appear on the cached sites, unless they were there already. In your coffee shop, you are selling coffee to those photocopy readers. Google is not, as you condeded above. For a copyright violation to take place, the violator has to be receiving a material gain. It is not at all clear that Google is receiving any financial gain.

As for metatagging, no standard exists. You are arguing for a standard (be it an opt-in or opt-out). I agree that a standard should exist. However, in the specific case of Google, Google is clear in describing the steps that a site administrator can take to exclude its content from Google's cache.

I'm not trying to suggest that this is a one-way street and that no copyright protection exists for Internet content, but, particularly in this case, it's hard to see that Google was stealing his site or its contents.

--
In tabulario donationem feci.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 09:16 · Score: 1

"This is not true. Ebert & Roeper can show a movie clip, do some scathing commentary, decrease the film's box office take, and make money in the process, and that's clearly fair use."
you've misinterpreted the issue. Epert and Roeper can decrease the box office's take by making a negative review. The basic balancing test required to ensure that use is fair use says otherwise to what you state.
I encourage you to read the following: Definition of Fair Use (pay special attention to points 3 and 4 there). I also encourage you to find out what a "balancing test" means, legally, in case that term is new to you.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 09:21 · Score: 0

The moment one decides to put something on the Internet, he loses a large chunk of control over that content.
Show me the special clause in copyright law where this says this. If you want, restrict your talk to the USA, since google is a US company. Basically, this is wishful thinking on your part.
Caching is an inherent, and necessary, component of Internet technology.
If you're talking about the sort of 'hardware' caching necessary for speed gains, then yes. If you're talking about google cacheing, then you're talking rubbish. The internet worked just fine before google started making 100% copies of internet pages. if google removed its 'cached' links from search results, the internet would not fall apart.
Searching as a whole does not work without it.
You are confusing the issue. Google CAN keep an INTERNAL copy of web pages and then show little fair-use snippets when displaying search results. The issue here is where google makes COMPLETE 100% COPIES FULLY AVAILABLE. This is absolutely not required for searching in any imaginable way. It just happens to be a an easy thing for google to make a few extra bucks on as a byproduct of its existing, useful, and legal search features.
Re:I don't like this ruling. by Anonymous Coward · 2006-01-26 09:21 · Score: 0

you need to stop posting.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 09:24 · Score: 1

And my point, should you take care to actually think for a moment, was that the wrong law was applied. Or, did I miss your point? We're you actually going to defend the idiot who claimed that anything put on the internet is automatically basically free-for-all because he said it was?
Re:I don't like this ruling. by Routerhead · 2006-01-26 09:29 · Score: 1

It does not make complete 100% copies available, as we discussed. Much from the site is missing, particularly aspects of its look and feel. The only thing that Google makes available is the part it uses for searching, namely the text.

Also, can you explain "hardware" caching? Software caches, not hardware. Software stores its cache using hardware.

More on point, however, is that your tacit endorsement of what you term "hardware" caching undermines your argument. Google caches but does not gain financially. An ISP caches and gains financially by more efficiently controlling its bandwidth. Is not the ISP far more vulnerable to this charge than is Google?

--
In tabulario donationem feci.
Re:I don't like this ruling. by squoozer · 2006-01-26 09:30 · Score: 1

Well said. I can't believe that Google have been getting away with it for as long as they have and this ruling seems totally back to front.

--
I used to have a better sig but it broke.
Re:I don't like this ruling. by prSpectiv2 · 2006-01-26 09:32 · Score: 2, Insightful

explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore

I'd say let's not be ridiculous but it looks like we're far too late for that. Books are "Products," aka items sold in exchange for access to intellectual or artistic property. If you don't own the book you can't give others access to it in the first place.

Maybe you didn't get the memo, but the ENTIRE INTERNET was designed to promote the free dissemination of publicly available information. Sites are not "Products," they are frameworks that allow access to content, which may or may not be free. When FREE CONTENT is sent unchanged from one computer to the next that means the web is working. If one party wants to restrict access to their content, they slap a password and encryption over it. It's common knowledge that this is how the web works: free until stated otherwise.

Your argument boils down to restricting the redistribution of content that is explicitly intended to be freely sent through dozens of computer systems before it's burned onto my monitor. By your argument, google should not only have to remove its cache, but all links to content that owners might not want to be accessible at any given moment, which adds up to every page on the net.

--
Nice guys don't finish last. In reality, they're abducted halfway through the race.
Re:I don't like this ruling. by dasil003 · 2006-01-26 09:33 · Score: 4, Insightful

Allow me to quote your original message:

Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads.

While I don't think it's an open and closed case, you won't convince anyone by spouting such hyperbole.

First of all, you're ignoring the fact that the main link goes to the actual website, and the only way to get to the cache is through a smaller link labeled "cache", which is unlikely to be clicked by someone who doesn't already know what it means. Furthermore, once they do click they get a very clear and prominent message in a frame stating that it is in fact a cache, and even explains what the cache is and how it works. So to say that it is a 'large-scale financial transfer' is just silly. At worst it's an extremely small-scale 'financial transfer', and that's a very dubious point to begin with.

So then in your response you say:

Nonsense. Google gains by drawing attention to itself and its other offerings. Let me ask you this simple question: if google doesn't gain financially, even if indirectly, why do they do it? Goodwill? Poppycock.

The same could be said about search in general. They are profiting off of other people's content, because without it there'd be nothing to search. I'm sorry, but there's no evidence that the cache hurts websites financially. And there is certainly plenty of anecdotal evidence where it helps (eg. site goes down, customers go to cache to get phone # or contact email). Note that I'm not using that as support of Google's right to cache, merely as a counterargument against your baseless assertions.

You are basically saying that GOOGLE has the right to display YOUR content as GOOGLE sees fit.

Again with the hyberbole. No one believe Google has the right to display your content any way they want. What some people are arguing is that Google has the right to display your content as they do which is quite reasonable, and not deceptive in the least.

In the google cache situation, the owner effectively loses that right.

No, the page will disappear from Google once it is permanently down, it just takes some time. This may be a valid point about the internet archive, but then again information that finds its way on to the internet anywhere is likely to stick around for a long time.

In other news, explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore (for example, without photos) and offer them for free reading in my coffeeshop?

Um, because books cost money?
Re:I don't like this ruling. by bahwi · 2006-01-26 09:36 · Score: 1

One thing I've noticed about Google Cache is it still loads the original images from the original server, so the ads are still there for those people, and with the highlighted crap it's not efficient or pretty to look at the google cache most of the time as well.
Re:I don't like this ruling. by squoozer · 2006-01-26 09:37 · Score: 1

I just checked this out with a number of my pages. There are substantially fewer ads on cached pages (1) than the real page on the site (4). The cached page is bang up to date (I checked a page that hasn't changed for a while). How can you argue, therefore, that I am not at least potentially loosing out on revenue?

--
I used to have a better sig but it broke.
Re:I don't like this ruling. by squoozer · 2006-01-26 09:39 · Score: 1

Two things: if Googles cache vanished tomorrow the Internet would still work. Copyright is implicit - you don't have to apply for it or add any notices to your work for it to apply (it is recommended that you display a copyright notice but _not_ required).

--
I used to have a better sig but it broke.
Re:I don't like this ruling. by slackmaster2000 · 2006-01-26 09:39 · Score: 2, Interesting

Sorry. I'm on your side to some degree in this argument, but your claim that google isn't gaining financially isn't true.

Google offers its cache as part of its search engine service. The service itself makes money, and is popular for its entire featureset. Just because a specific feature does not itself generate revenue doesn't mean that it doesn't indirectly contribute to financial gain.

A bathroom in a coffee shop is a really nice feature that doesn't itself generate a single dime.

Anyhow, you don't have to be making money from copyright material to be infringing on a copyright, so this whole part of the argument is kind of pointless.
Re:I don't like this ruling. by prSpectiv2 · 2006-01-26 09:48 · Score: 1

Interesting, I've never seen that happen before. In my own defense I said "as far as I've seen," but yeah you're right to an extent. Obviously in this case you could lose out on potential revenue if the user goes to cache before the direct link, but who does that if the site is up? What's really ironic is that Google's own ads aren't accessible from Google's own cache.

--
Nice guys don't finish last. In reality, they're abducted halfway through the race.
Re:I don't like this ruling. by 10101001+10101001 · 2006-01-26 09:56 · Score: 1

The moment one decides to put something on the Internet, he loses a large chunk of control over that content.

Only if you ignore copyright.

Caching is an inherent, and necessary, component of Internet technology.

No, it's a side-feature. Note, storing an image and the rendered website in memory at the same time (and displaying part of it on screen) is fundamentally no different than arranging a collection of mirrors to collage a few pictures. That's rather different than someone photocopying said images and passing them out.

Searching as a whole does not work without it.

Wrong, searching can work by storing a count of certain words/phrases, or it can work by keeping a count of the number of websites with a word that link to a website. But nothing about knowing there's 5 "the"s and 10 "I"s means being able to trivially reconstruct a website.

Your original post was that the original site owner was entitled to relief because of lost financial gain (due to users viewing Google's ads rather than his own).

Sure. Or a loss of revenue because people can't see your nifty design. Or your nifty server-side page generation.

You now present a new argument: content control. However, posting any content on the Internet is entails a conscious surrender of control, and in the bargain that control is surrendered for convenience: distribution, access, attention, what have you.

Ah, I didn't realize that they were different. In server-side generation, financial benefit and content control can be the same thing. Think of, for example, a website that updated with new images. But now you're further claiming that the internet absolves all copyright. It's rather stupid to claim that, given that a signficiant percentage of people (and especially, google) fall under treaties that protect copyright.

As to your coffee shop analogy, there is a significant difference. Google is not making any money from its action.

And I'm not making any money when I give all my friends photocopies of books from the library.

No Google ads appear on the cached sites, unless they were there already. In your coffee shop, you are selling coffee to those photocopy readers. Google is not, as you condeded above.

The better analogy is the coffeshop allowing vs not allowing free access to the photocopy readers. If they don't have to buy coffee (or view those nice ads just beyond the photocopier room), then surely they're not gaining.

For a copyright violation to take place, the violator has to be receiving a material gain.

Re-read the law again. The NET Act in the US makes it so that "personal gain" can be a basis for prosecution. This basically translates into a way to fix the loophole of the coffee-selling analogy. You can't commit copyright infringement and then merrily claim that you didn't directly benefit so you're in the clear. Btw, the NET Act is about criminal prosecution.

It is not at all clear that Google is receiving any financial gain.

Yea, not at all clear. Google just puts up the cache thing even though they've not even the smallest inkling that it's drawing in more users to their regular, ad-supported, site. Nope, Google is a charity.

As for metatagging, no standard exists. You are arguing for a standard (be it an opt-in or opt-out).

No, he/she was arguing that copyright predicates opt-in. Even without a copyright notice, one has to assume they're not allowed to copy. This is why violating the GPL is copyright infringement; without the GPL's blessing, it defaults to a no-distribute copyright policy.

I agree that a standard should exist. However, in the specific case of Google, Google is clear in describing the steps that a site administrator can take to exclude its content from Google's cache.

I'll be sure to tell that to the RIAA the next time they find my "cache" of CDs. They should have just followed my clear tagging pol

--
Eurohacker European paranoia, gun rights, and h
Re:I don't like this ruling. by 955301 · 2006-01-26 10:03 · Score: 1

bah, you logic is flawed, and the copyright laws are outdated. Once you release something into the wild you lose control. The laws need to be updated to reflect the value of the source, the value of the information and not the value of the copy.

Claiming authorship of information is the only real grievance once can have. Distribution of copy should be a separate issue. Inherently owning the rights of distribution of information you publish is hogwash. Exclusivity of distribution is artificial value that deserves to die the death it is now facing.

--
You are checking your backups, aren't you?
Re:I don't like this ruling. by Anonymous+Brave+Guy · 2006-01-26 10:09 · Score: 1

One thing I've noticed about Google Cache is it still loads the original images from the original server, so the ads are still there for those people

This is one of the ethical concerns I have for the whole Google Cache system. If the original site relies on advertising revenue, then caching it without generating the equivalent behaviour for the ads will be inherently damaging. Google's cache still hits the original site for ads, which might avoid this problem as long as the original site is up, but negates the argument made in this case that they essentially provide a socially useful service with no downside by making the content available if the original site is down. There's a pretty big downside if 90% of a site's market visits all at once because of a Slashdotting, no-one sees the ads because they're all using Google Cache, and then they don't bother to reread the same page later on the original site (why would they?) and generate the ad revenue that the number of visits should have provided.

We currently rely heavily on advertising-based financial models to support the web. You're reading a site that is funded significantly by ads, remember? Moreover, we will probably continue to do so until an effective micro-payment strategy becomes widespread. Hence adopting the position that it's too bad for web sites if their advertising models don't hold up may not be the most sensible plan here. (I have similar moral reservations about using universal ad-blockers.)

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:I don't like this ruling. by jandrese · 2006-01-26 10:10 · Score: 1

Could you post links? I've never seen that happen before and I'd like to figure out why it happened to you. The only ones I know that get blocked are Google's ads, to avoid the whole conflict of interest argument. You see how hot under the collar people get about the cache in the first place, imagine if Google put their own ads on it.

--

I read the internet for the articles.
Re:I don't like this ruling. by Anonymous Coward · 2006-01-26 10:19 · Score: 0

A bathroom in a coffee shop is a really nice feature that doesn't itself generate a single dime.
While that's true, I believe many localities have laws and regulations requiring coffee shops to have bathrooms.
Re:I don't like this ruling. by pthisis · 2006-01-26 10:20 · Score: 2, Interesting

1. That has little to do with the post I responded to which claimed that Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons.

Maybe you can dance around and try to claim that actually the use itself didn't affect the film's box office take but the reviews did (though in many cases of parodies the fair use and the criticism are intimately connected). It's certainly not for personal reasons. And, realistically, in a lot of cases (more commonly with more Wowza! reviewers) the "clip from an upcoming movie" is itself the product being sold, and people don't watch for the value-added commentary.

2. The primary way that commercial uses change court interpretations of fair use is that courts tend to put a higher burden of proof on the defendent to prove that their commercial use is fair (typically, if the user is non-commercial than they place a higher onus on the plaintiff to demonstrate that the use is unfair).

But commerciality--even an impact on the marketability of the original--does not prima facie demonstrate that a use is unfair. Indeed, in that Slashdot baby "Sony Corp. of America v. Universal City Studios, Inc" a use that clearly affects the marketability of the copyrighted product was upheld (after all, if you cannot tape a movie when it airs on the networks and time-shift your viewing, you're more likely to buy the official movie release).

The basic balancing test required to ensure that use is fair use says otherwise to what you state.
I encourage you to read the following: Definition of Fair Use [auburn.edu] (pay special attention to points 3 and 4 there). I also encourage you to find out what a "balancing test" means, legally, in case that term is new to you.

I'd recommend the same, since your first sentence here shows a startling lack of understanding of the concept. In particular, the Court has been pretty one-sided about this portion of the balancing act: if you can't demonstrate that your market was affected then it's tough to get a court to agree that a use is unfair, but market affect on its own is proof of little.

IOW, this clause is generally invoked to protect fair use, but violating it alone doesn't go far toward showing that a use is not, on balance, fair.

If it's a litmus test at all (which it really isn't, since "no commercial impact on the copyright holder" is normally a defense but not always), it's a litmus test in the opposite direction of what you're claiming.

--
rage, rage against the dying of the light
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 10:22 · Score: 1

authorship and disitribution rights are inexorably linked. otherwise, authorship has no value other than for the ego.
You might think that copyright laws are outdated, but the judge should be bound by them. remember that bit about the role of the judicial branch?
Re:I don't like this ruling. by gad_zuki! · 2006-01-26 10:29 · Score: 1

Its funny, ironically the "opt-in" crowd here will shoot you down when it comes to google. They may have a point, but at its core google is a for-profit company making off-site replicas of content. That's a little bothersome. Whats even more bothersome is that the cache ignore deleted pages. So if I take something down then its still in the cache for a long time (forever?). So to truly delete something from the web you need to make a blank page. So now we have to jump through two hoops just for google.

If MS was doing this I doubt there would be so many defenders. Unfortunately, judges and the US government really dont want to restrict the growth of such a profitable and high profile company. This is standard practice in the US. So Google gets the same preferential treatment players like MS and Sony get.
Re:I don't like this ruling. by FrostedChaos · 2006-01-26 10:39 · Score: 1

Intellectually, I don't like this ruling one bit. "Fair Use" is broadly supposed to have minimal to nil financial effects on the copyrightholder and in general the "fair user" is doing the using for personal reasons.

You completely made up this part. The law does not specify whether or not fair use will have financial impacts on the copyright holder.

Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads.

Bzzzt! Google doesn't put google ads on the cached pages.
If anything, the google cache is a large scale financial transfer from Google to the copyrightholders, because when their sites are down, their users can often use the google cache, rather than switching to using a competing site.

Those of you who do the "yesbutNOCACHEtag" dance have got it backwards to: it's not the responsibility of the copyrightholder to sing to the tune of whatever the latest fad is. Rather, it's the other way around - google should convince people that it's in their interest to put a "CACHEME!" tag.

robots.txt is not a fad, it's the de-facto standard for the internet, the same as http, TCP/IP, or DNS itself.

Rememeber folks, in terms of the cache here, we're referring to google's ability to serve content IN ITS ENTIRETY to end-users - we're not talking about those tiny snippets needed to make search engine results useful.

How about your ability to download web pages "IN THEIR ENTIRIETY" to your computer, and then view them using a "web browser"? According to your own legal philosophy, you should be barred from doing this.

So, in conclusion, I suggest that you follow your own advice-- starting with slashdot.org. Do not post again to this site without getting written approval from the "copyrightholders" (what a contrived word).
Otherwise, I will consider you just another hypocritical troll.

--
"Any connection between your reality and mine is purely coincidental." -Slashdot
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 10:41 · Score: 1

You completely made up this part. The law does not specify whether or not fair use will have financial impacts on the copyright holder.
Wrong. Do i need to post the balancing tests for Fair Use yet again?
I can't be bothered to read the rest of your post, given that I've already responded to the same false claim that you just made several times already. If you can't be asked to read the other replies before reading, then I am not going to waste my time on you.
Re:I don't like this ruling. by PapayaSF · 2006-01-26 11:02 · Score: 1

Those of you who do the "yesbutNOCACHEtag" dance have got it backwards to: it's not the responsibility of the copyrightholder to sing to the tune of whatever the latest fad is.

Ah, but it is. IANAL, but my understanding is that the copyrightholder is required to take steps to protect his work, by getting a copyright in the first place, for example. Also, trademark owners can lose trademark protection by not trying to prevent infringement. So I'd say the plaintiff's case is silly, given that a NOCACHE tag would have prevented the "infringement" in the first place.

--
Q: What does the "B." in Benoit B. Mandelbrot stand for? A: Benoit B. Mandelbrot
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 11:04 · Score: 1

but my understanding is that the copyrightholder is required to take steps to protect his work, by getting a copyright in the first place
your understanding is wrong.
Also, trademark owners can lose trademark protection by not trying to prevent infringement.
Your understanding on tradmarks is correct, but irrelevant. this story has no more to do with trademarks than it does about elephants.
Re:I don't like this ruling. by periol · 2006-01-26 11:30 · Score: 1

You seem to be missing the part about "Copyright owners are, by law, deemed to consent to fair use of their works by others". You want the government to give you copyright? Then you have to abide by what the government says qualifies as fair use of your product or idea or whatever. If you disagree, you can go to court and try to change the law. Or you can give up your copyright.

Should we get into a discussion of what fair use *should* be? Personally, I think copyright restrictions are too tight, and that fair use isn't as broad as it should be. That's my opinion of the law, but not the government's. I'm hoping in the future the government changes it's position. At least this case helps to clarify a bit the previously murky aspect of how to apply fair use on the internet.
Re:I don't like this ruling. by greenrd · 2006-01-26 11:31 · Score: 1

I'll recognise that as non-free, just as soon as corporations recognise the harm they do to communities as theft from those communities that should be stopped or compensated for. I'm not holding my breath.

--
Female Prison Rape in NY
Re:I don't like this ruling. by vonsneerderhooten · 2006-01-26 11:39 · Score: 1

Could you please elaborate?
Just saying someone is wrong doesn't help anyone, and it makes you look like a prick.
Please tell us how you think he's wrong.

If it's not the responsibility of the copyright holder to protect his/her content, then whose responsibility is it?
Re:I don't like this ruling. by periol · 2006-01-26 11:42 · Score: 1

Despite what has been said, your (potential) revenue loss is only one factor out of many in deciding whether a use of copyrighted material can be deemed fair use. I will happily come up with many examples if you would like.

One quick example can be found here.
Re:I don't like this ruling. by periol · 2006-01-26 11:48 · Score: 1

just to add two more cents, this also explains why google's cache is still considered fair use even if someone takes the content down - they already gave it away for free. they can always ask google to remove it from the cache, if they accidentally posted copyrighted material that they didn't want cached or available.
Re:I don't like this ruling. by mumblestheclown · 2006-01-26 11:58 · Score: 1

http://www.copyright.gov/faq.html
Re:I don't like this ruling. by typical · 2006-01-26 12:07 · Score: 1

Google's cache is basically a large-scale financial transfer from the copyrightholders (who serve to benefit from the ads they serve and other interaction they get from end-users visitng their site) to google, who benefits directly by keeping people longer on google's site and thus, basically, shucks them more ads.

Uh, no.

I use regular links (which don't have highlighting and the Google thingie that the top) unless the regular link isn't available. So *if* the ads are available, I'm hitting them anyway. Well, actually I'm not, because my browser filters them out, but whatever.

Plus, anyone can set up a cache. There's nothing special about Google versus, say, you, except that Google is helping me out by providing me with archived, otherwise unavailable data and you aren't.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.
Re:I don't like this ruling. by Goldsmith · 2006-01-26 12:10 · Score: 1

Nope, not going to defend him. We just disagree on which law should be applied.
Re:I don't like this ruling. by 955301 · 2006-01-26 12:15 · Score: 1

It's not the judicial branch at fault here, obviously. And authorship and distribution are not that tightly coupled. The only association they have is fear. I'm afraid that if you distribute my ideas, you'll claim them as your own. And that's very likely right now because that claim, for better or for worse equates to profit.

My hypothesis is that once you come up with an idea, you release it as fast as possible and as widely as possible such that it's inarguable that you are the author. You can profit from distributing your idea, but there will probably be better distributors. The real value instead comes from the frequency of your ideas, your aggressive push of them and the level at which they are taken up by others. Then your time and ability to train others becomes more valuable.

How you make a profit on this is beyond me... Perhaps the manufacturers begin paying the successful originating companies/persons for more ideas/drugs/research/music, etc. Then the distributors buy from manufacturers, etc.

I guess copyright stems from trying to have all three in the same company - when it starts to break down, i.e., others get ahold of your "Intellectual Property", you try to fight the breakdown with laws, non-disclosures, etc.

In the music industry, this would be a songwriter spreading their work to singers, lyric sites, etc. Singers & bands make a cut of the song and the best rendition get's more attention from distributors and the audience.

The only way all of this would work though is if the lyrics and cuts spread fast.

So, quick! Someone make a site that allows songwriters to post original songs while only making claims to authorship, musicians to upload their take on the song, and charge a fee to cover distribution and keep the originators supplying material.

--
You are checking your backups, aren't you?
Re:I don't like this ruling. by arkhan_jg · 2006-01-26 12:28 · Score: 1

In other news, explain to me again why if this is fair use, why i can't make photocopies of every book in the bookstore (for example, without photos) and offer them for free reading in my coffeeshop? remember, we're not talking about one or two books for the interested reader - we're talking about a basically free borders or barnes and noble. you'd be free to read the entire book at your leisure - for books consisting mostly of text, you'd be basically getting the whole thing free. and, if you buy some coffee, that's great too. look - i know real world analogies to the internet don't often make sense, but in this case i think my giant coffee shop isnt too far off.

Because the books are available in the bookshop for sale, not for reading. How about if the book author donates a copy to a library - he doesn't give up his copyright, but he's specifically placing his book in a place where he expects people to read it for free. Would he have the right to be annoyed if the library used a quote from the book for a poster? Or if people indeed did take the book home and read it?

If you publish an open and non-login-required website, you are defacto giving people a right to read that content. You can put up banner ads, and use a particular font, but the very nature of the web means that people are not obligated to follow your font or colour choices, or to read or click your ads as part of viewing your page. If you wish people to view website content in a particular way, then force them to do so by having to pay to login to some java controlled setup, or have a advert you must view to see the content (ala salon)

I see no difference between choosing which browser I get to view your public webpage in, and its settings, or using google's cache to view it.

Seriously - copyright is supposed to be a way to advance the public good by giving authors first dibs on a sale, not absolute control of how someone uses the content once they legally got their hands on it. That's what fair use IS - ways to use legally obtained copyrighted material that the author doesn't have to give permission for. Caching content that you've specifically made available for free and public viewing (tho not put into the public domain, I realise websites are still copyrighted) strikes me as a fair thing to do. If you don't like the way the web works, and the law works, then either don't publish it publically for free viewing in the first place, or at least take the minor step of telling search engines to stay away. Nobody forced you to put your content on a website. It's no different than if you put your artpiece up in a public park, you shouldn't be surprised if people take photos of it.

But, forget that - the real issue is control. You are basically saying that GOOGLE has the right to display YOUR content as GOOGLE sees fit.

Bingo, that's exactly right. As a theoretically visitor to your website I have EXACTLY that right - no law says I have to keep to your font choices, or use your colours. My browser caches your site for my convenience. You agreed to that when you published your webpage in the medium where that's how the internet works. Google gets to do the same on my behalf, the user. I have no problem with this for my published websites, before you ask - I wouldn't expect the rules to change for me as a publisher, because 'I' didn't bother to find out how things worked before I published my site.

Copyright is not property. It's not control. It's a limited set of rights, under a limited set of circumstances, to advance the public good.

--
Remember kids, it's all fun and games until someone commits wholesale galactic genocide.
Re:I don't like this ruling. by trawg · 2006-01-26 12:51 · Score: 2, Insightful

Copyright holder makes a web page available *FOR FREE* to the general public. Google caches it. Please explain how Google's cache financially hurts the copyright holder. Providing something *FOR FREE* that is available *FOR FREE* would seem to have a "nil financial effect on the copyright holders", no?

This is a really interesting topic, for me at least. I mirror a lot of files for Australian users on various websites over here. One thing I've found is that the vast majority of sites (at least the big ones) have a standard Terms and Conditions page, that generally say something along the lines of "you are not allowed to reproduce any of the content on this page".

I always respect this and don't mirror anything from any sites that have this in their T&Cs (without asking for permission first). Unfortunately this means that any Australian that wants to download something has to go through the international site and can't use a local mirror.

Now, my reckoning is that it definitely doesn't cost these companies anything if I mirror their files and make them available. If anything, it is a) saving them money on bandwidth costs and b) increasing exposure of their products and services, as we always link back to the parent site.

However, I assume that from their perspective, they don't want people mirroring distributing awesome_application.exe because a malicious website owner could put up a trojaned, virused or otherwise bad version of their file. Users then cry and go to them for support and they have to try and clean up the mess. We approached Microsoft to mirror their files for our users but were denied; while they never said this was the reason I think it is a safe assumption.

There are ways to limit these problems, like md5 checking, PGP signing, but I assume these are way out of the reach of regular users who just want to Click Download And Run.

Anyway, a little off-topic for your post, but I thought it might be relevant as one of the downsides to allowing people wholesale access to mirror/reproduce/redistribute/copy your works.
Re:I don't like this ruling. by DerekLyons · 2006-01-26 17:17 · Score: 1

The moment one decides to put something on the Internet, he loses a large chunk of control over that content. Caching is an inherent, and necessary, component of Internet technology.
Not quite. Caching, in it's original form as a means of short term storage, is an inherent and necessary component of Internet technology. Long term (essentially permanent) storage like Google and the Wayback machine provide isn't caching (temporarily storing a local copy), it's copying - something else entirely.
Re:I don't like this ruling. by squoozer · 2006-01-26 21:49 · Score: 1

My site has Google ads but I don't see that that as a conflict of interest as they are part of the page just like ads from, say, double click would be. Some of the Google ads do display so Google isn't going out its way to block Google ads - I just don't think the cached page is linked enough / viewed enough to get a full complement of ads.

Anyway, cached version and the real version.

The cached version of the content I'm seeing has two ads above the main content and a search for ads box above the menu. The main site has a fair few more as you can see.

--
I used to have a better sig but it broke.
Re:I don't like this ruling. by squoozer · 2006-01-26 21:58 · Score: 1

I don't know what it's like in the US but over on this side of the pond teaching establishments (schools and colleges mainly) are treated differently to individuals and businesses. While a school could get away with photocopying a chapter of a book and distributing it a business probably couldn't. Even saying that the rules have been more strongly enforced against schools in recent years and they struggle to get away with printing chapters off now.

What I don't understand is why people feel this is right at all. Is it simply because Google puts a frame across the top that says "this is a cached page". What if that frame was not included? It would be the same content that was provided but nearly indistinguishable from the original site (in some cases). Is it still right then? If it is then you are basically saying that it's ok to republish any work as long as you provide a note saying it's republished.

--
I used to have a better sig but it broke.
Re:I don't like this ruling. by logic-gate · 2006-01-26 22:06 · Score: 1

Please explain how Google's cache financially hurts the copyright holder.
By providing the information the web surfer is looking for without visiting the website, it prevents the website owner from earning money from advertising or product sales.
Re:I don't like this ruling. by tinkerghost · 2006-01-27 02:31 · Score: 1

Providing something *FOR FREE* that is available *FOR FREE* would seem to have a "nil financial effect on the copyright holders", no? Um actually I can certainly see a financial gain for the origional CW holder .... bandwidth can be very expencive --- if you /. my server and/or I run out of my BW for the week/month, a cached copy keeps my page - complete with my adds - available until I have more BW available.
Re:I don't like this ruling. by FrostedChaos · 2006-02-03 15:01 · Score: 1

Wrong. Do i need to post the balancing tests for Fair Use yet again?

Ok, you are right here. The law was written with the intent that it would have minimal impact on the industry's profits. Sorry.
Still, my point was that, in a larger sense, the welfare of the public trumps the benefit that one small industry may gain from favorable regulations.

I can't be bothered to read the rest of your post, given that I've already responded to the same false claim that you just made several times already. If you can't be asked to read the other replies before reading, then I am not going to waste my time on you.

I don't see your written permission to post!

--
"Any connection between your reality and mine is purely coincidental." -Slashdot

It was obviously a cheap ploy to make come cash by Archbob · 2006-01-26 08:32 · Score: 0

Anyone an see this was one of those idiotic lawsuits in a cheap attempts to make some cash.

Comment removed by account_deleted · 2006-01-26 08:34 · Score: 5, Insightful

Comment removed based on user account deletion

Whooooosh! by DrEldarion · 2006-01-26 08:35 · Score: 3, Funny

That overwhelmingly loud noise everyone just heard was the sound of every Slashdot reader gasping at the fact that the DMCA just got used for something positive.

Re:Whooooosh! by TubeSteak · 2006-01-26 10:16 · Score: 1

Actually, that was Rep. Howard Coble*'s head exploding

*(R-North Carolina) Chairman of the House Judiciary subcommittee on Intellectual Property & main sponsor of the DMCA

--
[Fuck Beta]
o0t!
Re:Whooooosh! by ScrewMaster · 2006-01-26 11:35 · Score: 1

Imploding, actually ... that whooshing noise was ambient air filling up the internal vacuum.

--
The higher the technology, the sharper that two-edged sword.
Re:Whooooosh! by Frank+T.+Lofaro+Jr. · 2006-01-26 11:52 · Score: 1

Even the Communications Decency Act is used for something positive. The ISP exemption survived even though the prohibitions were struck down.

Ironically, what remains of the law is pro-freedom.

As for this court decision, WAY TO GO!

The Nevada District Court (the one in Las Vegas, btw - way to go for my city) made the right decision, and justified it exceedingly well. So many different legal questions under many different legal theories were addressed and multiple factors considered and the decision was as in our favor as much as it could be. This is amazingly good news, especially in a time when many technology based legal decisions go badly.

I am shocked at all the people attacking the ruling. Is Slashdot more anti-court than pro-freedom??
I hope this case sets a strong precendent.

--
Just because it CAN be done, doesn't mean it should!
Re:Whooooosh! by typical · 2006-01-26 12:04 · Score: 1

Uh, no. The clause that was relevant was an *exemption*. I.e. the DMCA's regular restrictions that would have made Google liable *don't* apply here.

All this means is that someone didn't manage to abuse the DMCA in a particular instance. While this is nice, it's hardly amazing.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.

"fair use"? What's dat?! by Thud457 · 2006-01-26 08:35 · Score: 5, Funny

I thought we sold "fair use" a while ago for three magic beans and some DVDs.

--

the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

Re:"fair use"? What's dat?! by Anonymous Coward · 2006-01-26 09:21 · Score: 2, Funny

> I thought we sold "fair use" a while ago for three magic beans and some DVDs.

The problem with these 'Jack and the Beanstalk' allusions is that, in the story, the beans actually worked!
Re:"fair use"? What's dat?! by Poltras · 2006-01-26 10:17 · Score: 1

So didn't they in the Brothers Grimm. When fiction meets reality...

--
Of Code And Men

NOARCHIVE by DarkClown · 2006-01-26 08:38 · Score: 2, Interesting

wonder if the guy bothered with a robots.txt or used the meta NOARCHIVE - not that actually preventing that was his intent.
i don't mind the google cache at all, what drives me up a wall is what jeeves and other engines do with external pages by sticking them in a frame. so, if you put code in the page to force it out of frames, then engines like yahoo penalize (or drop from the index entirely) for messing with the user navigation.....

Re:NOARCHIVE by josepha48 · 2006-01-26 08:48 · Score: 1

I was going to say the same thing, then I saw your post. I contacted google and asked them not to cache my pages years ago and they said, to put add a meta tag with that in it and my pages would not be cached. It works. What are these guys so into sue someone today that they can't code their pages right?

--
Only 'flamers' flame!
Does slashdot hate my posts?
Re:NOARCHIVE by hogfat · 2006-01-26 09:12 · Score: 1

It seems he knew exactly how to prevent Google's caching of his pages, yet decided not to disallow it. In fact, the ruling seems to indicate that he submitted his site to Google and created his copyrighted works in order to bring about this lawsuit.
Re:NOARCHIVE by Anonymous Coward · 2006-01-26 10:53 · Score: 0

I guess it all depends on if you believe in opt-out or opt-in.

I can tell you that most telephone customers believe in opt-in, whereas most telemarketers believe in opt-out. The reason is that telemarketers loose money in the opt-in scenario.

The same situation applies to Google and the rest of the search industry. For them to have a product, they need to have a big index. However, if for some reason, some web content developer wants to be the sole source of their content they should be able to do that without having to jump through a dozen different hoops. Sure Google wants to make the cost fall on the web devloper rather than on them. But I would argue that Google needs to have a morally strong position that doesn't externalize the costs onto millions of web developers.
Re:NOARCHIVE by Anonymous Coward · 2006-01-26 13:59 · Score: 0

NOARCHIVE, nor the ROBOTS.TXT files work. I know, I've tried it to keep Google out of the areas I did not want it to go to, it still spiders/caches it - one YEAR later.

The oddest thing I EVER saw with Google spidering my site, was when it spidered an url INSIDE one of my password protected pages. It cached the redirected page (I.e. the login screen), but it STILL creeps me out that google managed to find the URL. Maybe it was the Google toolbar my companion was using?

Who Knows?
Re:NOARCHIVE by josepha48 · 2006-01-27 10:47 · Score: 1

Its a web standard to use meta tags and the noarchive is a search engine standard as is robots.txt.

--
Only 'flamers' flame!
Does slashdot hate my posts?

salami? by commodoresloat · 2006-01-26 08:39 · Score: 1

You misspelled "large trout."

The "Implied License" is the most interesting by cfulmer · 2006-01-26 09:05 · Score: 4, Insightful

Forget the fair use analysis, the most important thing here is the success of the "Implied License" claim. Basically, it goes like this: You operate a website. The web was created specifically with the idea that "robots" would crawl across it, and there is a standard well-known way to prevent them from crawling your site. Even more specifically, there's another standard well-known way to keep search engines from cacheing your content. Being on the web but not using these techniques means that you give search engines permission to cache your content.

It's sort of like what happens when you leave a potful of candy at your front-door on Oct. 31st. In theory, you could claim that all those kids who come to your door and help themselves are stealing. But, because everybody knows how Halloween works, you've implicitly given permission for them to do it.

In this opinion, the Fair Use analysis was basically just used as a stopgap of "what little infringement that's left after you account for the implied license is a fair use." If the website had included a robots.txt file, the fair use case would have been much harder to make.

The Implied License is a stake in the ground for "This is the Internet. The rules are different here." IMO, that's a good thing -- there are a bunch of things that just couldn't happen if you had to get explicit permission from every content owner.

Re:The "Implied License" is the most interesting by Obfuscant · 2006-01-26 11:50 · Score: 2, Insightful

The web was created specifically with the idea that "robots" would crawl across it, ...
Uhh, no. The web was created specifically with the idea that humans would crawl across it. It wasn't until the web grew beyond easy comprehension by humans that robots were created to crawl it.
Were your statement correct, the robots.txt exclusion protocol would have been part of the CERN webserver documentation from day one. It wasn't. My web pages were up for a very long time before there were robots wandering the web.
It's sort of like what happens when you leave a potful of candy at your front-door on Oct. 31st.
Yes, I know what happens. Been there, done that. The first person to visit saw an entire pot full of candy and took it all. After I refilled the pot, thinking that the selfishness of the first visitor was an abberation, the second visitor took the entire pot full of candy and the pot. Ethics are what you do when you think people aren't looking.
Re:The "Implied License" is the most interesting by Anonymous Coward · 2006-01-26 14:08 · Score: 0

I suppose the same argument can be said about houses. If you leave your door unlocked, you are INVITING people in to your home.

I believe in an opt-in Internet. by rjonesx2 · 2006-01-26 09:05 · Score: 3, Insightful

The Google cache is absolutely ridiculous. As an individual who has had quite a bit of experience on both sides of the white hat / black hat search engine industry, the cache is NOT a webmaster's friend.

1. The cache removes content control away from the author. For example, a site like EzineArticles.com prevents scraping by using an IP blocking method based on the speed at which pages are spidered by that IP. It is absurdly easy to circumvent this by simply spidering the Google cache of that article instead of spidering the site. Google's IP blocking is far less restrictive, and combined with the powerful search tool, it allows for easy, anonymous contextual scraping of sites whose Terms of Service explicitly refuse it.

2. The cache extends access to removed content, often for months if not years at a time. Google rarely replaces 404 pages (perhaps it is because of their wish to have the largest number of indexed pages). I have clients who have nearly 48,000 non existent pages still cached in google that have not been present in over 14 months. Despite using 404s, 301s, etc. these pages have not yet been removed. Furthermore, Google's often mishandling of robots.txt, nocache, and nofollow leaves webmasters dependent upon search traffic hesitant to force removal of these pages using the supposedly standardized methods of removal.

3. The cache allows Google to serve site content anonymously. Don't want the owner of a site to know you are looking at their goods (think of companies grepping for competitor IPs), just watch the cache instead.

The list goes on and on. But I think the point is this...

Why should a web author have to be technologically savvy to keep his or her content from being reproduced by a multi-billion dollar US company? Content control used to be as simple as "you write it, its yours". It got a little more complicated with time to the point at which it might be useful to use, perhaps, a Terms of Service. Even a novice could write "No duplication allowed without expressed consent". Now, a web author must know how to manipulate HTML meta tags and/or a robots.txt file.

Fair use is for users, for people, not multi-billion dollar companies.

--
Fight Link Spam with LinkSleeve.org

Re:I believe in an opt-in Internet. by gowness · 2006-01-26 09:14 · Score: 1

im new and havent been keeping up with this whole lawsuit, but exactly how is google actualy using the cache other than just keeping a record of sites. ok yes its the creators site, but i still dont see how google is useing it.
Re:I believe in an opt-in Internet. by slavemowgli · 2006-01-26 09:25 · Score: 1

I think the commentor right above you has hit the nail on the head, though: there is a well-known, standard way to prevent these things from happening (robots.txt and meta tags), so if you choose not to use those tools, you're granting people an implicit license index your stuff. As he said - if you leave a bowl of candy on the front lawn on the 31st of October, are you going to sue kids that help themselves to it? Probably not, and even if you did, you likely wouldn't win. Everybody knows that these things happen at Halloween, so you wouldn't have a case.

Another example (this time one of my own): if you go to a doctor who says "I'm going to give you a flu shot now", and if he then proceeds to do just that, can you sue him because you did not explicitely said "I allow you to do this"? I don't think so - you *knew* what would happen if you didn't say anything or otherwise indicate that you didn't agree to getting a flu shot. IIRC, the legal term is "concludent behaviour" (although it may be a different one in English).

Same thing applies here: if you put a web page on the web, you have to expect people to use it, and the fact that you didn't do anything to prevent that even though you easily could've means that you don't have a case later on when something someone does doesn't sit well with you. Deep links, indexing for search engines (not counting caching) and caching all fall under this.

--
quidquid latine dictum sit altum videtur.
Re:I believe in an opt-in Internet. by ClearlyPennsylvania · 2006-01-26 09:26 · Score: 1

Fair use is for users, for people, not multi-billion dollar companies.
Actually, "fair use" is frequently used for quotes in books - Book A quotes Book B, and is allowed to because of fair use. So, fair use is applied to large, wealthy companies - all the time, actually. In fact, I think this is exactly what fair use is for.
Re:I believe in an opt-in Internet. by babyrat · 2006-01-26 09:37 · Score: 1

Why shouldn't fair use apply to corporations? What's the cutoff point? Can someone quote someone else in a book? That book is making a multi-billion dollar corporation money - but that would be considered fair use.

Or consider that the document isn't really reproduced until someone (an individual) requests the page, then it is being reproduced for that individual. Wow - that's too philosophical.

Fair use applies just as much to old fashioned printed documents as it does to web documents. If you had a line in your Terms of Service that "No duplication allowed without expressed consent", your document could still be reproduced under the fair use laws of copyright law (by an individual or a corporation). You don't want someone doing this, then don't commit it to hardcopy and give it to them. You don't want something reproduced electronically (under the fair use laws) then don't put it on the web for everyone to see.

Why should a web author have to be technologically savvy to keep his or her content from being reproduced by a multi-billion dollar US company?

Because he's publishing it on a technological platform? Why should I have to learn to write in order to create a book? Well, I don't - I could hire someone to transcribe my words. A web author could hire someone to publish web pages too.
Re:I believe in an opt-in Internet. by Anonymous Coward · 2006-01-26 09:51 · Score: 0

Your post translated:

"I hate the google cache because it proves that I'm deceiving users into clicking my links because I'm a scumbag who employs cloaking techniques to artificially manipulate search results. Wah."
Re:I believe in an opt-in Internet. by rjonesx2 · 2006-01-26 09:53 · Score: 1

Fair use has never, to this point, included duplication and reproduction of a complete work for financial gain. That is what I am getting at. We are not talking about snippets here, we are talking about the complete duplication of a copyrighted work being reprinted without author permission. The burden, in past, has always been on the copyright licensor, not the copyright owner. For example, an author did not have to know the law to write a book, publish it, and prevent its contents from being fully redistributed for profit without his or her knowledge. Now, authors must be familiar with HTML (and have access to it, mind you, as many users publish to free blogs that do not grant this access) to prevent republication by Google. Google is stealing traffic from authors by republishing their materials without direct consent.

--
Fight Link Spam with LinkSleeve.org
Re:I believe in an opt-in Internet. by tbird81 · 2006-01-26 10:00 · Score: 1

OPT-IN INTERNET!!!
It's already opt-in. If you don't want people using your website, don't put it on the net. Shut down your server.
Personally, I think Google should ignore, (or at least penalise with lower rankings) sites that use noarchive.
There's nothing worse than finding an article in Google, clicking on the link, and the page not longer part of the website. Being able to read the cached version means the result is not a total waste of time.
I realise the internet is used to make money, and that controlling information is a way that websites think that they'll make money, but search engines are used to find information, not to try and coax subscribers to shithole websites trying to make a buck.
Lowering the pagerank of sites with the 'noarchive' tag will improve the search engine's ability to get useful information for the user.
Re:I believe in an opt-in Internet. by vadim_t · 2006-01-26 10:08 · Score: 1

Huh, and "fixing" those "problems" is good for me how?

The google cache is very nice precisely because of the things you don't like. I want to screen scrape, see old stuff removed for no good reason, and visit sites anonymously. But that last part could be done without google cache anyway.

As an user, I don't give a damn about your interest in keeping control, sorry.
Re:I believe in an opt-in Internet. by Anonymous Coward · 2006-01-26 10:41 · Score: 0

Now, a web author must know how to manipulate HTML meta tags and/or a robots.txt file.

And that's so horrible, because it's so hysterically difficult to learn and/or copy-paste a single HTML tag and insert (i.e. copy-paste) a few lines of text into a text file.

But seriously, stop all the bullshitting. Maybe 1-5% of webmasters care about the fact that Google caches their pages, and if they're complaining, they know how to fix it, so hey. Besides, you're putting your it on the frickin' internet. I understand if it's a book, and someone's going around lending free copies of xeroxed pages without the author's knowledge or permissions. But on the web, it's more obvious than a Rudolph's nose that everyone and everything is going to see and be able to use your content in any way they please. Not to mention, Google's Cache gets used by so many people for so many purposes (at least among my friends), that even if the number of webmasters who care is 50% or more, it just doesn't matter, because a lot more people care about being able to access the cached content than being unable to.
Re:I believe in an opt-in Internet. by kindbud · 2006-01-26 11:03 · Score: 1

1. The cache removes content control away from the author.

So does any browser. It's what HTML and HTTP were designed to do.

2. The cache extends access to removed content, often for months if not years at a time.

So does my personal copy of your site, thanks again to HTTP.

3. The cache allows Google to serve site content anonymously.

So does any proxy.

If you want absolute control over your work, don't publish it.

--
Edith Keeler Must Die
Re:I believe in an opt-in Internet. by imthesponge · 2006-01-26 11:08 · Score: 1

Same thing applies here: if you put a web page on the web, you have to expect people to use it, and the fact that you didn't do anything to prevent that even though you easily could've means that you don't have a case later on when something someone does doesn't sit well with you. Deep links, indexing for search engines (not counting caching) and caching all fall under this.

Are you saying that caching falls under fair use, or are you saying that when you publish something online you implicitly grant users rights in addition to those they have under copyright law? I would assume you mean the former, but you mention an "implicit license" that suggests the latter.
Re:I believe in an opt-in Internet. by imthesponge · 2006-01-26 11:13 · Score: 1

I just read TFA; now I see that it does mention an implicit license. IANAL, but I don't think that applies to all pages, since the page in question explicitly allowed indexing.
Re:I believe in an opt-in Internet. by typical · 2006-01-26 11:58 · Score: 1

1. The cache removes content control away from the author. For example, a site like EzineArticles.com prevents scraping by using an IP blocking method based on the speed at which pages are spidered by that IP. It is absurdly easy to circumvent this by simply spidering the Google cache of that article instead of spidering the site. Google's IP blocking is far less restrictive, and combined with the powerful search tool, it allows for easy, anonymous contextual scraping of sites whose Terms of Service explicitly refuse it.

Yup. EzineArticles.com is abusing HTTP. It's not supposed to do that (and I could think of many valid systems, like prefetching, that could potentially trigger this). There are some correct systems that will break with their scheme. That's their fault.

I don't care *what* someone's ToS says. If you need a ToS, then you can't allow anonymous access. Simple as that. The benefit to the Web of caching being allowed vastly outweighs the benefit to the Web of everyone having their own halfassed ToS schemes. Anything else vastly reduces the ability of a computer to deal with the Web in an automated fashion -- computers can't read legalese.

2. The cache extends access to removed content, often for months if not years at a time. Google rarely replaces 404 pages (perhaps it is because of their wish to have the largest number of indexed pages). I have clients who have nearly 48,000 non existent pages still cached in google that have not been present in over 14 months. Despite using 404s, 301s, etc. these pages have not yet been removed. Furthermore, Google's often mishandling of robots.txt, nocache, and nofollow leaves webmasters dependent upon search traffic hesitant to force removal of these pages using the supposedly standardized methods of removal.

I've yet to hear of problems with Google's handling of these, and Google *does* provide a link to the current page. Why should Google take down 404 pages? I *hate* getting 404s. It drives me up a wall. But sometimes, when I get one, I can go back to the Wayback Engine or Google Cache and actually see the page.

3. The cache allows Google to serve site content anonymously. Don't want the owner of a site to know you are looking at their goods (think of companies grepping for competitor IPs), just watch the cache instead.

A) I don't sympathize with people who expect to be able to glean information from monitoring everyone reading content.
B) There are a thousand ways to do this that don't involve Google. This argument is already lost.

Now, a web author must know how to manipulate HTML meta tags and/or a robots.txt file.

Such is the price of progress. If he doesn't like that, he can have a tool that does it.

Fair use is for users, for people, not multi-billion dollar companies.

Right. Me. The end user. The person browsing.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.
Re:I believe in an opt-in Internet. by periol · 2006-01-26 12:01 · Score: 1

Fair use has never, to this point, included duplication and reproduction of a complete work for financial gain.

First of all, it's not exactly *duplication* that we're talking about here. And we're also not talking about actual financial gain, are we? Using those words is somewhat deceptive, because your are pushing the argument in certain directions that make this issue more black and white than it is. What do I mean?

-any financial gain that Google receives here is implicit and indirect, at best. if it was explicit or direct, there'd obviously be more of a problem

-and unless things changed from twenty minutes ago, the cache does not show an exact duplication of content. for one, it pulls images from the site, not the cache. so we're talking about text here, which may or may not show up in the same way it was designed.

*** The way I see it, there is a great murkiness around the concept of fair use. What I think of as fair use is not the same as what someone else might think, because the concept is better defined in some areas than in others, and let's face it, analogies tend to break down. Personally, I think Google's cache is an excellent example of something I think *should* be fair use. I think that copyright should be loosened, shortened, and be redefined for the internet.

But that's just me.
Re:I believe in an opt-in Internet. by reve · 2006-01-26 13:08 · Score: 2, Insightful

Okay, let's say someone is scraping your site. If they are scraping your site and redistributing the information, you have a very, very clear violation of copyright and I urge you to contact your lawyer.

If it's just some guy scraping your site for his own edification, it's probably fair-use -- regardless of your crazy terms of service no one read.

Ultimately, an LWP::Simple based "scraper" is just a specialized browser -- unless they're redistributing your work.

Someone using Google to violate your rights is not Google's fault. It's an opportunity to ask for punitive damages.

--
-- r . m o s q u i t o --
Re:I believe in an opt-in Internet. by smoker2 · 2006-01-26 18:44 · Score: 1

Yeah, the user has to be really technically savvy to use robots.txt
User-agent: * Disallow: /

How about people who need to keep stuff private, just don't make it web accessible ? Oh that's right, they want to be published but just want to take all their toys home when they've had enough.
The web is public. There are established ways to prevent casual use by anonymous visitors. If you can't be bothered to use them, simple as they are, then I'm afraid it's tough luck. I guess you're in favour of a 2 tier internet too.
Oh, and copyright has never been as simple as "you write it, its yours". You have to catch abusers and legally defend your content before copyright has any meaning. Not so simple is it. Especially as Google has legally been found to be non-infringing in this case.
Re:I believe in an opt-in Internet. by Anonymous Coward · 2006-01-26 23:30 · Score: 0

By putting material on port 80 you are explicitly allowing anyone and everyone to cache your material. Sorry, that's just one of the reasons http works well. If you don't like it, you may with to consider an end to end application layer protocol, such as https.
Re:I believe in an opt-in Internet. by cjb110 · 2006-01-27 05:47 · Score: 1

If you publish something on the World Wide Web then it is open to everyone in the world, without restriction. If you do not want that then don't publish it, or publish it to a secure location. Obviously if they try to republish it under their name then go get your lawyers.

If google is keeping links to removed content then that should tell you that people are linking to that content as they find it useful, or to stop f'ing around with your site organization.

It's not google's job to make sure that things on a public site are public, its your job to make sure content that isn't public isn't available.

--
----- I refuse to have an argument with an unarmed person

Bad Ruling by geekee · 2006-01-26 09:08 · Score: 1

"Finally, a frivolous lawsuit that got its just desserts. We can only hope that will herald a new age, where the insanely stupid lawsuits are going to fianlly the death they so rightly deserve."

They could have said he was right, but then not awarded any monetary damages. This sets a bad precedent. Copyright used to be automatic. Now, if I don't put the right tag in my html, I forfeit copyright to search engines.

--
Vote for Pedro

Re:Bad Ruling by LaCosaNostradamus · 2006-01-26 09:41 · Score: 1

Now, if I don't put the right tag in my html, I forfeit copyright to search engines.

How so? The search engine just invokes delay in the exposure of the data you already exposed publicly. It's an information-distribution detail for something you've already distributed without qualification. What's been forfeited? You can't control the other delays or other modifications involved in distribution, either. For example, people record all kinds of broadcast media for review at a later time that's convenient to them. User-controlled delays are well established ... they ARE THE PRECEDENT.

I think that it's reasonable to assume that anyone using the web for a public page knows that said page can be reached from a search engine like Google or Yahoo. After all, these engines are essential for making the web useful. Decrying a natural extension (like caching) of their utility is foolish on many levels. The court made the right choice.

--
[You have a stable society when some nut guns down a schoolyard and the law doesn't change.]

May be a hint of how the book scanning will go. by Godeke · 2006-01-26 09:11 · Score: 2, Informative

After reading the actual opinion that granted summary judgement, if this same logic is applied to the scanning and offering of search on "real world" materials, Google may be able to withstand lawsuits on the book scanning effort quite well. There are some differences that could create a different outcome, but this outcome was 100% favorable to Google and the idea of indexing and caching of materials to allow such search and reference was solidly defended by the judge.

--
Sig under construction since 1998.

Re:May be a hint of how the book scanning will go. by Shakes268 · 2006-01-26 09:19 · Score: 0

I think it may be a little bit different. If I put up a copy of an article I've written for viewing on my web site I have essentially published it for the masses freely. Sure, If someone tries to steal it I should, under law have copyright protection keeping someone from publishing it later and making money off of it.

However, if I have a book that is written and sold with the contents not available online I see it being a little bit different. Google has to violate my copyright to scan or copy the material of the book into electronic form then they are hosting the copyrighted material on their website without my consent. Anyone remember the sign above the photocopier in the university library with warnings about copyright violations??

If I had made the contents of the book available on my web site as free content and they cached the page, I don't think there would be anything anyone could do.
Re:May be a hint of how the book scanning will go. by Godeke · 2006-01-26 09:31 · Score: 1

I agree to a point, but the ruling made it clear that the "additional social value" of access was a consideration for the fair use acceptance. Google is using the same argument for the book search tool, and since it was accepted here, perhaps it will have a leg to stand on.

The realities of the case that was just ruled in Google's favor would seem on a literal reading of copyright law to be in the copyright owners favor. The judge quite clearly found that fair use applied in a context I was suprised by.

He also found in the section discussing license that because Google has several opt out mechansims (robots.txt, meta tags and direct contact) that the client failed to avail himself to, they had an implied license. Google has been very vocal in the book effort that any work would be excluded simply by asking for it to be excluded. I thought that was weak, but the judge her found passive licensing to be in force in this case, which may mean that anyone who doesn't actually request removal in the book effort may be seen as passively approving the indexing.

My point wasn't that it made it a slam dunk, but that this case went so favorably that it made it more likely that the book scanning cases might as well, something that surprised me.

--
Sig under construction since 1998.
Re:May be a hint of how the book scanning will go. by Shakes268 · 2006-01-26 09:45 · Score: 0

I think the content in question here, was passively licensed by placing it in a publicly accessible format as the internet. Without the use of any robots.txt or meta tags to prevent engines from traversing the content and by not protecting the content via any type of password protection,etc he opened the content up freely.

I think with the books, google employees (or temps) have to physically get a book, scan it into the system, catalog it and make it available. By making it available to someone without the consent (in my opinion they should have an opt-in policy, not an opt-out policy for exclusion) of the owner of the IP contained within the book they have committed copyright violations on the same grounds that the RIAA seem to be suing many over. Making copyrighted material available online or in any format without consent or licensing is illegal. It's literature piracy.

Yes, this is just my opinion but I have a few books published and know how I feel on the subject.
Re:May be a hint of how the book scanning will go. by Godeke · 2006-01-26 10:06 · Score: 1

Have you opted out? Personally, I find the idea that people might be directed to my content (on or off line) by search to be thrilling. Google is working very hard to take those people who find my content and convert it into sales. If it turns out they do something evil with the search tools, perhaps I will change my mind, but free (cost to me) exposure seems a benefit not a negative.

Of course, there are those who feel that libraries should be burned down and the only way knowledge should be transferred is via direct payment to the holder of that knowledge. Several of the textbooks in the University here have come in the new "limited duration license" CDs where they become coasters six months after purchase (at the end of the semester). That way, it kills the used book market without the expense of needing a new edition. I find that pathetic in a money-grubbing sort of way and my wife refuses to use such "textbooks". Sadly, I suspect that in fifty years, that will be the norm and the fact that I have my old textbooks on the shelf will be look upon as a quaint oddity.

--
Sig under construction since 1998.

Slashdot Cache Ruled Fair Use!! by LesPaul75 · 2006-01-26 09:18 · Score: 3, Interesting

Ok, no more excuses Slashdot... It's time to start caching pages and preventing the Slashdot effect.

Re:Slashdot Cache Ruled Fair Use!! by TubeSteak · 2006-01-26 10:21 · Score: 1

Maybe this ruling will encourage our fearless leader CmdrTaco to move the /. caching discussion higher up on his list of things to talk about.

However, imagine the word "Slashdotted" fading into the past, while a new overload rises up in its place: Digg

"You got Digged"

Is that something we really want to happen?

--
[Fuck Beta]
o0t!

Web caches too! by slashdotmsiriv · 2006-01-26 09:25 · Score: 2, Interesting

What about web caches? They violate copyright law too! My firefox does it too. Should I use 0 Mbytes of disk space for browser caching?

Where's the engine for the robots.txt excluded? by Kazoo+the+Clown · 2006-01-26 09:31 · Score: 3, Interesting

By now someone must have created a search engine that only indexes sites whose robots.txt tells them not to index. I'm surprised I haven't heard of a particular one. Bet it would raise a few hackles though...

Re:Where's the engine for the robots.txt excluded? by Anonymous Coward · 2006-01-26 11:19 · Score: 0

By now someone must have created a search engine that only indexes sites whose robots.txt tells them not to index.

I'm sure someone has.

They're sure as hell not going to advertise it tho' are they?

POSTED No Trespassing by Anonymous Coward · 2006-01-26 09:35 · Score: 0

This is no different that being required to put up a no trespassing sign on your property to keep people off of it.

Not everything is "secure by default" :)

Need a 'Loser Pays' Rule on Frivolous Suits by schwit1 · 2006-01-26 09:38 · Score: 1

Google gained precedent, but shouldn't have to foot the legal bill for fighting this. The system wrongfully (IMHO) allows these people(gold digger and his ambulance chaser) to put no skin in the game.

What A Pity... by MightyMartian · 2006-01-26 09:45 · Score: 1

What a pity that Chinese citizens will never know if the archived material discusses Tibet or Tiananmen Square. But that's alright, greed trumps decency and ethics anyways. Oh, and I have every expectation that the Google apologists will mod me flamebait, so go on, I've got karma to burn and a deep abiding hatred of evil corporations that aid tyrants that are scared of simple words.

--
The world's burning. Moped Jesus spotted on I50. Details at 11.

Torrents - by bizitch · 2006-01-26 09:47 · Score: 1

So if I use Google's cache to locate torrents for say I don't know ... King Kong (2005) ... I'm all good?

--
---- "Logoff! That cookie shit makes me nervous!" - A. Soprano

Re:Torrents - by arkhan_jg · 2006-01-26 12:41 · Score: 1

Yes, because whoever created that torrent and published it has given you implied licence to download the torrent, and fair use covers the gaps.

Oh, you're talking about using the torrent to download a copy of king kong (2005)? That's still illegal for at least the people doing the uploading, because the copyright holder hasn't published the entire film on a website for free viewing, so there's no implied licence or fair use.

--
Remember kids, it's all fun and games until someone commits wholesale galactic genocide.

I'm sure you're right... by rewt66 · 2006-01-26 09:50 · Score: 2, Insightful

I'm sure you're right. I'm sure you're much smarter than the judge, and have a much better grasp of the law. What a pity that you weren't the one judging the case!

Sorry, mumbles, but I don't buy it. My money's on the judge being right, and you being a loudmouth with too much time to post over and over, as you have in reply to everybody who argued with you.

And to try to actually contribute something to the debate: So you don't like the "fair use" part of the decision. Fair enough; though as I said, between you and the judge, my money's on the judge as the one who correctly groks what fair use is all about. But that aside, what about the other three points? Most interestingly, what about the "implied license" point?

And what about the judge's assessment that Field was "attempting to manufacture a suit against Google"? That is, this wasn't about actual injury to Field, it was about Field actively looking for a chance to sue someone? Does any of that matter to you?

Re:I'm sure you're right... by mumblestheclown · 2006-01-26 09:57 · Score: 1

you're right. judges are always right and always make the smartest decisions. not only that, but all legal scholars agree with all decisions that judges make, because, again, judges are always right.
I type faster than you. get over it. total time spent on this thread 10 minutes while doing other work. i happen to think it's an interesting topic needing debate - i will call bad arguments bad (and there have been several bad arguments here), but i am willing to listen to any good ones that come up.
Re:I'm sure you're right... by rewt66 · 2006-01-26 10:03 · Score: 1

Valid point. In fact, you could say it a lot stronger: Judges have handed down some pretty wacked-out decisions. Still, in the choice between some random judge and some random ./ poster...
But, other than pointing out that my sarcasm may be misplaced, you didn't answer my questions at all.
Re:I'm sure you're right... by mumblestheclown · 2006-01-26 10:19 · Score: 1

And what about the judge's assessment that Field was "attempting to manufacture a suit against Google"? That is, this wasn't about actual injury to Field, it was about Field actively looking for a chance to sue someone? Does any of that matter to you?
To me, this is like the Newdow case. In my view, Newdow is 100% right in that he wants the phrase "under God" struck from the pledge. However, his case was dismissed recently because he didn't have standing - something about his not having legal custody of his daughter or something like that. Initially, my view was that the details of his daughter / the specific claimaint aren't what's interesting here - what seemed interesting was the underlying meat.
However, you may have a point: I mean, the guy might be a self-promoting fortune seeker in trying to sue google, but, well, isn't that a key component of our adversarial justice system anyway? I mean - what difference does it make whether it's a jerk or a multi-million dollar corporation suing? Copyright law doesn't differentiate (in the main). You're right it's an interesting question and an interesting sidelight to the main question, which is whether google should be allowed to cache complete page content as it does.. and, not just keep it internally for use in generating its query results, but in rediplaying the whole pages, either in their entirety or not (depending on which of the respondents here you believe - you might notice that some people responded to my post claiming that what google does is ok because google DOES keep the page intact in its entirety, and others claimed that what google does it ok because google DOES NOT keep the page in its entirety!)
Re:I'm sure you're right... by rewt66 · 2006-01-26 10:30 · Score: 1

the guy might be a self-promoting fortune seeker in trying to sue google, but, well, isn't that a key component of our adversarial justice system anyway?
No, the key component is the adversarial judicial system. Fortune seekers, "lawsuit as lottery", no, those most definitely are not a key component of our legal system. They're a bug, not a feature.
That's partly why the standing rules are the way they are. If you are legitimately injured, you can sue. If you don't like what somebody is doing but it doesn't injure you, tough. Mind your own business, because the courts don't care that you don't like what isn't injuring you. I mean, really, do you want to be sued because somebody doesn't like your choice of interior decorations of your house?
To extend that slightly - you're not supposed to go looking for a chance to sue someone. If someone injures you, you're free to sue, but if you go looking for it, that will make the judge look at your side of the case much more harshly, and rightly so.
Disclaimer: IANAL

No, *you've* got it backward. by Anonymous Coward · 2006-01-26 09:50 · Score: 0

You act as if copyright exists for the benefit of the copyright holders, and fair use is a bone thrown to the rest of the public by the copyright holders so long as it does not impose an undue burden on the copyright holders. This is backward.

In fact copyright exists for the benefit of the public itself, and fair use is a mechanism by which the public ensures that the bones they've thrown the copyright holders do not impose an undue burden on everyone else.

The police and government may have been acting as if they work for the copyright holders rather than the public lately, but "intellectually" that is not where the responsibilities are or are supposed to lie. If the public cannot make backup caches of publicly available material in case the copyright holder becomes unavailable later, the utility of that material is serverely diminished to everyone else in a way the copyright holder can not fairly claim they have earned. Google happens to run one such cache; that is a coincidence. It is not the only one.

All through your post you're using the language of capitalism, libertarianism, and property. This language means nothing when you're talking about a government-created market interferance like copyright. Copyright is not a form of property.

Very specific case, though by Anonymous+Brave+Guy · 2006-01-26 09:55 · Score: 4, Insightful

This is good news for the Wayback Machine at archive.org.

I'm not sure about that. Although the result of this case seems fair and clearly indicated on several counts, there's a lot that might not apply to archives more generally, so I'm not sure how much of a precedent has been set.

In particular, the case was brought by someone who practically admitted trying to set Google up: he knew about mechanisms like META tags and robots.txt, knew that Google was caching his site, made no attempt to stop them, and indeed actually set up robots.txt explicitly to allow bots to crawl his site. This supports Google's first two defences here, having an implied licence and estoppel.

The most interesting discussion, IMHO, is on the fair use defence. The court considered in a lot of detail whether the use made by Google qualifies as fair use. On the first criteria (how the material is being used), it was found significant that the material was being used for different purposes in the cache than on the original site: the latter was presumed artistic, while the former allowed access to the material when the original site was down, historical comparisons of the site content, highlighting of search terms that made a page relevant to the user's search, etc. Hence the court concludes as follows:

Because Google serves different and socially important purposes in offering access to copyrighted works through "Cached" links and does not merely supersede the objectives of the original creations, the Court concludes that Google's alleged copying and distribution of Field's Web pages containing copyrighted works was transformative.

The court also noted that Google made no attempt to profit from the display of the material, did not attach advertisements, made clear that the copy could be out of date, and linked clearly to the original source. (I wonder whether that non-profit, no-ads observation will come back to kick Google later...)

The other fair use discussion is less interesting, although the fact that the plaintiff had made his works available for free and not made any other attempt to profit from them was important, because this meant the market value of the original hadn't been damaged. One interesting tidbit is that apparently the SCOTUS has ruled that the fourth fair use factor (any damage to the market/value of the original work) can't be used to argue that the copyright holder could have licensed an otherwise fair use (such as the caching here) and thus the use can't be fair.

Some of the DMCA defence stuff could have quite significant implications. In particular, the fact that Google caches material only for a fairly short time (14-20 days is mentioned) is relevant, since a prior ruling about Usenet servers could be used.

In summary, Google would basically have won out on four different defences here, even without the fact that the original use might not qualify as direct copyright infringement (since the plaintiff went after the downloading done automatically in response to users; he didn't go after GoogleBot's initial copying process that caches the site on Google's system). It doesn't seem at all clear that a lot of the arguments would apply to other caching services, though: amongst other things, Google's cache in this case is temporary; known to the plaintiff, who had not tried to stop it and actually encouraged it; not for direct profit nor carrying any advertising; and clearly not damaging the market value of the original works.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:Very specific case, though by Eightyford · 2006-01-26 10:10 · Score: 1

Holy insightful response batman! You do no this is slashdot right? ;)

--
Religion for nerds. Stuff that really matters

anal: just deserts by Animaether · 2006-01-26 10:01 · Score: 1

http://www.snopes.com/language/notthink/deserts.ht m

Slashdot requires you to wait at least 15 seconds before blablabla :| *eyes watch*

It's not that bad by Anonymous+Brave+Guy · 2006-01-26 10:02 · Score: 1

If the kind of damage you're talking about were actually happening here, I'd agree with you, but note that the judgement relies on (among other things) Google not displaying ads with the cached page or otherwise profiting from it, the originals not generating any income for the copyright holder, and the fact that the plaintiff was well aware of the conventions that could be used to prevent his site being copied and in fact used robots.txt to request quite the opposite.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

What did you expect? by Nom+du+Keyboard · 2006-01-26 10:16 · Score: 1

What did you expect from a guy who is also a lawyer? Shakespeare was right 500 years ago, and it hasn't changed yet.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

Careful by Anonymous+Brave+Guy · 2006-01-26 10:16 · Score: 1

Your summary is rather misleading. The court also knew that the plaintiff was well aware of those preventative mechanisms and opted not to use them. In fact, he deliberately set up robots.txt so that his content would be considered. The same might not be true at all for Ma and Pa AOL's family homepage.

The Implied License is a stake in the ground for "This is the Internet. The rules are different here." IMO, that's a good thing -- there are a bunch of things that just couldn't happen if you had to get explicit permission from every content owner.

I'm not sure that's a good thing at all. Firstly, I don't think the Internet per se should receive any special treatment in law. The activities it facilitates -- such as making and distributing digital copies of works very quickly and cheaply -- may have dramatic implications, but then any applications of the law should be based on those activities, not on one particular medium.

Secondly, there are also a bunch of things that will go wrong if content providers have to play ball with anyone and everyone who wants to run any service that uses their site's content in any way based on this implicit permission. Unless there are a very small number of universal and legally enforceable mechanisms (which META or robots.txt could become, but aren't at present in any jursidiction AFAIK) the implied licence is a very dangerous precedent.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:Careful by honkycat · 2006-01-26 12:48 · Score: 1

Firstly, I don't think the Internet per se should receive any special treatment in law. The activities it facilitates -- such as making and distributing digital copies of works very quickly and cheaply -- may have dramatic implications, but then any applications of the law should be based on those activities, not on one particular medium.

The law doesn't need to treat the Internet specially just to recognize that it has its own customs and norms. Part of the ordinary practice of operating a web server on the Internet entails configuring your robots.txt file to keep spiders away from things you don't want them to index. A law that required that an author take "reasonable and customary precautions suitable to the publication medium," for example, would handle this well, IMO. I didn't read the decision, just the linked article, but it sounds more or less like that is what they upheld. That seems eminently reasonable to me.
Re:Careful by Anonymous+Brave+Guy · 2006-01-27 00:17 · Score: 1

I tend to agree with you, but note that the court here was clear that the plaintiff was aware of the conventions around META, robots.txt, etc., knew that Google would check for them, and deliberately chose to set up a robots.txt file that invited robots to scan the site anyway. There's no issue here about whether these techniques are standard practice.

I'm not sure it would always be sufficient just to argue that robots.txt and the like are accepted standards these days, simply on the basis that if you took a random sample of people with web sites, I imagine the majority would never have heard of them. It's a bit like not top-posting: those who have been around a while often don't like it and even regard it as rude, but the Outlook generation couldn't care less.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Careful by cfulmer · 2006-01-29 03:55 · Score: 1

The court did mention this, but I don't think that it impacts the implied license. I can't say "I had an implied license based on your knowledge and behavior" when I was completely unaware of your knowledge and behavior at the time that I relied on them.

I believe in common sense. by LordOfTheNoobs · 2006-01-26 10:19 · Score: 1

nofollow leaves webmasters dependent upon search traffic hesitant to force removal of these pages using the supposedly standardized methods of removal.

Now, a web author must know how to manipulate HTML meta tags and/or a robots.txt file.

So they are dependant on google giving them traffic for their livelyhood, and complain when they are not capable enough ( or are too cheap to hire someone who is capable enough ) to bother properly forming their content to the medium through which it is transmitted.

robots.txt may not be a law off the web, but neither is the specification for hypertext transfer.

One would think that a 'web' author should at least know how the 'web' works before authoring their precious little IP there.

--
They're there affecting their effect.

We're all criminals by Nom+du+Keyboard · 2006-01-26 10:20 · Score: 1

Google after the search engine automatically copied and cached a story he posted on his website.

And so did everybody else's browser that ever visited that site. I'm sure he'll want to sue us all next.

--
"It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."

slashdot article caching by Khopesh · 2006-01-26 10:21 · Score: 1

The idea just got legitimized. Sure, let appeals pass to solidify the ruling, and perhaps get some loyal slashdotter lawyer to do a cheap verification on some disclaimer/license. A nice archive of the past few days' stories and their links would be VERY nice.

Mirrordot and company do a decent job, but too often they don't cache enough (like Pages 2-5 of a story), and having it official would be great for users and would-be slashdotting victims. ... though this does bring potential advertising revenue into perspective; good for OSTG, bad for article hosters.

--
Use my userscript to add story images to Slashdot. There's no going back.

But the grandparent is the one who matters by Anonymous+Brave+Guy · 2006-01-26 10:21 · Score: 1

You, with your liberal anything-goes views, contribute zero value to the society by using the web. The GPP, with his desire to prevent certain detrimental activities, is contributing content that is presumably of value to some people even allowing for his wishes to control certain behaviour. Which of you do you think the law should support here?

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:But the grandparent is the one who matters by vadim_t · 2006-01-26 11:01 · Score: 1

Huh? And using a heavily restricted site contributes to society how?

Take slashdot for instance. Is my contribution any greater because I have to jump through hoops to make this post (slashdot bans my ISP's anonymous proxy)? I sent quite a lot of mail about it already, but it all seems to go to the bit bucket.

In the end, I can still post, but as I can't remain logged in, I have to load each page twice, once for the page itself, the second time to change to nested mode.

But back ontopic. Why are they detrimental activities? Screen scraping allows me to gather the stuff I want. I can contribute then to society by letting other people use the same code, which then can benefit from it. I call that a good thing.

Finding old information removed from the site is often invaluable. Don't you hate it when the site that seems to be just about the subject you need is down, or removed right the page you wanted? Thanks to google, often I still can have it.

I don't see anything wrong about anonymous access either, my ISP's transparent proxy already provides a degree of anonymity anyway. Even though my IP address is clearly sent by the proxy in the headers, lots of sites (like slashdot) still react to my proxy's address instead of mine.
Re:But the grandparent is the one who matters by Anonymous+Brave+Guy · 2006-01-26 11:21 · Score: 1

Huh? And using a heavily restricted site contributes to society how?

The things we're talking about are hardly heavily restricted. More to the point, the alternative isn't necessarily an unrestricted site; it could well be no site at all. This is the point a lot of people forget in their haste to tell us how information wants to be free, you can't stop distribution, etc: by default, only the author of the work has it. If society wants the author to share, it has to make it worth his while, or he'll simply keep it to himself and no-one will benefit under any circumstances.

Finding old information removed from the site is often invaluable. Don't you hate it when the site that seems to be just about the subject you need is down, or removed right the page you wanted? Thanks to google, often I still can have it.

On the other hand, consider an author who's willing to put up a draft of their book on a site, offering that version of the content to the public for a limited time in exchange for hopefully receiving constructive feedback before publishing. This benefits both parties. It probably also benefits both parties to withdraw that content when the final version is published: the author wants to make his return on investment, and the content in the final version will probably be better than in the draft so you don't want out-of-date information taking priority.

However, archives can screw this up: anyone who wants to fleece the author can (potentially to their own disadvantage as well) look up the draft and use that instead of paying the asking price for the final version, and anyone searching for the work might not even be aware that the final version exists, because they see an archived version of the draft page, not the announcement page that replaced it. And of course, many authors will no longer publish anything from any potentially publishable book, even a draft or a sample chapter, for fear of publishers turning them down (as many now routinely do). None of this helps anyone, and archives have to take some responsibility for the damage. (This isn't to say that they can't be useful for other purposes, of course, just that it's not an entirely one-sided argument and keeping old material on-line against the author's wishes can be damaging.)

I don't see anything wrong about anonymous access either, my ISP's transparent proxy already provides a degree of anonymity anyway.

That's a discussion for another time, perhaps, but just consider how many crimes, from the somewhat irritating to the seriously damaging, are possible only because of the effective anonymity the Internet provides. Again, that's not to say that some don't benefit, but IMHO most of the benefits are illusory, while the damage from spam, viruses, phishing and on-line fraud, and countless other things is very real.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:But the grandparent is the one who matters by typical · 2006-01-26 12:01 · Score: 1

or he'll simply keep it to himself and no-one will benefit under any circumstances.

Yup. And yet, somehow, there is far more content on the Internet than I could ever see in a lifetime of browsing. Sorry, but no sympathy here.\

That's a discussion for another time, perhaps, but just consider how many crimes, from the somewhat irritating to the seriously damaging, are possible only because of the effective anonymity the Internet provides. Again, that's not to say that some don't benefit, but IMHO most of the benefits are illusory, while the damage from spam, viruses, phishing and on-line fraud, and countless other things is very real.

That's a straw man argument. None of these are directly facilitated by Google Cache.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.
Re:But the grandparent is the one who matters by periol · 2006-01-26 12:13 · Score: 1

If society wants the author to share, it has to make it worth his while, or he'll simply keep it to himself and no-one will benefit under any circumstances.

Ah, but society's understanding of what is worthwhile may not be the same as the author's. Theoretically, copyright is given by the community to the author. If you offer me a job, and I don't think it pays enough, I have the right to turn it down. The author has the right not to publish. And the community has the right to tweak the laws should it decide they're not working out best for the interests of all.

The funny thing about your argument is how black & white it is. As a writer, I will often choose to post some of my content for free, and I will alternatively post other content in protected areas that only those I've approved can see. I also have a large number of works that I haven't published online at all.

If you really feel like you're getting screwed, well, you can always go to a judge and argue that your understanding of copyright is correct. And then you can hope that he or she will agree with you. Luckily for you, I'm not a judge, and you aren't in my courtroom - because I think you are flogging a dead horse. We made it a long time without copyright, and our current system is too tight, and the reins on fair use need to be loosened a bit. IMHO.
Re:But the grandparent is the one who matters by Anonymous+Brave+Guy · 2006-01-26 12:14 · Score: 1

And yet, somehow, there is far more content on the Internet than I could ever see in a lifetime of browsing.

Sure. But how much of it is good content, the kind that a skilled artist has worked hard on and it shows, and how much of it is the WWW equivalent of the latest teenie lip-synch group? Anyone can write a blog. Writing a blog worth reading is a whole different game.

[I'll mostly gloss over the anonymity thing. There are examples, though admittedly less serious than what I mentioned before when I was commenting on Internet anonymity in general, where even anonymising the browsing of a site can damage the site's providers. Think of people who use their site logs to help improve the UI, based on how many of their visitors are returning, or identifying that a lot of visitors from a particular country are coming and it would be worth investing in a local mirror. However, I'm not sure any of that is particularly relevant to this discussion.]

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:But the grandparent is the one who matters by vadim_t · 2006-01-26 12:25 · Score: 1

The things we're talking about are hardly heavily restricted. More to the point, the alternative isn't necessarily an unrestricted site; it could well be no site at all. This is the point a lot of people forget in their haste to tell us how information wants to be free, you can't stop distribution, etc: by default, only the author of the work has it. If society wants the author to share, it has to make it worth his while, or he'll simply keep it to himself and no-one will benefit under any circumstances.

This is a faulty argument. It assumes that the more control to the author, the more incentive to publish. But it isn't so. Copyleft effectively works around copyright and wouldn't be needed if copyright didn't exist, and some people wouldn't create content if it could be tracked back to them.

Would you still post anything on the Internet if anybody, your family, friends, employers, government, etc, could instantly find everything you did online? Imagine somebody being able to enter your name, and finding every post in every forum, a list of the sites you accessed, and all of it going back 10 years?

Or consider an alternative situation, where every bit of information has DRM attached, and I could at will make my posts vanish from the Internet when I found it convenient, restrict who can see them and when? Should I be able to now go and cancel my previous post, making you look like you're completely out of your mind arguing with nobody at all?

That's a discussion for another time, perhaps, but just consider how many crimes, from the somewhat irritating to the seriously damaging, are possible only because of the effective anonymity the Internet provides. Again, that's not to say that some don't benefit, but IMHO most of the benefits are illusory, while the damage from spam, viruses, phishing and on-line fraud, and countless other things is very real.

This is very debatable, but I don't think it's worth discussing. Mainly because for a long time I ran a Freenet node, so obviously I already considered all that, and didn't think it was a huge deal.

However: The current infrastructure makes spam perfectly possible to track, yet it didn't stop it, viruses don't work on my systems, and the existence of phishing has absolutely nothing to do with the anonymity of the *client*.
Re:But the grandparent is the one who matters by typical · 2006-01-26 15:42 · Score: 1

Sure. But how much of it is good content, the kind that a skilled artist has worked hard on and it shows, and how much of it is the WWW equivalent of the latest teenie lip-synch group?

Percentagewise? Probably a minority. But I don't care about percent -- I care that in absolute terms, there is far more good content available than I can consume. I have a huge reading backlog. I certainly would not trade the ability to get at documents that have gone missing (sever mistake, no more funding to make document available, someone thinks that something is "obsolete", domains expire, etc) for even more content.

--
Any program relying on (nontrivial) preemptive multithreading will be buggy.
Re:But the grandparent is the one who matters by Anonymous+Brave+Guy · 2006-01-27 00:34 · Score: 1

I care that in absolute terms, there is far more good content available than I can consume. I have a huge reading backlog. I certainly would not trade the ability to get at documents that have gone missing ... for even more content.

Fair enough, you're perfectly entitled to your own opinion. However, what you wrote above is only a personal preference. It's not necessarily in the best interests of society as a whole (or even yours, but we can never know that for sure without knowing what potential content you're turning down and how much value you might have put on it if you'd known about it).

My personal view is that older content that is really valuable is unlikely to disappear anyway, and that which does probably does so through "natural selection". I don't want the Internet to become so full of outdated, obsolete, irrelevant information that it becomes impossible to find the good stuff, and compared to even five years ago it's already heading that way fast.

If something has disappeared, IME there's often a good reason for it: usually it's either no longer relevant or replaced by something better. Occasionally it's not something that should ever have been out there in the first place, and I don't accept the black-and-white view that once some information is out there, it's always in society's interests to keep it so and make it widely available.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Comment removed by account_deleted · 2006-01-26 10:22 · Score: 1

Comment removed based on user account deletion

A clip, yes... by Jeff+Molby · 2006-01-26 10:27 · Score: 1

Ebert & Roeper can show a movie clip

...but they can't show the whole movie to point out which parts they (dis)like.

Re:A clip, yes... by pthisis · 2006-01-26 10:44 · Score: 1

...but they can't show the whole movie to point out which parts they (dis)like.

Maybe, maybe not. Depends on how it's presented. There are complete critical copies of copyrighted works along the lines of Roland Barthes' S/Z that have been protected. While they incorporate literally the entire work, the amount of critical material bastly outweighs the amount of source material. Reproducing a painting along with criticism thereof is commonplace and sometimes protected. There is no bright-line standard for how much of a work can be copied.

--
rage, rage against the dying of the light

License by Anonymous Coward · 2006-01-26 10:29 · Score: 0

By putting something out there for all and sundry, you would appear to this non-lawyer, to be giving an implied license to read and use it.

Because you can, in fact, say that you don't want it cached, the fact that you are negligent in doing so is your own fault and not that of a third party. The burden on Google's shoulders is far greater than the one for responsible webmasters, therefore it is reasonable that the webmaster, the only one in a position to know whether or not they wish to give permission, should be given that burden. This goes double when one may request that Google remove it after the fact without need of a lawsuit.

Furthermore, it is a transformative use--it has the search terms highlighted, links to the original page, and tells you that it is not Google's. It even gives "convert to HTML" links as well as translations in some cases. In these cases *even if* the whole work is copied, it CAN still be fair use. And this is a good thing. Because otherwise I'd just tell you that you cannot cache this work right here--you know, the one you're reading right now. Oops, your browser already automatically did that for you? Too damn bad. As the copyright holder, I can license it any way I damn well please. Thankfully, the Court was more sensible than that in this case.

Moreover, it's a further financial drain on Google rather than a profit center. True, it's one of the great features that makes Google popular, but it's not something they make money on in and of itself--they don't put any ads on the pages they cache or anything like that.

Lastly, I am not a lawyer and this is not legal advice. Even so, Google is promoting the progress of arts & sciences here with this service. That is the ONLY legitimate reason for copyrights, and insofar as that aim is not being served copyright law should be curtailed. Money is one of the ways it is intended to promote them, but insofar as money gets in the way, too bad. No one has a right to money due to some artificial legal construct. And that includes me, even though I am a programmer, an author and have been a musician (albeit not a very good musician).

Ninwa hit the nail on the head by CGameProgrammer · 2006-01-26 12:24 · Score: 1

Google's search engine works exactly like any human browsing the web -- it scans webpages and follows links. In order for it to see MP3 files, those files have to be made available for public download already. If that is legal then caching those files also is, arguably. If the files were posted illegally, the fault is not Google's.

Of course Google Cache does not actually copy graphics, as far as I am aware, and it certainly wouldn't cache MP3 files. Archive.org is a more complete cacher.

--
~CGameProgrammer( );

DMCA reasoning not likely to stand by Crispy+Critters · 2006-01-26 12:50 · Score: 1

The judge really botched the analysis of the DMCA argument. That is not likely to stand.

To quote, it requires that the cache exist "for the purpose of making the material available to users... who... request access to the material [from the originating site]." We understand what this is referring to - an ISP or a web proxy intercepts a web request and serves a cached copy of a page rather than getting a fresh copy. However, Google offers up its cached pages to people as an alternative to requesting the material from the originating site.

The judge's claim is that some people use the Google cache after unsuccessfully trying to load the original site (e.g. b/c of the slashdot effect). However, the user does not need to request the original page to load the page from the Google cache.

It seems clear that the law allows caching that is transparent to the user, while Google has gone to great pains to make their caching as untransparent as possible. On the other hand, this seems to have strengthened their fair use and estoppel claims.

Re:DMCA reasoning not likely to stand by PoconoPCDoctor · 2006-01-26 15:57 · Score: 1

I sense someone with Juris Doctor credentials! B-)

I'm married to a lawyer, so please don't take offense. I have also had a run-in with copyright laws, but the ruling on Google might keep the page I am being asked to remove alive after all.

--
"Let us raise a standard to which the wise and honest can repair" - George Washington

Good ruling!!!! by rts008 · 2006-01-26 13:16 · Score: 1

I was always under the assumption that there was a reason for this net thing being referred to as the "information highway"---never have I heard it hyped as the information tollroad.

--
Down With Slashdot BETA!!! I've been around the corner and seen the oliphant; you can only abuse me from your perspecti

What does this mean for open WiFi? by Mr2001 · 2006-01-26 13:58 · Score: 1

A similar argument was thrown around a lot back when we had those stories about people being arrested for using unsecured WiFi networks without permission.

The argument works just as well there: You operate a wireless network. Wireless networks were created specifically with the idea that clients would try to connect to nearby networks (automatically, in some cases), and there are multiple standard well-known ways to prevent strangers from connecting to your network or accessing the internet through it. Running a wireless network without using these techniques means that you give strangers permission to connect to it.

It didn't seem to fly back then, though. Does this mean courts will now take the "implied license" argument more seriously when applied to other technologies besides HTTP?

--
Visual IRC: Fast. Powerful. Free.

Re:What does this mean for open WiFi? by cfulmer · 2006-01-27 06:43 · Score: 1

I think the facts of the two situations are enough to distinguish them. Wireless routers are consumer devices and most lay people just don't know how to configure a wireless router properly -- they just don't expect that others will be able to use their networks and (even if they did), they're probably not familiar with the risks involved.

Being on the Internet, however, is different -- 99.99% of all websites that exist now have been created since search engines came into existance and since the robots.txt de facto standard was created. So, new entrants should reasonably know and expect that their website will be indexed. Heck, the vast majority of websites actually rely on this.

Copyright law actually specifically incorporates the implied license view in one place. If you have a copyright in an architectural work that is visible from a public place, then 17 U.S.C. 120 says that your copyright does not extend to pictures of that work. So, by putting a cool new building on a public street, I'm giving people the right to take pictures of it.
Re:What does this mean for open WiFi? by Mr2001 · 2006-01-27 13:13 · Score: 1

I think the facts of the two situations are enough to distinguish them. Wireless routers are consumer devices and most lay people just don't know how to configure a wireless router properly -- they just don't expect that others will be able to use their networks and (even if they did), they're probably not familiar with the risks involved.

So if you're running a web server on your PC without setting up robots.txt, because it's part of your OS or some app you use and you never took the time to investigate it, would you be able to successfully sue Google for caching it? The only difference between that scenario and the actual case here would be the site admin's ignorance.

Being on the Internet, however, is different -- 99.99% of all websites that exist now have been created since search engines came into existance and since the robots.txt de facto standard was created.

Well, 100% of all wireless networks that exist now have been created since wireless encryption came into existence. :P

--
Visual IRC: Fast. Powerful. Free.

Caching Tributes to 9/11 by PoconoPCDoctor · 2006-01-26 14:13 · Score: 1

This ruling comes as I have been asked to remove a web page from my personal web site. The page was a copy of an obituary that appeared in January of 2002. It was a touching tribute to Michell Robotham, who was killed on 9/11 in the attacks on the World Trade Center.

The newspaper deleted all the memorials, so had I chosen to just link to the newspaper web site, the link would have not worked. I have agreed to remove the copyrighted material, but let them know that I cannot control what Google has cached, and that they would have to contact Google to removed their cache of the page.

I wrote about my run-in with copyright laws on my blog.

The link will be removed tomorrow, but will live on in Google.

--
"Let us raise a standard to which the wise and honest can repair" - George Washington

I like this ruling... by tv_dinners · 2006-01-26 15:30 · Score: 2, Insightful

..since it means that I can now create a search engine that caches and displays Google's results.

I wonder how long it would take for Google to cry foul for doing exactly what they do ?

This sounds very familiar... by vga_init · 2006-01-26 16:31 · Score: 1

I can remember reading about a very similar case that happened years ago. I believe it is detailed in one of my Lawrence Lessig books. It basically went like this: a student at a university had developed a search engine with a caching function. If you think about it, that's exactly the kind of technology that Google has.

The poor boy was threatened with lawsuits. I don't know whether he tried to fight them or not, but he did get shafted. If memory serves correct, he settled and the RIAA or whoever squeezed him for as much as they could (which wasn't very much). Hopefully this case with Google sets a precedent that may be used in other cases.

Why not read the law instead of making it up? by Anonymous Coward · 2006-01-26 18:31 · Score: 0

I don't care *what* someone's ToS says. If you need a ToS, then you can't allow anonymous access.

But you can, and you are wrong.

A web site is a work that can be protected by copyright. As such, a person holding the copyright in a website can set terms and conditions as to who, if anyone, may distribute and reproduce works embodied in the site, orthogonally to how he or she makes the site available from a purely technical standpoint.

This is well established in law. In fact there are cases which are exactly on point, such as Intellectual Reserve v. Utah Lighthouse Ministries. That case dealt with someone who had, effectively, cached a copy of someone else's website, and was found to be infringing for doing so.

More than that, that case established (although it is not the only one) that people who merely view a web site infringe the copyright of the author, absent some permission to do so. (The reason is based on reasoning in MAI Systems Corp v. Peak Computer, which sets out that storing a work in RAM is making a copy under the Copyright Act definition).

Now, the idea that implicit licenses exist is also established in law. However, the scope of any "implicit license" is going to be quite limited, and it is not clear that a permanent and unauthorized -- and "opt out" --- cache is at all justifiable.

The instant case was decided as it was because the lawyer in question basically orchestrated the circumstances to bilk Google for some money. The court found this to be an abuse and sided with Google. If the lawyer's case wasn't such an obvious scam, I doubt it would have went the way it did. On appeal, I have no doubt parts of this ruling would be overturned.

MOD PARENT UP by l2718 · 2006-01-27 02:08 · Score: 1

'nuff said

Slashdot Mirror

Google's Cache Ruled Fair Use

213 comments