Archive.org Sued By Colorado Woman
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
Did she post the notice properly, though? There are laws about how you must post a 'no trespassing' sign on physical properties.
Similarly, there are ways to post that notice on your website as well. robots.txt comes to mind. If she didn't bother to post the notice correctly, the case should be just thrown out.
"If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
TFA clearly states that there was no robots.txt file. I suppose she expects us (programmers) to rush out and perfect natural language processing so all our spiders can read her stupid notice.
This appears to be it.
Oh, and Ms. Shell, 1996 called. They want their website design back.
PS - By clicking on the link above you are agreeing to all the stuff Ms. Shell posted on her site.
"We make our world significant by the courage of our questions and by the depth of our answers." Carl Sagan
Sitting in my day care, the art is decopainted.
It appears her site is at http://www.profane-justice.org/
s e456.cfm
Check out this article here: http://www.phillipsnizer.com/library/cases/lib_ca
According to this, she requested that the site be removed from the Archive in December, 2005, and they complied. They're actually countersuing her. They moved to have her claims dropped for various reasons, but the court chose to only drop the ones related to conversion, civil theft and the RICO claims. The issue of breach of contract and copyright infringement still apply.
I think it's absolutely ridiculous that this can go forward, especially when there are two established methods to stop the Archive's activity: The opt-out, which will remove history, and robots.txt (which she didn't use and appears to still not use), which will prevent that spider from ever archiving her site again.
Her site shows up in Google, I wonder why she hasn't sued them? Could it be that she likes the exposure of the big search engine, but doesn't want any history of her site archived by the Internet Archive?
I find it interesting how information is used selectively here. She is cast as the victim in the second and third paragraphs, with the standard foster home sob story. She was supposedly a wonderful person, but then we find, buried in parentheses a paragraph later, the bolded text above. Hmm, pregnant...cheerleader...
It also looks like her own kids reported her for child abuse and went to live with their father, and she's pretty ticked about that.
She wonders why she's disliked by the court system. Well, the evidence is all over the article:
And look how effective she is! Heck, this belongs on Fark not Slashdot.
Check out her current robots.txt file.
As a devil's advocate, though, what enforcement is there of robots.txt?
I could easily write a program that runs on my workstation and completely ignores it. In fact, I have a offline-browser that downloads sites and *does* completely ignore it while spidering for which pages to download (I won't name names.) There's nothing technical requiring spiders to honor it, presumably there's no legal system to honor it, it's all just trust.
Next time this comes up in court, the case might be a much more interesting, "I had a robots.txt file, but [search engine] ignored it!"
Comment of the year
Quoted:
IF YOU COPY OR DISTRIBUTE ANYTHING ON THIS WEB SITE, YOU ARE ENTERING INTO A CONTRACT. SEE COPYRIGHT NOTICE & SECURITY AGREEMENT (READ BEFORE ACCESSING THIS WEBSITE) - Copyright 1996- 2007, Suzanne Shell and individual contributors where appropriate. The content if this web site is intended to generate income, it is not free if you intend to archive, copy, print or distribute anything electronically fixed herein.
Which of course implies this is a commercial website.
However, if you go to the contact info page, we can see her PGP key (...) contains:
-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: PGP 8.1 - not licensed for commercial use: www.pgp.com
GG?
If you were offended by anything I said... No, I'm not sorry. Please lighten up.
Suzanne Shell
14053 Eastonville Rd.
Elbert, CO 80106
719.749.2971
For those looking to share your views, Suzanne has asked that we continue to contact her organization at her official "non-web" addresses.
By reading this post or having it copied into your browser's cache, you agree to give me a permanent, irrevocable, fully paid-up license, at my option, to revoke, amend, terminate, or waive any breach of any contract of adhesion, EULA, click-through license, or any other such license or contract which does not bear my physical (i.e. the non-electronic kind made with a pen) signature or the physical signature of a designated agent you have entered with me, or to take that action on your behalf. In the event of any disagreement pertaining to, arising under, or relating to this license, you give me the choice of law and venue and agree not to oppose any changes of venue, motions for removal to or from federal court, or objections to standing that I request and that you give me the right to waive any objections on your behalf. In the event of any breech, you agree to pay all of my court costs and attorney's fees.
:] Honestly, I hope this'll get thrown out soon, the judge just might not be able to do that at this stage in the litigation. I mean, hell, how long as SCO vs. IBM been going on now?
If you do not agree to this, you must get my signature in pen and ink stating that whatever license or contract you propose is not subject to this agreement. After all, if you don't think this is valid, just why the HELL do you think some stupid thing you put on your web page is!?
-----
There, put that bugger on your website, and let them weasel out of that
It's OK now - after negotiating with Ms. Shell she agreed to place the contents of her site into the public domain. Here's a copy of our contract:
GET / HTTP/1.1
User-Agent: By accepting this HTTP GET request you agree to release into the public domain the entire contents of this web site.
Host: www.profane-justice.org
Pragma: no-cache
Accept: */*
HTTP/1.1 200 OK
Date: Sun, 18 Mar 2007 02:11:09 GMT
Server: Apache
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Connection: close
Transfer-Encoding: chunked
Content-Type: text/html