Archive.org Sued By Colorado Woman
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
TFA clearly states that there was no robots.txt file. I suppose she expects us (programmers) to rush out and perfect natural language processing so all our spiders can read her stupid notice.
No, she didn't post the notice properly:
The case should be thrown out, period. She should just have learned her lesson and used a proper robots.txt file next time. If you're going to post stuff on the Internet and don't want it to beb indexed or archived, you should know what you're doing. If you don't, it's your problem. The lawsuit is frivolous and inane.
I hear there's rumors on the Slashdots
Sitting in my day care, the art is decopainted.
It appears her site is at http://www.profane-justice.org/
s e456.cfm
Check out this article here: http://www.phillipsnizer.com/library/cases/lib_ca
According to this, she requested that the site be removed from the Archive in December, 2005, and they complied. They're actually countersuing her. They moved to have her claims dropped for various reasons, but the court chose to only drop the ones related to conversion, civil theft and the RICO claims. The issue of breach of contract and copyright infringement still apply.
I think it's absolutely ridiculous that this can go forward, especially when there are two established methods to stop the Archive's activity: The opt-out, which will remove history, and robots.txt (which she didn't use and appears to still not use), which will prevent that spider from ever archiving her site again.
Her site shows up in Google, I wonder why she hasn't sued them? Could it be that she likes the exposure of the big search engine, but doesn't want any history of her site archived by the Internet Archive?
The inaccurate Slashdot description would make you believe that, but actually she has this on her webpage: "IF YOU COPY OR DISTRIBUTE ANYTHING ON THIS WEB SITE, YOU ARE ENTERING INTO A CONTRACT".
She filed pro se.
s e456.cfm
http://www.phillipsnizer.com/library/cases/lib_ca
Suzanne Shell
14053 Eastonville Rd.
Elbert, CO 80106
719.749.2971
For those looking to share your views, Suzanne has asked that we continue to contact her organization at her official "non-web" addresses.
It isn't just a "nice convention". It's a sufficiently reasonable precaution available to plaintiff to effectively avoid the inadvertent disclosure of copyrighted documents. Failure to provide a simple robots.txt file evidences a lack of reasonable precaution and undermines plaintiff's claims to redress in a court of law.
In her defense it seems she probably needs the money after being fined $6000 in a Colorado state court a few months ago for a contempt violation (unauthorized practice of law) after she participated in three separate Colorado court cases under a power of attorney when she had no prior involvement- after having been warned on a prior occasion that this was illegal in the state of Colorado. In fact it's illegal everywhere except Slashdot. But of course it's lies, all lies!She needs a good spanking.
It's OK now - after negotiating with Ms. Shell she agreed to place the contents of her site into the public domain. Here's a copy of our contract:
GET / HTTP/1.1
User-Agent: By accepting this HTTP GET request you agree to release into the public domain the entire contents of this web site.
Host: www.profane-justice.org
Pragma: no-cache
Accept: */*
HTTP/1.1 200 OK
Date: Sun, 18 Mar 2007 02:11:09 GMT
Server: Apache
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Connection: close
Transfer-Encoding: chunked
Content-Type: text/html