Archive.org Sued By Colorado Woman
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
Did she post the notice properly, though? There are laws about how you must post a 'no trespassing' sign on physical properties.
Similarly, there are ways to post that notice on your website as well. robots.txt comes to mind. If she didn't bother to post the notice correctly, the case should be just thrown out.
"If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
she should have put a password on it.
I Hate Allan
TFA clearly states that there was no robots.txt file. I suppose she expects us (programmers) to rush out and perfect natural language processing so all our spiders can read her stupid notice.
This appears to be it.
Oh, and Ms. Shell, 1996 called. They want their website design back.
PS - By clicking on the link above you are agreeing to all the stuff Ms. Shell posted on her site.
"We make our world significant by the courage of our questions and by the depth of our answers." Carl Sagan
Sitting in my day care, the art is decopainted.
' If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."'
No that would only change the nature of the web for US citizens, just like the online internet gamble thingie, the rest of us will shrug and move about our businesses as usual.
It appears her site is at http://www.profane-justice.org/
s e456.cfm
Check out this article here: http://www.phillipsnizer.com/library/cases/lib_ca
According to this, she requested that the site be removed from the Archive in December, 2005, and they complied. They're actually countersuing her. They moved to have her claims dropped for various reasons, but the court chose to only drop the ones related to conversion, civil theft and the RICO claims. The issue of breach of contract and copyright infringement still apply.
I think it's absolutely ridiculous that this can go forward, especially when there are two established methods to stop the Archive's activity: The opt-out, which will remove history, and robots.txt (which she didn't use and appears to still not use), which will prevent that spider from ever archiving her site again.
Her site shows up in Google, I wonder why she hasn't sued them? Could it be that she likes the exposure of the big search engine, but doesn't want any history of her site archived by the Internet Archive?
Call it, then replay a message including:
"If you don't agree, say 'NO', otherwise we consider you agree to our contract."
[short silence]
"Thank you. You have agreed to our terms and conditions. We will send you the invoice in..."
(and proceed.)
Since agreement to her conditions is supposed to be implied by a visit of a second party, human or not, agreement from side of the second party, human or not, containing correct reply (or correct lack of thereof - implied consent like in case of her license) is just as valid legally.
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
I find it interesting how information is used selectively here. She is cast as the victim in the second and third paragraphs, with the standard foster home sob story. She was supposedly a wonderful person, but then we find, buried in parentheses a paragraph later, the bolded text above. Hmm, pregnant...cheerleader...
It also looks like her own kids reported her for child abuse and went to live with their father, and she's pretty ticked about that.
She wonders why she's disliked by the court system. Well, the evidence is all over the article:
And look how effective she is! Heck, this belongs on Fark not Slashdot.
Check out her current robots.txt file.
As a devil's advocate, though, what enforcement is there of robots.txt?
I could easily write a program that runs on my workstation and completely ignores it. In fact, I have a offline-browser that downloads sites and *does* completely ignore it while spidering for which pages to download (I won't name names.) There's nothing technical requiring spiders to honor it, presumably there's no legal system to honor it, it's all just trust.
Next time this comes up in court, the case might be a much more interesting, "I had a robots.txt file, but [search engine] ignored it!"
Comment of the year
Quoted:
IF YOU COPY OR DISTRIBUTE ANYTHING ON THIS WEB SITE, YOU ARE ENTERING INTO A CONTRACT. SEE COPYRIGHT NOTICE & SECURITY AGREEMENT (READ BEFORE ACCESSING THIS WEBSITE) - Copyright 1996- 2007, Suzanne Shell and individual contributors where appropriate. The content if this web site is intended to generate income, it is not free if you intend to archive, copy, print or distribute anything electronically fixed herein.
Which of course implies this is a commercial website.
However, if you go to the contact info page, we can see her PGP key (...) contains:
-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: PGP 8.1 - not licensed for commercial use: www.pgp.com
GG?
If you were offended by anything I said... No, I'm not sorry. Please lighten up.
Lets forget for a second the person suing Internet Archive. Lets also forget the quality of her website and that she may or may not have included a proper robots.txt file.
Suppose you're a teenager or college student. You do some silly prank and it includes a picture of yourself and your friends. You tell your friend you don't want the picture going on the net. For whatever reason, he/she posts it up anyways. Nothing illegal. Or suppose the names of a doctor's patients were to leak onto the Internet. Or that you're a company and an ex-employee posts information that is defamous or incorrect about your company. Or that you post an angry rant on the net, you've grown past it and then delete from your site. In all cases, wouldn't some or all this information be cached on Google or Internet Archive?
My point is we're all human. Wether our activities are simpy stupid, embarassing, or could affect a company's ability to do business, some or all of us have said or done some of these things. It could affect our relations and our ability to find work. The problem with the Net is its very easy to find this information and it becomes widespread. We've all embarassed ourselves, at the least. We can sue the person responsible for posting the information. But it is, as I understand, very difficult to have the information removed from caches on search engine and archive sites.
Rather than sue, sue, sue, shouldn't there be an easier way to remove this information?
Suzanne Shell is a family rights activists who has published 3 books and a website advocating on behalf of the fundamental human right of family association. This activism largely applies to government intrusions into the family under child abuse/neglect investigations and court actions. She has been active in this arena since 1991 and is nationally recognized as an expert in this arena.
She is advertised by: http://www.profane-justice.org/
An example of her work: http://www.profane-justice.org/sctcomplnt.pdf
She urges interested parties to contact her via her contact address or phone number on the letterhead in above PDF.
Let her know what you think!
I can see why she wouldn't want to get indexed. She's basically providing information for people trying to resist intervention from child protective services. She suggests taking your children and hiding them out of state if CPS tries to contact you. She also suggests (and sells an ebook about) teaching your children to never talk to a social worker or answer questions... you know like 'does anyone ever hit you?' and 'what did you have to eat yesterday?' and 'do you have a doctor to go to when you are sick?'
Of course CPS and child abuse reporting in general is imperfect. I'm an ER physician which makes me a mandated reporter. This means if I simply suspect abuse or neglect, I am required by law to report it. The system is set up to favor false positives over false negatives. That means that CPS investigates many cases for every real abusive situation they find.
This is NOT a bad thing. There are 1500-2000 reported deaths from child abuse every year in the US. One in twenty kids in the US is physically abused in any given year and most of those victims are under age 6. Most of these cases of abuse never come to the attention of authorities because child abuse is woefully under-reported (which is why we have mandated reporter laws.)
Most of the instances of 'think of the children' posturing are conservative unthinking crap. However advocating outdated ideas of parental property rights over the rights of children to not be abused goes beyond simple whack-job well into the realm of pure evil.
Nick
Suzanne Shell
14053 Eastonville Rd.
Elbert, CO 80106
719.749.2971
For those looking to share your views, Suzanne has asked that we continue to contact her organization at her official "non-web" addresses.
By reading this post or having it copied into your browser's cache, you agree to give me a permanent, irrevocable, fully paid-up license, at my option, to revoke, amend, terminate, or waive any breach of any contract of adhesion, EULA, click-through license, or any other such license or contract which does not bear my physical (i.e. the non-electronic kind made with a pen) signature or the physical signature of a designated agent you have entered with me, or to take that action on your behalf. In the event of any disagreement pertaining to, arising under, or relating to this license, you give me the choice of law and venue and agree not to oppose any changes of venue, motions for removal to or from federal court, or objections to standing that I request and that you give me the right to waive any objections on your behalf. In the event of any breech, you agree to pay all of my court costs and attorney's fees.
:] Honestly, I hope this'll get thrown out soon, the judge just might not be able to do that at this stage in the litigation. I mean, hell, how long as SCO vs. IBM been going on now?
If you do not agree to this, you must get my signature in pen and ink stating that whatever license or contract you propose is not subject to this agreement. After all, if you don't think this is valid, just why the HELL do you think some stupid thing you put on your web page is!?
-----
There, put that bugger on your website, and let them weasel out of that
Yes, it would be hard. However the technical problems would certainly be solvable for a company like google. The real problem is that this would serve to further entrench google's monopoly on search. I love google as much as the next guy, but I certainly don't want to start introducing measures that will make it more difficult for a google-killer to arise.
That's the worst analogy I've ever heard that didn't fulfill Godwin's Law. The web was designed to be a public place for the sharing of information. Trying to think of it like the inside of someone's house is just going to lead you in the wrong direction. A much better analogy would be that information posted on the web is like a campaign poster or some other notice posted on someone's lawn. Yes, it's hosted on private property and the information posted there belongs to the person posting it, but it is displayed publicly and acting as if it weren't is just silly.
"While I understand the importance of web searches, would it be so hard to change the nature of web crawlers so that they didn't spider a site unless they were specifically allowed?"
.... wait for it .... robots.txt.
I guess you could, you could just as easily put terms in the licenses of web servers that state that by using the web server software, you agree to let all your documents be crawled except where you deny that with
That wouldn't be too hard either would it?
all the best,
drew
http://www.youtube.com/watch?v=vcaf2ThG7q4
UFO seen in skies over Winton!
FreeMusicPush If you want to see more Free Music made, listen to Free
It's OK now - after negotiating with Ms. Shell she agreed to place the contents of her site into the public domain. Here's a copy of our contract:
GET / HTTP/1.1
User-Agent: By accepting this HTTP GET request you agree to release into the public domain the entire contents of this web site.
Host: www.profane-justice.org
Pragma: no-cache
Accept: */*
HTTP/1.1 200 OK
Date: Sun, 18 Mar 2007 02:11:09 GMT
Server: Apache
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Connection: close
Transfer-Encoding: chunked
Content-Type: text/html