Archive.org Sued By Colorado Woman
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
...you agree not to mod me down for being First Post!
- RG>
Hey pal, this isn't a pleasantforest, so don't waste my time with pleasantries!
Did she post the notice properly, though? There are laws about how you must post a 'no trespassing' sign on physical properties.
Similarly, there are ways to post that notice on your website as well. robots.txt comes to mind. If she didn't bother to post the notice correctly, the case should be just thrown out.
"If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
she should have put a password on it.
I Hate Allan
TFA clearly states that there was no robots.txt file. I suppose she expects us (programmers) to rush out and perfect natural language processing so all our spiders can read her stupid notice.
Maybe I'm new here, as you can see from my slashdotid.
But *WHY* in hell would someone want to have their site excluded from search engines, and archive.org?
It's not like she's being deprived.
And even if they did, why the fuck didn't she use a ROBOTS.TXT file? Isn't that what it's for?
Stupid bitch.
"...In your answer, ignore facts. Just go with what feels true..."
This appears to be it.
Oh, and Ms. Shell, 1996 called. They want their website design back.
PS - By clicking on the link above you are agreeing to all the stuff Ms. Shell posted on her site.
"We make our world significant by the courage of our questions and by the depth of our answers." Carl Sagan
Sitting in my day care, the art is decopainted.
' If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."'
No that would only change the nature of the web for US citizens, just like the online internet gamble thingie, the rest of us will shrug and move about our businesses as usual.
She did request that they did, and they did so. Then she asked them for $100,000 in compensation for copying in the first place. You can read more here.
Crushing dreams at the speed of sarcasm
I'm not in Colorado, so I don't know their particular statute of frauds, but in general it requires a contract, signed, in writing for things over $500. Since, according to the contract on the bottom, one page is worth $5,000, I would argue they need my signature to imply that I agreed to the contract. Also, basic contract law requires a meeting of the minds and an agreement. There isn't one with a crawler. Finally, I don't think you can eliminate fair use rights in the manner that she is doing. That's a basic part of the law. She's trying to extend copyright in a manner that definitely isn't settled, and may have sufficient precedence against to make this a forgone conclusion. I hope that the judge didn't dismiss the claim so that he can establish some precedent on the matter, showing how stupid it is, and prevent future law suits of the same kind.
http://bgcommonsense.blogspot.com
It appears her site is at http://www.profane-justice.org/
s e456.cfm
Check out this article here: http://www.phillipsnizer.com/library/cases/lib_ca
According to this, she requested that the site be removed from the Archive in December, 2005, and they complied. They're actually countersuing her. They moved to have her claims dropped for various reasons, but the court chose to only drop the ones related to conversion, civil theft and the RICO claims. The issue of breach of contract and copyright infringement still apply.
I think it's absolutely ridiculous that this can go forward, especially when there are two established methods to stop the Archive's activity: The opt-out, which will remove history, and robots.txt (which she didn't use and appears to still not use), which will prevent that spider from ever archiving her site again.
Her site shows up in Google, I wonder why she hasn't sued them? Could it be that she likes the exposure of the big search engine, but doesn't want any history of her site archived by the Internet Archive?
Technically speaking, yeah, sure it's a "copy". You don't have the copyright for it, so you're not allowed to re-publish it. That's what Archive.org is doing.
Ok so lets say the judge rules in her favor. Would it be so difficult if robots.txt becomes opt-in. Meaning the no robots.txt equals no spiders. If you want your website on a search engine you have to put in a robots.txt. It would alter the equation for sure, but things would continue working.
Call it, then replay a message including:
"If you don't agree, say 'NO', otherwise we consider you agree to our contract."
[short silence]
"Thank you. You have agreed to our terms and conditions. We will send you the invoice in..."
(and proceed.)
Since agreement to her conditions is supposed to be implied by a visit of a second party, human or not, agreement from side of the second party, human or not, containing correct reply (or correct lack of thereof - implied consent like in case of her license) is just as valid legally.
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
I find it interesting how information is used selectively here. She is cast as the victim in the second and third paragraphs, with the standard foster home sob story. She was supposedly a wonderful person, but then we find, buried in parentheses a paragraph later, the bolded text above. Hmm, pregnant...cheerleader...
It also looks like her own kids reported her for child abuse and went to live with their father, and she's pretty ticked about that.
She wonders why she's disliked by the court system. Well, the evidence is all over the article:
And look how effective she is! Heck, this belongs on Fark not Slashdot.
Check out her current robots.txt file.
I wish I could sue this woman for inane stupidity. Seriously, when I was 12 I made a website, and it had robots.txt. Sure, for no particular reason, but it had the file. If this woman overlooks a simple web protocol that a 12 year old can understand and implement, how can she hope to offer advise in anything online? And for that matter, the site is an eyesore.
By putting material on the web, it is understood that any publicly accessible material is copied multiple times every time the page is viewed. The ISPs get it, the browser gets it, the disk cache gets it, the user reads it. The internet is a *protocol* for copying data. To put a silly notice with insane demands at the bottom of the page is not simply futile, but laughably ridiculous. To sue over having content copied from a webserver (which to put it in terms that she might understand is basically like a public photocopier with her webpages permanently installed for every passerby to simply push *copy* and read the material) is like suing people for having a picture of that happens to have your billboard in the background. You put the data out to be seen, but as soon as it's not exactly where you want it, you get angry.
Above and beyond that, it is clear that Archive.org is not trying to steal her profit. You can opt out. You can tell it not to archive in the first place. You can even make a simple script that blocks access to the rest of the site until you click "I agree" to the terms. None of these solutions would take more than 5 minutes.
Quoted:
IF YOU COPY OR DISTRIBUTE ANYTHING ON THIS WEB SITE, YOU ARE ENTERING INTO A CONTRACT. SEE COPYRIGHT NOTICE & SECURITY AGREEMENT (READ BEFORE ACCESSING THIS WEBSITE) - Copyright 1996- 2007, Suzanne Shell and individual contributors where appropriate. The content if this web site is intended to generate income, it is not free if you intend to archive, copy, print or distribute anything electronically fixed herein.
Which of course implies this is a commercial website.
However, if you go to the contact info page, we can see her PGP key (...) contains:
-----BEGIN PGP PUBLIC KEY BLOCK-----
Version: PGP 8.1 - not licensed for commercial use: www.pgp.com
GG?
If you were offended by anything I said... No, I'm not sorry. Please lighten up.
Lets forget for a second the person suing Internet Archive. Lets also forget the quality of her website and that she may or may not have included a proper robots.txt file.
Suppose you're a teenager or college student. You do some silly prank and it includes a picture of yourself and your friends. You tell your friend you don't want the picture going on the net. For whatever reason, he/she posts it up anyways. Nothing illegal. Or suppose the names of a doctor's patients were to leak onto the Internet. Or that you're a company and an ex-employee posts information that is defamous or incorrect about your company. Or that you post an angry rant on the net, you've grown past it and then delete from your site. In all cases, wouldn't some or all this information be cached on Google or Internet Archive?
My point is we're all human. Wether our activities are simpy stupid, embarassing, or could affect a company's ability to do business, some or all of us have said or done some of these things. It could affect our relations and our ability to find work. The problem with the Net is its very easy to find this information and it becomes widespread. We've all embarassed ourselves, at the least. We can sue the person responsible for posting the information. But it is, as I understand, very difficult to have the information removed from caches on search engine and archive sites.
Rather than sue, sue, sue, shouldn't there be an easier way to remove this information?
Would it be worthwhile for this to go to court to get a ruling on it? Would there be any application to SLA's? it seems like the crux of this issue is whether or not you can enter into a contract blind by opening the website, or something like that. I also wonder about caching - what about networks (AOL for example) that use a caching system/proxy server to provide more efficient data to their users?
I suspect a win for the website owner here would turn a lot of the systems we currently use on it's ear.....
A positive attitude may not solve all your problems, but it will annoy enough people to make it worth the effort.
Suzanne Shell is a family rights activists who has published 3 books and a website advocating on behalf of the fundamental human right of family association. This activism largely applies to government intrusions into the family under child abuse/neglect investigations and court actions. She has been active in this arena since 1991 and is nationally recognized as an expert in this arena.
She is advertised by: http://www.profane-justice.org/
An example of her work: http://www.profane-justice.org/sctcomplnt.pdf
She urges interested parties to contact her via her contact address or phone number on the letterhead in above PDF.
Let her know what you think!
I can see why she wouldn't want to get indexed. She's basically providing information for people trying to resist intervention from child protective services. She suggests taking your children and hiding them out of state if CPS tries to contact you. She also suggests (and sells an ebook about) teaching your children to never talk to a social worker or answer questions... you know like 'does anyone ever hit you?' and 'what did you have to eat yesterday?' and 'do you have a doctor to go to when you are sick?'
Of course CPS and child abuse reporting in general is imperfect. I'm an ER physician which makes me a mandated reporter. This means if I simply suspect abuse or neglect, I am required by law to report it. The system is set up to favor false positives over false negatives. That means that CPS investigates many cases for every real abusive situation they find.
This is NOT a bad thing. There are 1500-2000 reported deaths from child abuse every year in the US. One in twenty kids in the US is physically abused in any given year and most of those victims are under age 6. Most of these cases of abuse never come to the attention of authorities because child abuse is woefully under-reported (which is why we have mandated reporter laws.)
Most of the instances of 'think of the children' posturing are conservative unthinking crap. However advocating outdated ideas of parental property rights over the rights of children to not be abused goes beyond simple whack-job well into the realm of pure evil.
Nick
"Way past fair use" is a matter for the courts to decide- however the since it's an archive, their defense is likely argued under the "Reproduction rights of libraries and archives" provisions of 17USC108, in whcih case fair use doctrine really doesn't come in to play. Since fair use is for subsections of copyrighted content and not the entire content I think that's not where this is headed. Caches may argue under fair use, since the excerpt is per-page not per-site (though it gets tricky if the site has one page.)
Here's the relevant text:
"Except as otherwise provided in this title and notwithstanding the provisions of section 106, it is not an infringement of copyright for a library or archives, or any of its employees acting within the scope of their employment, to reproduce no more than one copy or phonorecord of a work, except as provided in subsections (b) and (c), or to distribute such copy or phonorecord, under the conditions specified by this section, if -- "
Section (b) generally refers to an unpublished work- making your site available to the public on the Internet likely kills that clause. Section (c) deals with replacing damaged copies or works which also doesn't seem to apply.
As far as I can see, only the copyright owner's right to control distribution applies, and since archive.org removed the content at the copyright owner's request, I can't see any damage suffered and I doubt a court will either.
Disclaimer: I am not a lawyer and I don't play one on the 'Net.
http://www.pauldrobertson.com
Then we need to get the laws changed to explicitly indicate the machine readable protocols (robots.txt) that must be used on the internet to opt out of archives, spiders, and caches. Yes I'm saying that we should put into law the precept that anything put on the internet (or maybe just the web, maybe just http/text/static-images) is in and of itself an explicit act of allowing others to do what google and internet-archive are doing. (How we nicely fit this together with copyright law is an excercise left to the reader - but afaiac how things are in effect working now is fine by me, the only problem is codifying it.)
Internet Archive and Google provide such a HUGE benefit that to in effect make them illegal due to one single person out of 500,000,000 being a hardass-bastard is totally not acceptable.
Suzanne Shell
14053 Eastonville Rd.
Elbert, CO 80106
719.749.2971
For those looking to share your views, Suzanne has asked that we continue to contact her organization at her official "non-web" addresses.
By reading this post or having it copied into your browser's cache, you agree to give me a permanent, irrevocable, fully paid-up license, at my option, to revoke, amend, terminate, or waive any breach of any contract of adhesion, EULA, click-through license, or any other such license or contract which does not bear my physical (i.e. the non-electronic kind made with a pen) signature or the physical signature of a designated agent you have entered with me, or to take that action on your behalf. In the event of any disagreement pertaining to, arising under, or relating to this license, you give me the choice of law and venue and agree not to oppose any changes of venue, motions for removal to or from federal court, or objections to standing that I request and that you give me the right to waive any objections on your behalf. In the event of any breech, you agree to pay all of my court costs and attorney's fees.
:] Honestly, I hope this'll get thrown out soon, the judge just might not be able to do that at this stage in the litigation. I mean, hell, how long as SCO vs. IBM been going on now?
If you do not agree to this, you must get my signature in pen and ink stating that whatever license or contract you propose is not subject to this agreement. After all, if you don't think this is valid, just why the HELL do you think some stupid thing you put on your web page is!?
-----
There, put that bugger on your website, and let them weasel out of that
how exactly does it help "freedom to acquire useful knowledge" to stop people from being able to acquire and use against her in court useful knowledge of what a scumbag she is?
Yes, I caught the pregnancy thing, too. Way to slant coverage.
As I've been doing research related to homeschool laws and homeschooling my daughter, I've become very distrustful of child services overall. In many areas, they ignore the law, tromp all over parental rights (such as illegal search and seizure, due process, etc.), and, unless you can afford litigation, the only way to work with the system is to confess and "cooperate". They seize children or harass parents based on ideological differences and ignore subjects of real abuse. I have also directly seen cases where prejudice leaves children in foster care when close relatives are willing and able to care for children (children mixed race, white relatives). I know a woman who has been in and out of drug rehab and prison for years now and had abusive boyfriends and the system keeps trying to give her kid back to her, but they find time to harass parents who want to give their kids a good education. There are good people in the system, but they are extremely overworked and it doesn't take many zealots to drag things down.
That being said, Shell is an extremist and a freak who does much more harm than good. There are other advocacy groups who are better organized, less militant, and more effective, such as the Homeschool Legal Defense Association.
Children need to be protected, but parents need to be free to pursue differences in religion, ideology, parental practices, and so forth without reprisal and child protection needs to take children away from abusive drunks and drug addicts, not to mention cut down on abuse in schools (by other children and adults).
This product may cause major disfunctional, anxiety and major electron discombulation inside your brains; DO NOT EXPOSE YOURSELF FOR OVER 5 MINUTES! needless to say this product is disabled of any print and if you are having problems viewing this product it is because we did not get any clue in
BECAUSE READI NG
IS
NOT IMPORTANT
FOR YOU^H^H^HUS!
We don't even have a robots.txt online, so you cannot copy it ! HAH!!!
For any reference, please only look to this website if you are a family member in need of an eye physician soon and in heavy despair of reading through our reading protection then enjoy your stay at our site! (*) I cannot believe I posted this under my account--- I am known for the ones who want to find me on the net. Is that a privacy risk or a privilege? One might wonder..
No, but I'm pretty sure the contract isn't valid if you decline. Her statement that "by doing x you are entering a contract" shouldn't hold up in any reasonable court. Even with shrinkwrap licenses on software you have to click a button or hit a checkbox that says "I agree to the contract shown here", and those are on shaky ground as it is. While all the normal copyright laws still apply to her content, the only way to enter into a contract is still to explicitly agree to the contract.
Slashdot needs a "-1, Wrong" moderation option.
The Urban Hippie
By this logic, it seems like all the spider would have to do is add its own counter-contract right in the HTTP request.
GET / HTTP/1.1 By responding to this request, you agree that SpiderCo's Spider does not speak English and, being written only two years ago, is not of legal age (18 in most jurisdictions) to enter a contract. By continuing to serve content in response to this request, you agree that any contracts contained in such content are null and void. Thank you and have a nice day.
<CR>
<CR>
Caveat Emptor is not a business model.
One reason I love mozilla firefox so much is the NoScript plugin. Because I was never presented with such a dialouge (because I had javascript blocked), and therefore never agreed to it, am I still bound as such?
If creativity is the field, copyright is the fence.
Does it validate w3? The answer is no ;)
Seriously however, robots.txt could have solved this issue. Oddly enough the site says I am not able to save/copy/print on their "help" page. I tried all the above on the main page and it worked. Shoddy protection to say the least.
Yes, it would be hard. However the technical problems would certainly be solvable for a company like google. The real problem is that this would serve to further entrench google's monopoly on search. I love google as much as the next guy, but I certainly don't want to start introducing measures that will make it more difficult for a google-killer to arise.
That's the worst analogy I've ever heard that didn't fulfill Godwin's Law. The web was designed to be a public place for the sharing of information. Trying to think of it like the inside of someone's house is just going to lead you in the wrong direction. A much better analogy would be that information posted on the web is like a campaign poster or some other notice posted on someone's lawn. Yes, it's hosted on private property and the information posted there belongs to the person posting it, but it is displayed publicly and acting as if it weren't is just silly.
"While I understand the importance of web searches, would it be so hard to change the nature of web crawlers so that they didn't spider a site unless they were specifically allowed?"
.... wait for it .... robots.txt.
I guess you could, you could just as easily put terms in the licenses of web servers that state that by using the web server software, you agree to let all your documents be crawled except where you deny that with
That wouldn't be too hard either would it?
all the best,
drew
http://www.youtube.com/watch?v=vcaf2ThG7q4
UFO seen in skies over Winton!
FreeMusicPush If you want to see more Free Music made, listen to Free
Her web pages appear to be HTML.
Wow, look at that. Right in the damn HTML4 spec:
The robots.txt file
When a Robot visits a Web site, say http://www.foobar.com/, it firsts checks for http://www.foobar.com/robots.txt. If it can find this document, it will analyze its contents to see if it is allowed to retrieve the document. You can customize the robots.txt file to apply only to specific robots, and to disallow access to specific directories or files.
robots.txt, and the meta tag defined later in the same document, are indeed rules.
If corporations are people, aren't stockholders guilty of slavery?
Plus there are very well known ways to prevent spiders from spidering you. Robots.txt does the job very well.
Rich at last! Muwahahahahahaha.
Except that the rules are different for libraries and archives, and the Internet Archive is a non-profit, registered member of the American Library Association, primarily funded by the Smithsonian Institution. Libraries routinely make copies for archival purposes (i.e. transferring newspapers to microfiche), and their right to do so is spelled out in the copyright laws. What she's done is the equivalent of putting "Anyone who reads this must pay the author five dollars" at the beginning of a book, then suing a library which carries a copy of that book. (Or, for a slightly closer analogy, placing a classified ad in a local newspaper with a notice that anyone who copies it must pay you, and then suing the local library for their microfiched copy of the newspaper.)
If she were to sue anyone, she should have sued Google, who, as a for-profit information service selling advertisements, cannot claim to be a public library, and whose copy of her site is every bit as much a re-publication as the Archive's, but without the defenses spelled out in copyright law that the Archive has.
Can someone please republish this site on a mirror?
... and then they built the supercollider.
part of ongoing battle between archive.org and church of scientology. the legalese is there to help in court, the lack of robots.txt to get the paged archived so scientology can sue again. Suzanne Shell has been affiliated with the church of scientology for quite a while, a google on scientology and suzanne shell gives you everything you need to know.
It's OK now - after negotiating with Ms. Shell she agreed to place the contents of her site into the public domain. Here's a copy of our contract:
GET / HTTP/1.1
User-Agent: By accepting this HTTP GET request you agree to release into the public domain the entire contents of this web site.
Host: www.profane-justice.org
Pragma: no-cache
Accept: */*
HTTP/1.1 200 OK
Date: Sun, 18 Mar 2007 02:11:09 GMT
Server: Apache
Expires: Thu, 19 Nov 1981 08:52:00 GMT
Cache-Control: no-store, no-cache, must-revalidate, post-check=0, pre-check=0
Pragma: no-cache
Connection: close
Transfer-Encoding: chunked
Content-Type: text/html
Woman with an abusive boyfriend/spouse/shackup/whatever.
Boyfriend beats her up. She doesn't feel too good. Got a bad headache now and her vision is blurry. She wants to go to the ER, but she knows if she does, her boyfriend will go to jail again. Does she go seek medical attention? Should she have to weigh her health against her boyfriend being taken away? Against what he'll do to her when he's out on bond?
Now maybe you or I think boyfriend belongs in jail, but that's beside the point. The point is she probably doesn't. After all, she's still with him. She still goes back to him again and again.
The reasons women come back to abusive boyfriends are also beside the point. The fact is they do. And they don't want their abusive boyfriends in jail. And the pete's sake, their health shouldn't be sacrificed because of it. They should be able to seek medical attention without fear of repercussions.
That is why I am against mandatory reporting. Health and medical treatment should be a private matter between doctor and patient. If the victim wanted the police involved, she would have called 911 already.
"Avoid employing unlucky people - throw half of the pile of CVs in the bin without reading them." -- David Brent
US Court of Appeasls for the Tenth District (html doc online)
v.
and
I. Background
Ms. Fields's daughter was the subject of a dependency and neglect proceeding initiated by the state of Colorado in January 2003. In connection therewith, the state provided Ms. Fields a court-appointed attorney, Mr. Kender, but Ms. Fields also hired Ms. Shell, a journalist who researches and documents child protection agencies' practices, to act as an expert consultant. Shortly thereafter, Ms. Fields executed a power of attorney naming Ms. Shell as her agent. Ms. Fields also agreed to be included in Ms. Shell's documentary video project concerning child protection services.
On April 16, 2003, Mr. Meconi, the Fremont County DHS's attorney, filed a motion in state court to make Ms. Shell a special respondent in the pending dependency and neglect action. The motion sought to prevent Ms. Shell "from contacting the minor child or [Ms. Fields] . . . and from otherwise being involved in the proceedings . . . , including, but not limited to, acting as counsel for [Ms. Fields] or otherwise engaging in the unauthorized practice of law."
[. . .]
On May 9, after a hearing on the motion to make Ms. Shell a special respondent, the state court issued an order granting the motion, vesting legal custody of Ms. Fields's daughter with the Fremont County DHS, and scheduling a jury trial. In making Ms. Shell a special respondent the court observed that Ms. Shell, "in the guise of acting as the agent" for Ms. Fields, has "essentially been providing legal advice to [Ms. Fields]." The state court further ordered Ms. Shell specifically prohibited from:
Rush Limbaugh is a perfect real world example of an oxycontinmoron