Archive.org Sued By Colorado Woman
An anonymous reader writes "The Internet Archive is being sued by a Colorado woman for spidering her site. Suzanne Shell posted a notice on her site saying she wasn't allowing it to be crawled. When it was, she sued for civil theft, breach of contract, and violations of the Racketeering Influence and Corrupt Organizations act and the Colorado Organized Crime Control Act. A court ruling last month granted the Internet Archive's motion to dismiss the charges, except for the breach of contract claim. If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."
Did she post the notice properly, though? There are laws about how you must post a 'no trespassing' sign on physical properties.
Similarly, there are ways to post that notice on your website as well. robots.txt comes to mind. If she didn't bother to post the notice correctly, the case should be just thrown out.
"If you make people think they're thinking, they'll love you; But if you really make them think, they'll hate you." - DM
Clearly she should have. However, she didn't and that's the issue at stake here. Is ignorance of the procedure an excuse?
No it's not an excuse. By placing her information on the internet, she must comply with the standards of the internet. The important thing is that there was a legitimate way to prevent this which she could have implemented, but she didn't. Just as the cartoon advertisement got in a shitload of trouble in Boston. It's not that they weren't allowed, but rather they simply needed to file the proper form.
If you were to use a felt tip marker to put your copytight notice on a braille flyer, but didn't put the notice in braille, and someone who read the braille and was incapable of reading the marker then sent that notice to their friends, I suspect they would not be in violation of the copyright notice.
Is it just my observation, or are there way too many stupid people in the world?
' If Shell prevails on that count, sites like Google will have to get online publishers to 'opt in' before they can be crawled, radically changing the nature of Web search."'
No that would only change the nature of the web for US citizens, just like the online internet gamble thingie, the rest of us will shrug and move about our businesses as usual.
dsshell@ix.netcom.com
wget -r -l0 --delete-after http://www.profane-justice.org/
45 5F E1 04 22 CA 29 C4 93 3F 95 05 2B 79 2A B2
Ok so lets say the judge rules in her favor. Would it be so difficult if robots.txt becomes opt-in. Meaning the no robots.txt equals no spiders. If you want your website on a search engine you have to put in a robots.txt. It would alter the equation for sure, but things would continue working.
I wish I could sue this woman for inane stupidity. Seriously, when I was 12 I made a website, and it had robots.txt. Sure, for no particular reason, but it had the file. If this woman overlooks a simple web protocol that a 12 year old can understand and implement, how can she hope to offer advise in anything online? And for that matter, the site is an eyesore.
By putting material on the web, it is understood that any publicly accessible material is copied multiple times every time the page is viewed. The ISPs get it, the browser gets it, the disk cache gets it, the user reads it. The internet is a *protocol* for copying data. To put a silly notice with insane demands at the bottom of the page is not simply futile, but laughably ridiculous. To sue over having content copied from a webserver (which to put it in terms that she might understand is basically like a public photocopier with her webpages permanently installed for every passerby to simply push *copy* and read the material) is like suing people for having a picture of that happens to have your billboard in the background. You put the data out to be seen, but as soon as it's not exactly where you want it, you get angry.
Above and beyond that, it is clear that Archive.org is not trying to steal her profit. You can opt out. You can tell it not to archive in the first place. You can even make a simple script that blocks access to the rest of the site until you click "I agree" to the terms. None of these solutions would take more than 5 minutes.
A robots.txt file is not a contract or a technical restriction, but rather the standard means of communicating behaviors that contradict the inherent technical structure of the internet. If someone ignores your robots.txt file, that in and of itself is not a violation of anything. But it does mean that you have properly stated your policy. Your policy may not have the force of law (and by many accounts, it shouldn't), but if you don't state that policy in the proper form it can be assumed to have no force, rather than just that the enforcability of the policy must be examined.
As an example, having a website is like having a store. If you leave the front door of your store open, with the lights on, and someone inside, entering is not a tresspass. If you put a small lock on that door, no matter how easy to pick, entering that store becomes tresspass. If you put a huge sign on the door that says "door lock is broken, private property, KEEP OUT," then entering that store is very probably tresspass, unless you're blind. Putting up a huge sign on the door that says "DO NOT ENTER", but doing so in braille, will probably not count as sufficient warning.
The ______ Agenda
how exactly does it help "freedom to acquire useful knowledge" to stop people from being able to acquire and use against her in court useful knowledge of what a scumbag she is?
Yes, it would be hard. However the technical problems would certainly be solvable for a company like google. The real problem is that this would serve to further entrench google's monopoly on search. I love google as much as the next guy, but I certainly don't want to start introducing measures that will make it more difficult for a google-killer to arise.
That's the worst analogy I've ever heard that didn't fulfill Godwin's Law. The web was designed to be a public place for the sharing of information. Trying to think of it like the inside of someone's house is just going to lead you in the wrong direction. A much better analogy would be that information posted on the web is like a campaign poster or some other notice posted on someone's lawn. Yes, it's hosted on private property and the information posted there belongs to the person posting it, but it is displayed publicly and acting as if it weren't is just silly.
"While I understand the importance of web searches, would it be so hard to change the nature of web crawlers so that they didn't spider a site unless they were specifically allowed?"
.... wait for it .... robots.txt.
I guess you could, you could just as easily put terms in the licenses of web servers that state that by using the web server software, you agree to let all your documents be crawled except where you deny that with
That wouldn't be too hard either would it?
all the best,
drew
http://www.youtube.com/watch?v=vcaf2ThG7q4
UFO seen in skies over Winton!
FreeMusicPush If you want to see more Free Music made, listen to Free
Actually, unless I am googling the wrong person, it appears Suzanne Shell and her site are connected with Scientology. This would be a big surprise, as they never launch nuisance suits or otherwise harrass archive.org!
Who's supposed to grow up? The attention whore that filed suit, or the person trying to fulfill her wishes?