Slashdot Mirror


The Internet Archive Sued Over Stored Pages

Kailash Nadh writes "The Internet archive, which has been storing snapshots of millions of webpages since 1996 has been sued by the firm Harding Earley Follmer & Frailey, Philadelphia. The firm was defending Health Advocate, a company in suburban Philadelphia that helps patients resolve health care and insurance disputes, against a trademark action brought by a similarly named competitor. In preparing the case, representatives of Earley Follmer used the Wayback Machine to turn up old Web pages - some dating to 1999 - originally posted by the plaintiff, Healthcare Advocates of Philadelphia. Last week Healthcare Advocates sued both the Harding Earley firm and the Internet Archive, saying the access to its old Web pages, stored in the Internet Archive's database, was unauthorized and illegal." CT:update note that the submittor got it backwards: Healthcare Advocates is the sueing Wayback and Harding Earley Follmer & Frailey, not the other way around.

151 of 801 comments (clear)

  1. obvious man question by 0110011001110101 · · Score: 5, Insightful

    fsck me if i'm wrong, but wouldn't this be similar to suing someone for referencing an old book I wrote, just because I'd released a new one that didn't contain much of the old information?

    --
    Don't anthropomorphize computers: they hate that.
    1. Re:obvious man question by aussie_a · · Score: 2, Insightful

      No, this is like suing someone for distributing an old book you've written withour the person having your permission.

    2. Re:obvious man question by muszek · · Score: 2, Interesting

      you mean it's like being a library?

    3. Re:obvious man question by aussie_a · · Score: 2, Insightful

      Can one get sued for taking a picture of someone else's property without their permission?

      Again, not really applicable. This would be a case of taking a photo of every page in a book so that the words are legible, and then distributing the photos grouped together (with one or two pages missing).

    4. Re:obvious man question by Professor_UNIX · · Score: 5, Insightful
      No, this is like suing someone for distributing an old book you've written withour the person having your permission.

      Putting up an unprotected web site is akin to putting up a billboard. If I take a picture of the billboard and publish it in a textbook that kids read for the next 20 years, should I be expected to be sued by the billboard company? I'm really sick and tired of companies that have absolutely no clue how the Internet and the world wide web works putting up sites and then expecting you to never cache them anywhere. They have this old mentality that they control the flow of information and frankly, that's just not true anymore.

    5. Re:obvious man question by VernonNemitz · · Score: 4, Insightful

      Per the first question in this thread, NO, this suit is like complaining about a "for sale" sign in a store window being photographed, saved for years, and later viewed. After all, almost everything (there have been a few mistakes) posted on the Web that is publicly accessible was put there to be seen!

    6. Re:obvious man question by aussie_a · · Score: 2, Informative

      you mean it's like being a library?

      I was under the impression that libraries had permission to distribute the content that it does. In fact, in Canada, authors (Canadian ones at least) get given some money to cover their books that are in libraries. I'd say that pretty much means there's an agreement (and not an assumption of one because the author hasn't said no) between libraries and authors.

    7. Re:obvious man question by webview · · Score: 2, Insightful

      Yes, if I never deleted my cache and I happened to go that site, those pages would still be on my machine. Am I going to get sued too?

    8. Re:obvious man question by jarich · · Score: 4, Insightful
      authors and publishers could not have any impact on any sales/giveaways f

      Really? How about those Harry Potter books that were sold a few days ago? :)

    9. Re:obvious man question by vettemph · · Score: 2, Interesting

      Problem is, The plaintif claim the thier robot.txt file instructs the webcrawler to dis-allow access to the older archives of the plaintifs web content. The fact that you can block access to your older PUBLISHED content is disturbing in itself because it is so useful in finding the truth. I should be illegal to tamper with the evidence. The Wayback machine should show everything at set the standard before it's to late. Perhaps it is time to "Rise Up".

      --
      The government which is strong enough to protect you from everything is strong enough to take everything from you.
    10. Re:obvious man question by tod_miller · · Score: 2, Insightful

      I agree, although the fact that the only difference between a copyrighted work as a webpage, and a copyrighted work as a book, is intent. But if someone give you a book for free, and you have a copy, they cannot revoke that, but can they stop you distributing it? (as copies - based on assumption below)

      The fact is, you always copy webpage, even when you don't cache them, it is in memory (and/or pagefile/swap).

      TV is not cached in a normal set, because it is transmissio based, HTTP is just copy a file, and showing it (with other files embedded, css, jpg etc).

      --
      #hostfile 0.0.0.0 primidi.com 0.0.0.0 www.primidi.com 0.0.0.0 radio.weblogs.com
    11. Re:obvious man question by Chuck+Chunder · · Score: 5, Interesting
      Putting up an unprotected web site is akin to putting up a billboard. If I take a picture of the billboard and publish it in a textbook that kids read for the next 20 years, should I be expected to be sued by the billboard company?
      Apparantly, yes.
      --
      Boffoonery - downloadable Comedy Benefit for Bletchley Park
    12. Re:obvious man question by hacker · · Score: 5, Interesting

      I can tell you exactly where the problem lies (and I know this because I have customers who behave this way):

      When they write documents, they write them in HTML format. They send their email, they send itin HTML format. When I asked for them to prepare content for their website, they gave me a Microsoft Word document in HTML format, and said "You don't have to use the same fonts I used in this document, but please keep the layout the same on my website."

      These users equate "a document" to "a website", and they think that once they stop using or sending that document out, that their "website" should be removed as well. They think websites are "sent" to people, not requested "by" people, and that when you close your browser, your "document" is gone.

      That simply is not the case, and people need to be re-educated to understand these technologies and how they work. The Internet was MEANT to be self-healing, in case one node or another went down, information and information pathways would still be functioning.

    13. Re:obvious man question by Jaruzel · · Score: 2, Interesting

      In the UK, the copyright act was amended; photo copying ANY part of a copyrighted document is now considered illegal. The concept of 'fair use' is no longer applicable.

      That said, certain professions (librarians etc) can register as an exception so that they can photocopy a percentage of a document legally (just like the old days...)

      -Jar.

      --
      Together, We Can Make Slashdot Better. I Do NOT Mod ACs. - Check Me Out
    14. Re:obvious man question by Xformer · · Score: 2, Funny

      Ok, alright... first authorized sale.

      --
      All I want is a kind word, a warm bed and unlimited power.
    15. Re:obvious man question by robslimo · · Score: 2, Insightful

      OK people, ignore this guy's "first-post-without-reading-the-article" and read further down in the discussion.

      To sum it up, the plaintifs are claiming that the Wayback Machine didn't obey the robots.txt at their site and are calling it breach of contract.

      Of course, obeying robots.txt is voluntary anyway, so what contract?

    16. Re:obvious man question by Zeinfeld · · Score: 5, Interesting
      To sum it up, the plaintifs are claiming that the Wayback Machine didn't obey the robots.txt at their site and are calling it breach of contract.

      It seems rather more likely that the plaintifs fucked up their robots.txt file entries and that is why they were spidered.

      At the risk of receiving yet another deposition I was part of the conversations that led to robots.txt. It was never intended to be an access control mechanism or an effective content control mechanism within the meaning of the DMCA. The objective was simply to allow sites with automatically generated content to tell the robots that parts of their site are not suitable for spidering.

      So now it looks like we are going to have revisit the business model for the way back machine and work out how to float a littigation fund.

      Actually one way that it could be done is to sign and timestamp material on receipt and offer the signatures as a premium service.

      --
      Looking for an Information Security student project suggestion?
      Try http://dotcrimeManifesto.com/
    17. Re:obvious man question by nine-times · · Score: 2, Insightful
      But if someone give you a book for free, and you have a copy, they cannot revoke that, but can they stop you distributing it? (as copies - based on assumption below)

      Except that it's not someone giving you a free book. Like you said, you have access to copy it, so it's a little more like someone putting a free book on display in a public place right next to a xerox machine and encouraging people to xerox the thing and take it home to read.

      And, in fact, that book will also be cached elsewhere, so it's as if they've set out this easily xerox-able book, with instructions on how to xerox it, and permission to do so, with the intent of putting it through the "xerox network". So you're in a world filled with xerox machines, and unless they're idiots, they should understand that their distribution model relies on (in some cases) other people taking the xerox copy and setting it next to their own xerox machine for others to copy.

      Now, lets say this book publisher releases a new book. And updated version of the same book. They set the new book next to the xerox machine, and ask that everyone in this "xerox network" copy the new book and set that next to their own xerox machines, replacing the old book. Now let's say that one guy just doesn't replace the first edition, but sets the second edition out next to it, leaving the old one where it is, and gives people the choice. He's doing this as a public service, so people can study the evolution of literature. He's doing it for free.

      You want page 59? Well, page 59 of the book was publicly available in two different versions. Which one do you want?

      In this strange hypothetical situation, would the book publisher have a right to sue? Ok, maybe it's not exactly an apt example either, but saying "the only difference between a copyrighted work as a webpage, and a copyrighted work as a book, is intent" isn't exactly right either.

    18. Re:obvious man question by robslimo · · Score: 4, Informative

      I should have pointed out that the aspect of robots.txt they're complaining about is an "extension" of sorts where archive.org will remove any archived copies of your site if it disallows the ia_archiver spider *and* they submit their site to be recrawled (guaranteeing that the spider will see the new directive).

    19. Re:obvious man question by randyflood · · Score: 2, Insightful

      Let's put this into perspective.

      Compare this case to previous cases that courts have considered where Ticketmaster has tried to sue people who have *linked* to their web site. The courts have said that links themselves are not copyright infringement because no actual copying occurs. But in their reasoning they have really clearly implied that if these other web sites had copied content from the Ticketmaster web site, that Ticketmaster would have won the Copright Infringement claim.

      See

      http://www.bc.edu/bc_org/avp/law/st_org/iptf/headl ines/content/2000040401.html

      The DMCA stuff is much more dicey.

      But, I think that they actually have a prima facia case of copyright infringement.

      IMNAL

      Randy

      --
      Randy.Flood@RHCE2B.COM
    20. Re:obvious man question by mrchaotica · · Score: 4, Interesting

      Or just move the hosting to Sealand and ignore lawsuits. Although IANAL, I think this is the more reasonable course of action, since you have to be insane to deal with the insanity of today's copyright law.

      Well, either that or try to get absorbed by the Library of Congress or something...

      --

      "[Regarding the 'cloud,'] ownership was what made America different than Russia." -- Woz

    21. Re:obvious man question by blamanj · · Score: 2, Informative

      Putting up an unprotected web site is akin to putting up a billboard. If I take a picture of the billboard and publish it in a textbook that kids read for the next 20 years, should I be expected to be sued by the billboard company?

      Sadly, the answer to this is probably yes. Two examples:
      1) Coke sues a photographer for including one of its billboards in a picture.
      2) The filmmakers of "Bewitched" were forced to edit the Transamerica pyramid out of their shots of the San Francisco skyline because the building is a registered trademark.

      Our IP laws seem destined to be controlled by corporate greed and congressional stupidity.

    22. Re:obvious man question by EdelFactor19 · · Score: 2, Insightful

      I have to disagree with your comment. Your basis of using a back isnt applicable it would be like me taking a picture of a billboard advertisement (which I don't own, and is publicly viewable), and doing this of every billboard in town, everyday, and then distributing that somewhere....

      --
      "Jazz isn't dead, it just smells funny" ~Frank Zappa
      EdelFactor
    23. Re:obvious man question by IgnoramusMaximus · · Score: 2, Insightful
      But, I think that they actually have a prima facia case of copyright infringement.

      If true, say goodbye to Google Cache.

    24. Re:obvious man question by randyflood · · Score: 2, Insightful


      Well, pretty much any web site that just takes a copyrighted work from one web site, rips it off and then puts it up on their own web site is, and always has been in danger.

      The only real defense I can see either google or the Internet Archive raising is "Fair Use". But, if the court rules in favor of the Internet Archive and Google on the "fair use" issue, then how is in going to phrase it exactly? How are they going to phrase it in such a way so that I can not just mirror any copyrighted page I want to my geocities.com web site?

      Trust me, I love the google cache. It let's me bypass content filtering. But, I think it will be a tricky issue for the court to come up with a way to apply the existing copyright law in a rational way to things like the Google cache.

      You could use the nocache directives to hint to things whether or not they should be included in things like the Internet Archive or the Google cache. But that would break the efficiency of proxies and stuff. So, maybe the way it ought to really work is that we should just have a new directive or something that says "don't archive this site" or somesuch or "google is allowed to archive this, but no-one else"(that doesn't count proxy servers). Now, I know that anyone *could* cache the stuff to their hearts content regarldess of the flags. But if they republished the stuff on the web, and you found them, you could sue them or something.

      --
      Randy.Flood@RHCE2B.COM
    25. Re:obvious man question by BlogPope · · Score: 2, Insightful
      But wouldn't the archive.org people also have a copy of the applicable robots.txt on file?

      Doesn't really matter. Robots.txt is a convention, it was not intended as an access control device, and has no force of law. All the public documents on robots are laughably out of date (the web server might be doing important stuff, like gopher, so be sure not to load the machine), which cause it to make bad recommendations (pipelining multiple pages accross a single connection, part of the http 1.1 specs to reduce server load, is incompatible with the recommendations)

      If their case revolves around robots.txt instead of copyright law, I see them losing, fast.

      --
      My other car is a Popemobile
  2. Robots.txt? by AltGrendel · · Score: 3, Insightful

    Did they set up their robots.txt file properly? If not, they may not have a case.

    --
    The simple truth is that interstellar distances will not fit into the human imagination

    - Douglas Adams

    1. Re:Robots.txt? by notmatt · · Score: 2, Interesting

      "But on at least two dates in July 2003, the suit states, Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages." If would appear that they did kind of. That seems to be at the heart of the matter. The Internet Archive don't seem to be very surprised that it is happening. I don't think the company doing the suing have much of a case really, but IANAL.

    2. Re:Robots.txt? by Looke · · Score: 4, Insightful

      Why would a missing robots.txt imply that others are allowed to distribute the content?

    3. Re:Robots.txt? by Baddas · · Score: 3, Informative

      As it says in the article, the robots.txt is an entirely voluntary measure. The IA doesn't need to obey it, but they do, in order to be a courteous member of the internet.

    4. Re:Robots.txt? by Illserve · · Score: 2, Informative

      They don't have a case either way!

      Adherence to robots is voluntary, done in good faith by crawlers for the general well being of the web.

    5. Re:Robots.txt? by mabinogi · · Score: 2, Insightful

      That paragraph in the article completely baffled me.

      I'm not sure if it was the reporter or Healthcare Advocates - but someone has absolutely no idea how robots.txt, web crawlers, web servers and the Internet Archive work.

      They're implying that when the lawyers queried the old versions, that somehow the wayback machine really grabbed the copies old copies straight from Healthcare Advocates' website.

      Perhaps they don't understand that "Wayback Machine" is just a name.
      It's not actually a time machine.

      --
      Advanced users are users too!
    6. Re:Robots.txt? by afidel · · Score: 2, Insightful

      Of COURSE it's the webmasters responsibility to opt out! If you put something up on a publically available website then you can expect the public to consume and possibly retain copies of your work. Heck failing to opt out and then suing is like suing a library because they have an old copy of your book on hand. It's no big secret that there are websites out there which use spiders to download, store, and analyze the vast majority of the internet, and that your site will be included in such efforts unless you purposly decide to disallow such use through the commonly agreed upon methods.

      --
      There are 4 boxes to use in the defense of liberty: soap, ballot, jury, ammo. Use in that order. Starting now.
    7. Re:Robots.txt? by RealityMogul · · Score: 3, Informative

      Larger images aren't cached on the archive servers, so they'd go to the real server. Most likely the original images weren't there so they started getting a flood of 404s and started investigating the problem.

    8. Re:Robots.txt? by SillyNickName4me · · Score: 2, Insightful

      Because that is the nature of this beast called the WWW. Yeah, I know about copyright and such, and no, I am not one of those who believe information should be free always, but I do believe that when you publish toa medium that has sharing and caching and linking at its core, then you cannot blame others for your publuication being shared, cached and linked to.

      Think before you publish etc.

    9. Re:Robots.txt? by aug24 · · Score: 2

      A present (not missing) robots.txt file which didn't include a rule for those pages might imply permission to cache...

      --
      You're only jealous cos the little penguins are talking to me.
    10. Re:Robots.txt? by SillyNickName4me · · Score: 2, Insightful

      I guess you don't mind me scanning in your book that you've written and distribute it over p2p?

      There are more differences between books and webpages then the fact that the first are usually made of sheets of paper and the second are not.

      Wanting the advantages of the WWW while retaining the control that paper publications give you is like wanting to eat your cake and have it.

      There is this thing called 'reasonable expectation'. In the case of the WWW the reasonable expectation to have is that publishing something on it means it is out there, will be copied, linked to, shared, archived etc. Don't like it? then don't use the WWW.

    11. Re:Robots.txt? by Looke · · Score: 2, Insightful
      True, robots.txt has nothing to do with this. You have no right to publish my copyrighted content.

      The copyright holder is the only one who can give such permission, for example via a copyleft license (GPL, CC, etc.).

    12. Re:Robots.txt? by slavemowgli · · Score: 4, Insightful

      Concludent behaviour. If I go to a doctor and get an injection, can I come back six months later and sue the doctor because he did not explicitely ask for permission to give me that injection? Well, I can true, of course, but I won't get far, because when he said "I'll have to give you an injection" and I didn't say no but instead rolled up my sleeve so he could give it to me, he was allowed to conclude that I was OK with it, even if I did not explicitely say so. IANAL, but I personally think the same principle should apply here. There is a standard mechanism for limiting access (in the sense of not authorizing it, that is, not as in making it technically impossible) - namely, robots.txt exclusion -, but if you chose to not use it, then the fact that you are running a *public* webserver that has the *sole purpose* of handing out its information to *everyone* who asks for it should be enough to conclude that you are, in fact, OK with not only the fact that people do receive your information, but also with the fact that they use it - no matter whether that means reading it (like a regular user would), indexing it (like a search engine would) or archiving it (like the Internet Archive *and* just about any search engine would).

      --
      quidquid latine dictum sit altum videtur.
  3. Cached by donnyspi · · Score: 4, Funny

    Better sue everyone who has visited the website in question but never purged their temporary internet files folder.

  4. Other archives by erykjj · · Score: 3, Insightful

    Would that make archived newspaper editorials, TV reports, etc. illegal as well? Google beware.

  5. Lookng forward by AtariAmarok · · Score: 3, Insightful

    Looking forward to newspapers filing similar frivolous lawsuits against libraries for maintaining old copies of the papers in their collections; copies that newspaper company might be embarassed about now.

    --
    Don't blame Durga. I voted for Centauri.
    1. Re:Lookng forward by aussie_a · · Score: 3, Interesting

      Again, not comparable (but this didn't stop you from getting modded up of course). The libraries had permission to buy the papers and allow access to them in the first place. Internet Archive had no such agreement with this company. IA took the absence of them saying no as an implicit agreement, which for pretty much anything else, isn't legal (it hasn't been tested yet with websites and caches). They in fact, did say no. But a bug caused this message not to be delivered/it was ignored some of the time.

    2. Re:Lookng forward by Ninwa · · Score: 2, Interesting
      "The libraries had permission to buy the papers and allow access to them in the first place. Internet Archive had no such agreement with this company."
      You make it sound as if the internet archive archived pages that required authorization. All pages they authorized were available to the public at that point in time, therefore no contract is required. IANAL so correct me if I'm wrong, obviously their lawyers would say that I am, but I think this lawsuit is frivilous.
    3. Re:Lookng forward by aussie_a · · Score: 2, Insightful

      All pages they authorized were available to the public at that point in time, therefore no contract is required.

      Wait, if I make something available online, then ANYONE can grab my content and place it on their own website (possibly making it so I lose revenue in some cases) if I don't have a robots.txt file? Wow. I can't believe everyone here thinks that's okay.

    4. Re:Lookng forward by aussie_a · · Score: 5, Insightful

      Having a public website is implicitly allowing anyone to read/view what you've made available.

      But NOT to redistribute it.

    5. Re:Lookng forward by aussie_a · · Score: 2, Insightful

      why shouldn't we be able to use it for non profit work?

      You can. Just don't distribute it without permission. Don't like it? Tough. That's the law.

      Or maybe if I watch public broadcast television and then write a transcript on my blog, is that a violation?

      Actually, one person I know has been forced to take down their website because it hosted transcripts of a tv show. Summaries are covered by fair use. IA isn't a summary by the way.

      If I give away a trillion copys of my book, can I sue the guy who sells it on ebay? Its not a perfect example, but it's close.

      Does right of first sale apply websites? If so copyright on the internet is meaningless. Which I'm sure would make many people here very happy (although I wonder how many of them make a living from their website).

      Here's another example, if a copy released a press release, and I post it on my blog, can I now be sued for using their content without their permission?

      AFAIK, press releases are made by the author to be redistributed. Websites aren't.

      If its public, it should be free. If you didn't want it to be free you shouldn't of been giving it away for free.

      Does that mean I can abuse your copyright and write my own book and have your characters in it? Copyright owners have many rights. One of those rights is the ability to say what happens with their work. That work does stop at what someone does in their private home, but it does cover redistribution. Just because YOU don't want it to, doesn't mean it copyright doesn't cover it.

    6. Re:Lookng forward by poopdeville · · Score: 2, Insightful

      Having a public website is implicitly allowing anyone to read/view what you've made available.

      But NOT to redistribute it.


      But redistribution can be a fair use of copyrighted materials too, such as in libraries or archives.

      --
      After all, I am strangely colored.
    7. Re:Lookng forward by Pofy · · Score: 2, Insightful

      >I know that the argument is flawed (you try
      >writing something better as soon as you wake
      >up), but the point remains that there's a
      >fundamental difference between redistributing
      >and archiving.

      But they DO redsitribute it, you can search their archieve and access it and its content. That is not archieving.

  6. Instead of sueing them.... by ID000001 · · Score: 3, Interesting

    ....why not just ask them to take them off?

    1. Re:Instead of sueing them.... by Conspiracy_Of_Doves · · Score: 4, Insightful

      Because that would be UnAmerican(tm)

  7. Library by Pablo+El+Vagabundo · · Score: 3, Insightful


    Wouldnt this reference site be covered under some of the same protections as a library. It serves some of the very same purposes.

    Hopefully this falls flat.

    I wonder where the server are locations

    Pablo

  8. Lawsuits these days... by akadruid · · Score: 4, Insightful

    Lawsuits these days sound more like people whining like spoiled brats than someone really done an injustice.

    They publish the thing, person X stores it, person Y uses stored info to prove they publish it. So what? If they'd written the thing in a newspaper they would sue someone for keeping the newspaper?

    Huh

    --
    "Those who cast the votes decide nothing; those who count the votes decide everything." (attrib. Joseph Stalin)
  9. Let's compare this to other media by wallykeyster · · Score: 2, Insightful

    Would a lawsuit be considered if instead of a cache of web pages, the other side had used old newspapers from the library or VHS recordings of an old television broadcast? Once they've put their web pages into the public, don't they lose control of who keeps a copy?

  10. God damnit by colonslashslash · · Score: 4, Insightful
    I don't know about you guys, but this whole "sue anything that moves" culture is really starting to piss me off.

    I'm not saying that legally they don't have a legitimate case, but is it really necessary to persue an organisation such as the Internet Archive over something so passive as this? In my opinion, hell no it isn't.

    --
    She's built like a steak house, but she handles like a bistro....
    1. Re:God damnit by Illserve · · Score: 3, Interesting

      It's going to get worse before it gets better. Our culture is being forced to confront issues of privacy and information ownership that have previously laid under the radar only because violating these issues was inconvenient or expensive.

      But the internet is changing that, and now an errant picture or snippet of text can be reproduced and distributed widely for practically zero dollars.

      I think eventually we'll settle on some kind of bubble of privacy concept, in which anything inside is legally protected, but anything you distribute outside that bubble is fair game for anyone, forever.

      This is generally the case in the real world. If someone wears clothes, they effectively have created a privacy bubble, only allowing limited information about themselves to be distributed (via reflected light) to be seen by others. But what information they do allow to escape is fair game for distribution in photographs.

      In a sci- fi series (Neverness et al), Zindell argues that in the future, even identity will be as carefully concealed in public as one's privates. As information technology saturates our culture, even revealing our identity in public is going to be increasingly dangerous.

      Of course DRM advocates will try to attach little bubbles of limited privacy to specific bits of content released into the wild. Eventually, I hope, common sense will prevail and such ridiculous notions will be abandoned.

    2. Re:God damnit by _LORAX_ · · Score: 3, Interesting

      Actuanlly no..

      The courts have held that things not plainly visible ( plainly being not obvios to a human at a reasonale distance or public place ) are illegal to disiminate. Like when you turn on night vision during the day. It captures IR and translates it to B&W, the problem is that our body reflects more of it than our clothes do giving all clothes a semi-transparent look. The courts have held that even though they were recourding in public they violated the privacy of the people taped. This doesn;t mean that all IR captures in public are illegal, but when it's specifically used to reveal information about a person that is not plainly visible it might be a crime.

      The courts have also held that augmention of senses cannot be used as an excuse to break the 4th ammendmant. Cops can only use items that are plainy visible to initate a search on a private residence. This president was set after they used heat signatures to get warrants for pot growers ( because of the grow lamps used ). Remeber that with technology today you can basicly see movement and hear speech through walls.

  11. summary is incorrect by paulbd · · Score: 5, Informative

    The archive is being sued by Health Advocates, not the legal firm that had defended Health Advocates. In fact, the legal firm is named in the suit as well.

    And to clarify: its not a simple "you have our stuff stored on your systems" claim. Rather, Health Advocates is claiming that the archive failed to follow the instructions in robots.txt that were intended to prevent access to historical material.

    1. Re:summary is incorrect by kevmo · · Score: 5, Informative

      HealthCARE Advocates is suing, not Health Advocates. There is a trademark case of Healthcare Advocates (plaintiff) suing Health Advocates (defendant). The legal firm defending Health Advocates digged up the old archive. HealthCare Advocates, the plaintiff, got desperate and is suing the legal firm and IA probably in order to try to exclude whatever evidence the defense legal firm dug up.

      I guess you were trying to be informative, but in this case it makes a big difference as to which company is doing the lawsuit. Its the plaintiff, not the defendant.

    2. Re:summary is incorrect by Prothonotar · · Score: 2, Informative

      To be even more nit-picking, it's Health Advocate (singular, not plural) and Healthcar Advocates (plural).

      --
      "Every man is a mob, a chain gang of idiots." - Jonathan Nolan, Memento Mori
  12. What? They have evidence? Sue them! by div_2n · · Score: 4, Interesting

    They got caught with their pants down and now are suing because someone kept the evidence. Boy do I hope this lawsuit meets a swift and decisive end in favor of the Internet Archives.

    To be candid, I'm surprised it took this long for someone to sue them.

  13. Huh? by Virak · · Score: 2, Insightful

    So, they want to make their webpage freely available to the entire world, but they don't want people to download the pages? Make up your fucking mind, if you're going to put something on the internet, people are going to download it.

  14. Information Extracted by inkdesign · · Score: 5, Informative

    ..on at least two dates in July 2003, the suit states, Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages.

    For the "I don't wanna rtfa because its early" crowd.

    1. Re:Information Extracted by Stalyn · · Score: 4, Informative

      you forgot,

      In so doing, the suit claims, the law firm violated the Digital Millennium Copyright Act, which prohibits the circumventing of "technological measures" designed to protect copyrighted materials. The suit further contends that among other violations, the firm violated copyright by gathering, storing and transmitting the archived pages as part of the earlier trademark litigation.

      and

      Even if they had, it is unclear that any laws would have been broken.

      "First of all, robots.txt is a voluntary mechanism," said Martijn Koster, a Dutch software engineer and the author of a comprehensive tutorial on the robots.txt convention (robotstxt.org). "It is designed to let Web site owners communicate their wishes to cooperating robots. Robots can ignore robots.txt."

      William F. Patry, an intellectual property lawyer with Thelen Reid & Priest in New York and a former Congressional copyright counsel, said that violations of the copyright act and other statutes would be extremely hard to prove in this case.

      --
      The best education consists in immunizing people against systematic attempts at education. - Paul Feyerabend
    2. Re:Information Extracted by cdrudge · · Score: 3, Interesting

      For the "It's too early to think crowd"...

      How did Healthcare Advocates determin that Haridng Early was making hundreds of requests for files on the Wayback Machine? The logs would have been kept on the Wayback Machine's servers, not on anything Healthcare Advocates would have access to easily. Harding Earley would be accessing the files via the Wayback Machine's copies, not the copies that are kept on Healthcare Advocates website

  15. What about the mirror in Egypt by Anonymous Coward · · Score: 2, Insightful

    There is a Wayback machine mirror in the Bibliotheca Alexandrina. It would be very difficult for them to find any legal basis in Egypt to get this one offline.

  16. This case should last all of 30 seconds. by TractorBarry · · Score: 3, Insightful

    Assuming the judge has more than one brain cell then this case should take no more than 30 seconds and will be summarised in two sentences.

    "You published information on a public medium. Case mismissed."

    But then again this is America we're talking about.. home of the idiot lawsuit and lunatic judicial decisions so I don't hold out much hope for the triumph of reason...

    --
    Sky subscribers are morons. They pay to be advertised at !
  17. Re:God by GigsVT · · Score: 2, Informative

    The writeup says the archive is being sued by Harding et al. Then later it says it's being sued by one of those Health companies.

    I didn't even pick up on the fact there were two similarly named health care companies!

    --
    I've had enough abrasive sigs. Kittens are cute and fuzzy.
  18. I've got one word to say in response to this... by Sierpinski · · Score: 2, Interesting
  19. Re:Huh? by TedRiot · · Score: 2, Insightful

    Possibly because they are making a copy of copyrighted material and distributing such a copy or making it available for download. Is this fundamentally different from for example recording music from radio or shows from TV and redistributing them?

  20. A significant change by ivoras · · Score: 2, Insightful
    I can see that the US economics is facing a radical change... from industrial&services-based to litigation-based :)

    A century from now all profit will be gathered from suing one another about IP & copyright rights :)

    --
    -- Sig down
    1. Re:A significant change by Peyna · · Score: 2, Interesting

      I would argue it comes down to the media more than anything. See, they publish all these stories about stupid lawsuits, but make them sound like they actually stand a chance. Then they fail to mention that a few months later the lawsuit was tossed out or lost at summary judgment and fell apart.

      So what they've done (quite successfully) is make everyone think that all people do in the US is sue each other to death, but fact is that most of these lawsuits like this you hear about never go anywhere, and just end up ruining the lawyers' reputations. Then the media stories turn the public against the legal system and lawyers in general.

      When business is bad, you just pick on lawyers and things turn around it seems. It works for the media and politicians at least.

      --
      What?
  21. the bottom line by countzer0interrupt · · Score: 5, Insightful
    He said that the robots.txt file is part of an entirely voluntary system, and that no real contract exists between the nonprofit Internet Archive and any of the historical Web sites it preserves.
    Exactly right. The plaintiff is an asshat. The bottom line for publishing anything to the Web is: if you don't want it copied across the world, saved on people's hard disks (either automatically in a browser cache, or deliberately by the user), and potentially redistributed (after your initial act of publishing) for the rest of time, don't publish it to the Web. I'm not advocating the breach of copyright here - sure, I want credit of paternity for anything I put on the Web, at the very least. Pragmatically, however, I know that the Web (and the Internet at large) is a much more fluid medium. Somebody may save my webpage, copy a quote from it, download an image and use it as their desktop wallpaper, simply because they can. I can't stop them, and I'll never have proof that they did it, so I couldn't sue them if I wanted to. Therefore, I should exercise some common sense, and remember that the Web is a public medium, and if my work is so precious then maybe I shouldn't put it up there. Some web site owners want to use the power of the web to reach huge numbers of people, but they don't want to pay the price of such a fast and powerful medium. Once your words are out there, you may never get them back.
  22. Re:God by MrKahuna · · Score: 2, Informative
    Actualy, it DOES. The summary says "Internet archive, ... has been sued by the firm Harding Earley Follmer & Frailey, Philadelphia" which is false. The crazy thing is it's correct several sentences later where it says "Last week Healthcare Advocates sued both the Harding Earley firm and the Internet Archive".

    Why does Slashdot even bother with the summaries any more? They're outright wrong many times and just plain confusing and poorly written the rest. Either hire some better editors or just post the links to the original stories and be done with it. As it is, I'm about ready to delete my bookmark to this site and move on.

  23. Just waiting for the big suit... by dyfet · · Score: 2, Interesting
    ...when someone is finally sued for the "unauthorized memories" they carry with them...oh never mind, that already happened.

  24. Comment removed by account_deleted · · Score: 3, Informative

    Comment removed based on user account deletion

  25. Books vs. Internet material by sglider · · Score: 2, Insightful

    A book is a physical object, you can reference a book as long as you do not republish it in its entirety. The internet isn't a physical object, it's a collection of bytes arranged in a specific manner. It's that collection that makes it simple to take someone elses work and republish it, almost effortlessly.

    The law has the ugly job of sorting out what constitutes copyright infringement -- republishing a website, perhaps? With the internet, it has become infinitely easier to republish works in their entirety, and hence the lawsuit. If they are guilty of anything, it is not of just 'referencing' a work, it is of taking that work, and republishing it without the authorization of its author. (heh, gotta love the wordplay)

    To bring all of this to a point, it's as if I took your old book, put it into a book that talks about old stuff, and recopied everything, verbatim.

    --
    War isn't about who's right. It's about who's left.
    1. Re:Books vs. Internet material by 0110011001110101 · · Score: 2, Interesting
      it's as if I took your old book, put it into a book that talks about old stuff, and recopied everything, verbatim.

      I have to disagree with you slightly.

      I think this is more like if I were to take an old book (or collection of old books) and store them together in a single publically accessible place (hmmm like a library).

      Then those books sit there for 6 years, and someone (law firm) decides to (gasp) check out those books and use them as reference material in their suit against the people who originally published those books.

      Could the original authors of these stored books then sue the library for providing those books to the public?? (lets assume the robot.txt issue has been resolved, and the library hasn't posted these books illegaly).

      --
      Don't anthropomorphize computers: they hate that.
    2. Re:Books vs. Internet material by aussie_a · · Score: 2, Insightful

      I think this is more like if I were to take an old book (or collection of old books) and store them together in a single publically accessible place (hmmm like a library).

      Then those books sit there for 6 years, and someone (law firm) decides to (gasp) check out those books and use them as reference material in their suit against the people who originally published those books.


      More like, you placed six copies in the library, the library made a duplicate of each copy, you took away the six copies (thinking those were the only ones that would be distributed) and then the library continued to distribute the duplicates it made.

  26. YOU put it out there... by Mindcry · · Score: 2, Insightful

    Seriously, if you don't want something to see something, THEN WHY DID YOU PUT IT ON THE INTERNET TO BEGIN WITH???

    but no worries, its all cool cause we just found an excuse to pull the lever on the american justice jackpot ;)

  27. Turn on the shredder! by hhghghghh · · Score: 5, Insightful

    This is a case where a plaintiff of an action (that they probably lost) is sueing opposing council for using the internet archive looking for old documentation that is used as evidence against its claims. In effect, they're claiming that because they had a robots.txt any page that might have been on the internet archive was there illegaly, and shouldn't have been used as evidence.

    In effect, they're saying "we were wrong, we tried to destroy the evidence of our wrongdoing, but because the shredder jammed and you found the evidence anyway, you're abusing our copyright".

    The court hearing their argument should thoroughly smack them. Perhaps they should be brought to justice for trying to destroy evidence (or instructing a third party to do so), surely that's illegal in these post-Enron days.

    1. Re:Turn on the shredder! by iainl · · Score: 2, Informative

      It's not even that. The robots.txt wasn't in place until the previous court case started.

      What they're actually suing the Wayback Machine for is failing to see that there was now a robots.txt in place and so purge their entire archive history for the page.

      Tragically, search-engine advisory information files have yet to develop time travel. This is somehow Wayback's fault.

      --
      "I Know You Are But What Am I?"
  28. The write up is indeed, bollocks! by @madeus · · Score: 4, Informative

    Sorry, the writeup is bollocks. It says:

    "The Internet archive, which has been storing snapshots of millions of webpages since 1996 has been sued by the firm Harding Earley Follmer & Frailey, Philadelphia."

    and also:

    "Last week Healthcare Advocates sued both the Harding Earley firm and the Internet Archive".

    So to believe the write up, they are being sued by BOTH parties.

    However, it says, in TFA:

    "... John Earley, a member of the firm being sued, said he was not surprised by the action, because Healthcare Advocates had tried to amend similar charges to its original suit against Health Advocate, but the judge denied the motion. Mr. Earley called the action baseless, adding: "It's a rather strange one, too, because Wayback is used every day in trademark law. It's a common tool."

    Christ knows where the idea they they are being sued by the firm Harding Earley Follmer & Frailey came from.

    Doesn't anyone else read the stories first? o_O

  29. Healthcare Advocates E-Mail by MrNonchalant · · Score: 2, Insightful

    info@healthcareadvocates.com

    Be gentle, they might be in the right after all.

  30. Legal precedents ?. by Gopal.V · · Score: 2, Insightful
    > Why would a missing robots.txt imply that others are allowed to distribute the content?

    It should be treated the same way trespassing for unfenced property is treated.

    The case should be dismissed as it reproduces verbatim with attribution content that was published for public bot scraping.

    Now what, will someone sue Yahoo ! or Google for caching pages or converting PDFs to HTML ? Or Coral Cache for unauthorized reproduction of websites ?.
    1. Re:Legal precedents ?. by squiggleslash · · Score: 2, Interesting
      Alas copyright law and laws of tresspass are essentially two different branches. Analogies don't really work because different parts of the law are quite deliberately designed to work in different ways. It'd be like me comparing putting peanut oil into a car to reading pornography from a floppy disk, or me comparing your analogy to the one I just made.

      Here's the deal, and it's not very good. If the Wayback Machine doesn't have permission (implied or otherwise) to archive websites and serve copies of them, it's technically breaking copyright law except in a very small number of cases. I believe in some cases, if you fail to assert your copyright and (and yes, there's an "and" in there) you distribute your content to all-comers for free, it's considered public domain. Asserting your copyright is as simple a matter as putting a copyright notice on your content. I've heard of, albeit third hand, by word of mouth, and IANAL, cases in some juristictions where leaflets pushed in mailboxes without some form of copyright notice were considered public domain.

      That's the best I can think of in terms of defenses for the IA. The IA doesn't honour expiry dates on webpages (if it did, it'd be useless.) It doesn't quote small portions in the context of a review.

      So why hasn't it been more widely sued? Well, I think it's largely because (a) most people consider the Wayback Machine to be an invaluable public service, including most of the websites whose content they archive. and (b) because the Wayback Machine has an honourable record of removing content whose owners don't want displayed. And given (a) and (b), the costs of litigation, the fact that it doesn't appear (to me) to make any money from the operation (and so, as I understand it, is guilty of a civil offense only), people are reluctant to sue.

      My personal opinion? The law needs to be changed to protect groups who do exactly this. This is one of many areas where copyright law needs to be diluted in order to remain credible. If people performing what is obviously a public service, who do make best-efforts to honour the wishes of those who do not consent to be a part of what they're doing, need to worry about the legality of doing so, the law is wrong and liable to fall into disrepute.

      --
      You are not alone. This is not normal. None of this is normal.
  31. two words: due process by brokeninside · · Score: 2, Insightful

    It's a constitutional guarantee, at least in the US.

    The thinking runs that any wrongs that are created by illegally obtained evidence is outweighed by the wrongs that would result from abuses that would ensue if illegally obtained evidence was allowed to be used.

  32. The Archive faces a lot of potential problems... by millennial · · Score: 4, Insightful

    ... if they lose this fight.
    For example, 2600 Magazine's old web site containing a copy of the DeCSS source code is stored in the Archive. Could the Archive be held in violation of the DMCA for mirroring someone else's old site?

    --
    I am scientifically inaccurate.
  33. If there is hope, it lies with the proles? by FooHentai · · Score: 5, Insightful

    ""Day by day and almost minute by minute the past was brought up to date. In this way every prediction made by the Party could be shown by documentary evidence to have been correct; nor was any item of news, or any expression of opinion, which conflicted with the needs of the moment, ever allowed to remain on record. All history was a palimpsest, scraped clean and reinscribed exactly as often as was necessary."

    1. Re:If there is hope, it lies with the proles? by KevlarTheSleepinator · · Score: 2, Funny

      Hello, I am the legal representative of the estate of George Orwell. Posting of his work from memory on this forum is in direct violation of the copyright on the book 1984 and as such we are prepared to take every legal action possible to ensure that your infringement does not go unpunished. Expect a court order in your mailbox by the end of the week.

      --
      Move Sig, for great justice.
  34. The obvious explanations are just too many to list by mrRay720 · · Score: 3, Informative

    The problem is that by allowing illegally obtained evidence, you are officially and legally endorsing criminal activity.

    Who wins in a criminal vs criminal case? Would police officers be forced to fall on their own swords - ie commit criminal acts to gain evidence - on order of those above them? It also gives the thumbs up signal to vigilante justice too.

    Oh, let's not also forget that it'd put the idea of court orders, seartch warrants, the right to be innocence until proven guilty (admittedly already fading), and a whole host of other rights in their graves.

    There's just so much wrong with the idea of allowing illegal evidence, I'm surprised when anyone asks why it's wrong.

  35. Didn't anyone ever tell you? by legaleagll · · Score: 2, Funny

    Didn't anyone ever tell you, law firms don't sue people, people sue people.

  36. Re:US Justice is stupid. by kfg · · Score: 2, Interesting

    When will I be sued for remembering old stories and telling it to others ?

    http://en.wikipedia.org/wiki/I_have_a_dream

    KFG

  37. RTFA Addendum by poena.dare · · Score: 5, Funny
    The suit contends, however, that representatives of Harding Earley should not have been able to view the old Healthcare Advocates Web pages - even though they now reside on the archive's servers - because the company, shortly after filing its suit against Health Advocate, had placed a text file on its own servers designed to tell the Wayback Machine to block public access to the historical versions of the site.


    So the robots.txt was added YEARS AFTER the site had been archived. I don't think they correctly used the "no-archive-time-travel" directive.
  38. Short translation of the article by mwvdlee · · Score: 5, Insightful

    "We've lost our case based on evidence and will now be suing the organisation that provided the evidence for doing so".

    --
    Slashdot social media options: AIM, ICQ, Yahoo, Jabber and Mobile Text. Why no MySpace?
    1. Re:Short translation of the article by Fishstick · · Score: 2, Funny

      Your Honor, I OBJECT!!!

      On what grounds?

      ...that it's DEVASTATING to my CASE!!!

      --

      There is much cruelty in the universe, John.
      Yeah, we seem to have the tour map.

  39. Excuse me but... by hacker · · Score: 3, Insightful

    First and foremost, the existance of a robots.txt does not constitute a contract between the client (a web surfer/browser agent) and the server (the site hosting the content proper). Repeat that over and over. There is nothing stating that the existance of robots.txt on your server must be requested by my crawler or spider.

    Its preferred, but not required. Even so, I am free to ignore it if I want, and parse whatever links I see fit to grab. If you make the content public and I want to read that content, I'm going to get it, whether you have robots.txt in place or not.

    Secondly, has anyone taken the time to validate the robots.txt file found on the site in question? Note too that they just changed robots.txt on July 8th of this year. Did the previous version validate? Are they trying to rewrite history again? What did the old version look like?

    If there is even so much as one error, robots/crawlers are free to ignore/parse/merge/break it as they see fit. It happens all the time, and even when robots.txt is perfectly valid, many robots and crawlers ignore it anyway (msnbot and Yahoo's crawlers are two of the worst offenders here).

    But back to the first point, robots.txt is a guideline, not a rule, not a contract, and certainly not something that can be enforced. Does lack of a robots.txt file constitute the legal right to publically redistribute the content? Or store it for later review and retrieval? How do you know any of your former employees from 1996 haven't stored your entire website on floppy, one page at a time? Did they adhere to robots.txt? Did ANYONE adhere to robots.txt in 1996? It seems that there was evaluation of the Robots Exclusion Standard in 1996, but was everyone using it? Not likely.

    Microsoft Internet Explorer will certainly store the entire website for "reading offline" if you ask it to do so when bookmarking it. They don't parse robots.txt to exclude pages that shouldn't be stored locally.

    Its too bad that people need to try to erase history to prevail in litigation. This isn't George Orwell's 1984... well, at least not yet anyway.

  40. Re:If you put something on the web..... by cdrudge · · Score: 3, Informative

    By nature of copyrights, everything that you create is automatically copyrighted the instant that you create an origial work. This post has already been copyrighted by me.

    You don't HAVE to register the copyright with the Copyright Office in order to retain the copyright. Doing so though gains you added benifits in case there is a dispute. Published works ARE REQUIRED to be registered. However, just putting the website up does not count as publishing. In order to be considered published works, the work must be sold or otherwise have a transfer of ownership, or through rental, lease, or lending. They Copyright Office has even said "The reports also state that it is clear that any form of dissemination in which the material object does not change hands, for example, performances or displays on television, is not a publication no matter how many people are exposed to the work." (Source)

  41. Re:Please RTFA by CmdrGravy · · Score: 2, Interesting

    Huh ??? Is this really what they are saying:

    1) That the Wayback Machine came and archived their site sometime in 1999.

    2) Since then they have added a robots.txt file

    3) Because they now have a robots.txt file previously archived material should no longer be available

    If so that's complete nonsense.

  42. Robots.txt to become legally binding? by MrBandersnatch · · Score: 2, Interesting

    But on at least two dates in July 2003, the suit states, Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages.

    In so doing, the suit claims, the law firm violated the Digital Millennium Copyright Act, which prohibits the circumventing of "technological measures" designed to protect copyrighted materials. The suit further contends that among other violations, the firm violated copyright by gathering, storing and transmitting the archived pages as part of the earlier trademark litigation.


    Wow that is stretching things!! Ive never read the DMCA but to claim that a robots.txt file (which isnt a legally binding mechanism by any means) added to the site after the pages had been indexed had been ignored by the wayback machine was a circumvention of their copyright and a violation of that act...well Id fully expect any judge to have a good laugh at this.

    HOWEVER given how poor the US legal system is I wouldnt be suprised to hear that robots.txt gains legal status as a binding document for crawlers!!

  43. Re:The obvious explanations are just too many to l by DMNT · · Score: 2, Interesting
    Oh, let's not also forget that it'd put the idea of court orders, seartch warrants, the right to be innocence until proven guilty (admittedly already fading), and a whole host of other rights in their graves.

    I disagree. In North-Europe it's usual that even the illegally collected evidence counts. Abuse of power (police) is usually much more harsher a crime.

    In Finland, there was one case where the police did an undercover operation to known drug seller. Too bad that at the time they didn't have rights to buy drugs undercover, resulting two officers charged and convicted of drug trading. Even more, the seller got an easier sentence because he was interpreted to be selling the drug out of request made by officers. Especially it was bad because it was planned. This in effect circumvents the "guilty man walking because of technicality" cases.

    Besides, I see more problems with police violence in the US than I see problems with illegal evidence in the Europe. And no, I'm not trying to start a flame war.

    --
    ?SYNTAX ERROR
  44. Oops! by Marc2k · · Score: 5, Informative

    Oh man, that sucks! I guess I better turn off all caching in my browser, lest I get sued for copyright infringement, because it's storing and rebroadcasting copyrighted materials that you may no longer want me to see at later date.

    However, if you RTFA'd, you'd know that lots of IP law firms use the Wayback Machine on a daily basis, and in fact, the company suing the Internet Archive is not suing them for republishing copyrighted information. Rather, the case is that they recently placed a robots.txt file on their site that disallows viewing historical versions of the website, and the Archive is being sued because the Wayback Machine apparently ignored the robots.txt file (which, I might note is a voluntary standard, and by no means implies a contract between the two parties), which the plaintiff claims violates the DMCA. This has nothing to do with copyright violation.

    It has everything to do with robots.txt. Read.

    --
    --- What
    1. Re:Oops! by AnObfuscator · · Score: 4, Informative
      This has nothing to do with copyright violation.

      Ahem. Perhaps, if YOU had RTFA'd, you would have seen this little gem:

      From TFA:
      The lawsuit, filed in Federal District Court in Philadelphia, seeks unspecified damages for copyright infringement and violations of two federal laws: the Digital Millennium Copyright Act and the Computer Fraud and Abuse Act. (emphasis mine)

      I'd also like to point out that the Digital Millennium Copyright Act is about preventing copyright infringement.

      Read.

      Pot. Kettle. Black.

      --
      multifariam.net -- yet another nerd blog
  45. Analogies by MyLongNickName · · Score: 5, Insightful

    I've read about 500 analogies on what electronic information "is like".

    Every analogy is bad. We cannot equate electronic information with physical information of ages past. Every analogy just plain sucks.

    The reason the information age has taken off is because of the ease of transmitting, storing and copying of electronic data. These methods weren't available fifty years ago, and weren't wide spread until about twenty years ago. Trying to stuff these concepts into one-hundred plus year old ways of thinking is just useless.

    This does not mean we can't use older solutions to problems to guide us in the future. But, we need to stop shackling ourselves to old ways of thinking. The fundamental way we transmit thoughts and ideas have changed, our fundamental way of thinking about information needs to change as well.

    Does this mean "all information is free"? No. But trying to treat electronic information like a book is useless. Web sites are put out to be publicly consumed. It is contradictory to say that someone cannot cache it for non-profit purposes. Trying to reuse the "creative" parts of the web site for commercial purposes should be prohibited.

    Bottom line: Stop with the analogies. Start thinking fresh.

    --
    See my journal for slashdot ID's by year. Mine created in 2005. http://slashdot.org/journal/289875/slashdot-ids-by-year
    1. Re:Analogies by Jonny_eh · · Score: 2, Funny

      I agree!

      Using an analogy in an argument or explanation is like sneezing in the other person's face! Only slightly different in obvious ways. You get my point...

    2. Re:Analogies by tbradshaw · · Score: 4, Insightful

      But the problem with not using analogies is that our lawmakers, enforcement officers, and general populous doesn't get it. At all.

      Something completely rediculous regarding information and electronic communication comes up from the legal system or whatever, and all of us that understand the technology go "What the fuck? How could they not get this?"

      Well it's simple, they didn't understand the technology and so they used an "analogy" to find an equivalent parallel and then just treated the situation like whatever. But of course since they don't understand the technology, they pick a horrible analogy.

      E.g. Downloading music is like shoplifting. (No it's not, it's not theft.) Hackers are like sophisticated evil genius supervillians. (No they're not, those kids just changed the URL so they could see their *own* admittance results.) DRM is like a lock on the producers warehouse. (No it's not, it's like a lock on every one of *my* CD's in my own house.)

      When people don't understand somewhat abstract ideas and concepts, they make concrete analogies to try and get a general idea of it. If we try and stop making analogies and start "thinking fresh", the common people and our lawmakers just won't get it... and they'll continue to use their shitty analogies as guidelines that will turn into shitty laws. We don't get it perfect, but maybe as a collective eventually we can find something pretty accurate.

  46. Re:Please RTFA by Anonymous Coward · · Score: 2, Informative

    And yet, that's how the Internet Archive tells people to remove previously archived material.

  47. We have this one every time... by Anonymous+Brave+Guy · · Score: 2, Informative
    I'm really sick and tired of companies that have absolutely no clue how the Internet and the world wide web works putting up sites and then expecting you to never cache them anywhere.

    <mini-rant> And I'm really sick and tired of people that have absolutely no regard for how the law works copying material off the Internet and then expecting never to get sued for it, claiming some legally naive and ethically dubious justification. </mini-rant>

    Seriously, we have this discussion every time Google or the Wayback Machine or whatever comes up. Putting material on the Internet does not give up your copyright on it, place it in the public domain, grant others the right to reproduce it any way they see fit, or otherwise work differently to copyright laws as they apply to all other media. There are necessarily certain implied rights, but arguing that actually ripping someone else's material and then making it publicly available after they've withdrawn it from their own site is a pretty big stretch to anyone without a vested interest.

    Before anyone shoots back the inevitable responses about information wanting to be free, not controlling the flow, yada yada, please stop and think for a minute. A lot of the useful content on the web is made available by volunteers or companies who don't expect to profit from it immediately, but whose future business may be damaged if the information is taken and republished by others. Many of these people will just stop putting information on the web at all (see Slashdot discussions passim) if you abuse the access, and that doesn't benefit anyone.

    In fact, limiting the rights of others to distribute your works in order to encourage you to make them available is exactly what copyright is for, and this sort of case is a textbook example of why the principle matters.

    --
    If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
    1. Re:We have this one every time... by Dr.+Evil · · Score: 5, Insightful

      Oddly, the Internet Archive honours robots.txt, so if you don't want people to surf your archive, you can just post their robots.txt file and it will block everything, even into the past.

      I would say that caching and archiving are so well understood to be part of the Internet that posting a web page and not expecting it to be archived or spidered is absurd. In other words, by posting their site to the web without a robots.txt, they knowingly published it in a medium which contains facilities for archiving and later redistribution.

    2. Re:We have this one every time... by ciscoguy01 · · Score: 4, Interesting

      Seriously, we have this discussion every time Google or the Wayback Machine or whatever comes up. Putting material on the Internet does not give up your copyright on it, place it in the public domain, grant others the right to reproduce it any way they see fit, or otherwise work differently to copyright laws as they apply to all other media. There are necessarily certain implied rights, but arguing that actually ripping someone else's material and then making it publicly available after they've withdrawn it from their own site is a pretty big stretch to anyone without a vested interest.

      Actually there is a simple principle here.
      The supreme court has ruled that directories cannot be copyrighted if the information they contain is purely factual in FEIST v. RURAL TELEPHONE, 1991
      An example is the telephone book, those are all facts and that was what the case was about.

      The wayback machine could be called a directory of old web pages, cached as they existed at the time. Facts.
      Thus protected from copyright claims.

      Well, there's their defense. It would be kind of fun to argue!

      In any case it looks like the wayback machine needs a couple hundred mirrors. Heh.

      --
      .
    3. Re:We have this one every time... by CausticPuppy · · Score: 3, Informative
      Oddly, the Internet Archive honours robots.txt, so if you don't want people to surf your archive, you can just post their robots.txt file and it will block everything, even into the past.

      From TFA:

      Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages.


      So it appears that the basis of the lawsuit is that the robots.txt was NOT honored. The plaintiff claims that the robots.txt is a "contract" and that the wayback machine violated the contract by still allowing archived pages to be viewed in a limited number of attempts, for reasons unknown.

      However, the TFA also does mention that honoring the robots.txt is strictly voluntary and does not constitute a contract.
      --
      -CausticPuppy "Of all the people I know, you're certainly one of them." -Somebody I don't know
    4. Re:We have this one every time... by Dr.+Evil · · Score: 2, Insightful

      "unfortunately for your argument, the legal truth is that copyright protection is the DEFAULT, so robots.txt has it backwards. the fact of the matter is that to be (more) compatible with existing law, there should be an allowcache.txt, not a robots.txt."

      I would agree, but I'm arguing that archiving and redistribution is part of the medium that the copyrighted work was published in. The webmaster of the site certainly knew that the site would be archived. They would also know that robots.txt is a voluntary process.

      In other words... if they wanted to make sure it wouldn't be archived, they shouldn't have put it on the web.

    5. Re:We have this one every time... by DerekLyons · · Score: 4, Insightful
      Actually there is a simple principle here.
      The supreme court has ruled that directories cannot be copyrighted if the information they contain is purely factual in FEIST v. RURAL TELEPHONE, 1991
      An example is the telephone book, those are all facts and that was what the case was about.

      The wayback machine could be called a directory of old web pages, cached as they existed at the time.

      No. Yahoo! is a directory of webpages - that is pointers to locations of web pages in the same fashion that a phone book is a pointer towards the locations of people/businesses. (I.E. the legal distinction between a URL and a phone number can be seen as being quite sleder.)

      The Wayback Machine on the other hand stores copies of pages, not copies of their adresses.

    6. Re:We have this one every time... by 99BottlesOfBeerInMyF · · Score: 5, Insightful

      Putting material on the Internet does not give up your copyright on it, place it in the public domain, grant others the right to reproduce it any way they see fit, or otherwise work differently to copyright laws as they apply to all other media. There are necessarily certain implied rights, but arguing that actually ripping someone else's material and then making it publicly available after they've withdrawn it from their own site is a pretty big stretch to anyone without a vested interest.

      Actually, while they do not give up any copyright, there are a number of explicitly stated, legal uses of copyrighted materials and there is a great deal of public benefit to enumerating a few more of them. Can you honestly argue it is not in the public's best interest that a historical archive of the internet exists, for educational reasons if no other? This case should be a poster child for just such legislation. A company published something, lied about it, and are now suing the people who made a copy and proved their guilt. Are you saying it is in the best interests of society that copyrights be used as tool to promote lies and censorship?

      Copyright is supposed to be about one thing and one thing only, promoting science and arts. That is the only constitutional provision for its existence. If someone is copying legally obtained works into an archive for educational, historical, or non-profit uses then they are almost invariably helping to promote science and arts, and anyone trying to stop them is up to no good.

      As to the letter of the law (which is probably unconstitutional although it is impossible to prove that) you're right. The internet archive is screwed in the U.S. and many other countries. They tried to do what copyright law originally required of copyright holders and the library of congress. If a work is to copyrighted then ethically it needs to be available. That is the whole point of copyright. According to the letter of the law it is probably illegal for me to print out the receipt some e-businesses display when I buy something online. The law needs to be fixed.

      In fact, limiting the rights of others to distribute your works in order to encourage you to make them available is exactly what copyright is for, and this sort of case is a textbook example of why the principle matters.

      What? How does this limiting of the rights of others encourage them to distribute the material? They, like the majority of copyright holders these days, don't want the work to be available at all. It does not encourage them to publish it, it just gives them a way to prevent works from being distributed.

      The archive is in trouble not because the violated the intention of copyright. They, in fact, are trying to uphold the very principals upon which it is founded. Unfortunately, the laws have been changed by the corrupt and greedy to create a situation where copyright does exactly the opposite of its original purpose. This is a perfect example of copyright laws that have been rewritten being used to hold back progress and remove works from public availability. It is unethical and sickening and your implication that a businesses financial considerations should trump both the rights of our descendants to have access to our works and that they trump the the ability to find and present the truth in the courts... well it makes me want to vomit. Go to hell.

    7. Re:We have this one every time... by dubl-u · · Score: 2, Interesting

      In fact, limiting the rights of others to distribute your works in order to encourage you to make them available is exactly what copyright is for, and this sort of case is a textbook example of why the principle matters.

      Really?

      I agree that content creators should have some limited rights that allow them to profit from their content, as that encourages the creation of more public content.

      But in this case, nobody it saying that they wanted to publish this content later for profit. The plaintiffs intentionally made something public to the entire world, and went to some trouble and expense to do so. Now they want to pretend it never happened because the facts have become inconvenient.

      What the Internet Archive does may or may not be technically legal, but it's certainly in harmony with the spirit of copyright law. When one publishes books, one is obliged to give a copy to the Library of Congress so that it remains on the permanent record.

      Personally, I think the Library of Congress should just fund the Internet Archive and bless the project with their special powers of copyright exemption. Failing that, Congress should make legal this sort of non-profit archiving of public material. Copyright is the right to reasonable profit from your creative efforts, not the right to manipulate the historical record.

    8. Re:We have this one every time... by mattOzan · · Score: 4, Funny
      However, the TFA also does mention that honoring the robots.txt is strictly voluntary and does not constitute a contract.

      Shouldn't that be "...the f***ing TFA article also does mention..."

    9. Re:We have this one every time... by DerekLyons · · Score: 2, Informative
      That's a ridiculous distinction.
      No, it's an important distinction - and one that does not rely on calling a tail a leg. One item is a pointer towards content, the other is a copy of the content. These are two very different things at every level.
      Those pages were accessable on the internet when the archive crawler archived them.
      So? That doesn't destroy the rights of the owners of content over that content. The CNN coverage of the Discovery launch I am currently watching is publically available, but even if I were taping it, I don't have the right to then make copies available to third parties. This is basic copyright law, well supported by precedent.
      They existed at that time for anyone to view. You can't take it back.
      Thats an assumption (read 'wishful thinking'), not a fact.
    10. Re:We have this one every time... by mysidia · · Score: 2, Interesting

      Putting material on the Internet does not give up your copyright on it, place it in the public domain, grant others the right to reproduce it any way they see fit,

      Putting material publicly visible on the Internet is a decision to distribute it, just like selling a book is a decision to distribute the book.

      A difference is that when printing a book, a limited number of copies are made, whereas when you put something on your web site, you have produced a theoretically infinite number of copies.

      Now what you have posted publicly is a matter of record and not only can be copied but ought to be, to ensure continued availability of the information.

      The Internet Archive is an electronic equivalent to a library where old works are preserved, the difference is, of course, since the public Internet is inherently a medium where infinitely many copies are made (which differs greatly from the physical world), there is an appearance that the Internet Archive is totally redistributing a work... ...in fact, they are just lending it out, lending out one of their infinitely many copies of the material

      Just like if you send someone an e-mail, you have given sent an unlimited number of copies of the message, because such copies will be made every time they launch their e-mail client and open the message... they can do this as often as they like, and every time they do this they have found a new copy of that which you sent them.

    11. Re:We have this one every time... by 99BottlesOfBeerInMyF · · Score: 4, Insightful

      Those are just the first few examples that come to mind, but the significance is clear: just because some information was available somewhere at some time, that doesn't automatically means there's a benefit to society to preserving that information in an obvious place for all time.

      The answer to problems with information like drafts and trade secrets being public knowledge after being published is simple, don't publish them. If you don't want people to read drafts of unfinished works, don't publish them online. You do realize copyright law, even today in theory, insures that all copyrighted works are to be preserved for the public and given over to the public for all time once it expires right? And how many better authors would we have today if we did have Shakespeare's drafts to look at to help understand his writing process?

      I'm going to skip your constitutional arguments, because copyright is an international convention, and most of the world isn't subject to your constitution. Can we agree the more neutral definition that copyright exists to promote the creation and distribution of works for the benefit of society?

      Most copyright law in the world is pretty similar to that in the U.S., but fine lets ignore the U.S. constitution. Lets talk about natural versus artificial rights. Freedom of speech is in my opinion a natural right. Copyright is, in my opinion an artificial right, granted as part of an agreement between authors and those who would benefit from said authorship. Authors are rewarded for giving works to the public with the rights to make money. What advantage does a copyrighted work that is not available to the public give to the people who are giving up their natural right to copy it freely?

      Your position is illogical. We're talking about material that has already been made available. If it's a work of value, then probably it was removed because the copyright holder was going to distribute it via some other means, or was working on a newer, better version and didn't want the out-of-date material getting in the way. If it's not a work of value, then there is little public interest to be served in preserving it, particularly if doing so causes any harmful effects to the parties involved.

      And here is where your argument falls apart completely. You're making a whole slew of assumptions here, most of which are not true. First you're putting responsibility for deciding what is and is not of value to the public into the ahnds of the copyright owner (note in most cases this is NOT the author anymore). Next you're assuming that not only will the copyright owners know what works are valuable to the public, but they will act in the best interests of the public rather than in their own best interests.

      You do realize that the vast majority of copyrighted works including art, literature, film, and music are completely unavailable to the average person right? About .05% of all copyrighted books are still in print and maybe 3% are still available either new or used. The same holds true for music. This is mostly because so many works are copyrighted, but no one knows who holds that copyright, or because the large companies that own millions of copyrights don't want older works to compete with current offerings. Is it in the best interests of the public as a whole to have no access to the majority of our artistic, music, theatrical, and literary heritage? How many great works are in those collections, that will never be seen ever again because the last copy is lost and it was illegal for anyone to make more except some company who did not see the profit in it?

      If you remove copyright...

      I never said anything about removing copyright, only reforming it. For example it used to be that every copyrighted work in the U.S. had to have two good copies sent to the library of congress to be archived for reference and to preserve the work for future generations. Sound familiar? If that law was still in effect

  48. Clueless Lawyers by Winkhorst · · Score: 4, Insightful

    And if you printed out the site, would they want to sue you for "reproducing" that site? Along the same lines, would someone want to sue you because you kept a book you bought 10 years ago and the author had written a new version? This all smacks of this "on demand" nonsense and self-destructing media and even shades of Orwell's 1984 where the Ministry of Truth modifies ancient history when it suits their purposes. This is all part of an attempt of corporations with the complicity of the legal establishment to place absolute control of all media in the hands of said corporations. Which all leads to the fact that it's time for the Congress to enact a Corporation Control Act that would finally put a leash on these rabid idiots.

    --
    "Is this Winkhorst a nova criminal?" "No just a technical sergeant wanted for interrogation."
    1. Re:Clueless Lawyers by Jah-Wren+Ryel · · Score: 3, Insightful

      Which all leads to the fact that it's time for the Congress to enact a Corporation Control Act that would finally put a leash on these rabid idiots.

      However, knowing the people in Congress, such a "Corporation Control Act" would not serve to control corporations. Instead it would give ultimate control of the country to corporations.
      It's all in how you read the title.

      --
      When information is power, privacy is freedom.
    2. Re:Clueless Lawyers by bnenning · · Score: 2, Interesting

      However, knowing the people in Congress, such a "Corporation Control Act" would not serve to control corporations. Instead it would give ultimate control of the country to corporations. It's all in how you read the title.

      Actually, it doesn't matter what the original intent was, the end result would be as you describe. See regulatory capture.

      --
      How to solve most of our problems: 1.Lots of nuclear plants. 2.Cure aging.
  49. Opinion: by xnot · · Score: 2, Interesting

    Anything posted on the web should automatically be in the public domain. The physical act of viewing a web page requires me to download its contents to my computer. That means the website in question is volunteering content for me to download (or at least view). Maybe if I'm a content provider, I have a right to be angry if someone uses that content to impersonate me, or whatever. But otherwise I must understand that I've just put the content on my readers computer- I have no real control at that point over what the reader does with that content.

    You know, it's funny. The web used to be mostly about free organizations offering up content. Then companies figured out that people like the web, and so they started jumping on. Unfortunately though, they don't seem to want to play the web as it was meant to be played. The web was not designed to support copywrite controls, and I can't understand why companies constantly expect that it does.

  50. Re:The library clause by dtfinch · · Score: 2, Informative

    The Internet Archive has received two DMCA exemptions from the US Copyright Office, but only for archiving copy protected software. I don't think they needed one to archive the web.

  51. Who has the right right to store store windows? by NetSettler · · Score: 4, Interesting
    almost everything (there have been a few mistakes) posted on the Web that is publicly accessible was put there to be seen!

    Yes, but the difference is that under copyright law, I can require that you not make copies of my site just so that you continue to come to my site to get updated versions of things.

    I do this routinely with my technical papers exactly because I know they will be updated. They're usually available for people to view and read and link to, but I ask people not to make copies elsewhere and technically that "request" is enforceable under copyright law.

    this suit is like complaining about a "for sale" sign in a store window being photographed, saved for years, and later viewed

    Now it's true that fair use allows some things to be copied for certain reasons. And, curiously, I think the need to copy for a lawsuit might stand up. But copying the entire of everything everywhere in anticipation of something being needed for a lawsuit sounds to me to be a questionable thing. To stretch the analogy one further: making a complete repository of photos of all store windows, almost as a workaround for the fact that those store windows were not directly accessible for use. That doesn't sound like fair use to me. One of the fair use criteria is about the totality of the work, and while it's undefined how that comes into play in each instance, it's clear that the amount of bulk matters here.

    Personally, I was initially uncomfortable with the Internet Archive, and I continue to be of mixed feelings about it. I think it serves a huge historical interest. However, in the nearterm it has some ill effects that run counter to the copying/distribution/presentation laws of copyright and may need some correction.

    I might think it reasonable if

    • the internet archive were allowed to make, but not immediately publish, a complete record
    • they could immediately sell 404 protection and wayback search tools and other such things if both they and the affected site wanted (since that's voluntary on the part of the copyright holder)
    • they had to hold off on full view for, say 25 or 50 years, or perhaps, the number of years copyright runs

    The disturbing part is that legal term of copyright seems to continue to lengthen with time. I'm a big fan of copyright as a form of personal control for authors to get income from their works, but copyright must lapse after some part and it is already well exceeding what I think is reasonable in that regard, with the trend looking to extend indefinitely as rich copyright holders influence congress to extend every time, say, Mickey Mouse comes into jeopardy.

    --

    Kent M Pitman
    Philosopher, Technologist, Writer

    1. Re:Who has the right right to store store windows? by Artfldgr · · Score: 5, Informative

      There are several law firms in the NY city area that pay to have every sidewalk and store front and such filmed on video... they then send that video into the state.... now when a person trips on a bad sidewalk they can get the case to court! i know.. you say WTF.. but its pretty simple. say there is a big upheaval in the sidewalk.. you trip, and try to sue the city for not maintaining its property, etc... (i am making this simple, there are all kinds of better examples but this is simpler). the city though will tell you and so will the courts that the city is not responsible. why? because you cant prove negligence. negligence is willfull, and not knowing there is a crack is not negligence. and here is the rub. being told that you have a problem and then ignoring it till something happens IS negligence. so in the past the lawyer would have to sepeona the cities records to see if someone reported the issue, if so, then great for the client, if not, their plum out of luck. so when the legal firm sends in the tapes, they are reporting the state of every block in that area... the city not looking at the tape that would define all the bad areas is negligence since now they DO have a method of seeing the problems and are ignoring them.. and voila, you now win cases that you couldnt before... so given that there is precident on such (and that store windows, especially in manhatten, are copyrightable, given that they are artistic displays!) my friend say i should have been a lawyer. :)

    2. Re:Who has the right right to store store windows? by NetSettler · · Score: 2, Interesting

      There are several law firms in the NY city area that pay to have every sidewalk and store front and such filmed on video...

      And while I have heard that some places/scenes have tried to assert a copyright claim on the "view of themselves" such that photographers wanting to photograph their public facing elements had to pay for that right, I think in general there is not a presumptive claim of copyright control over one's storefront view (though maybe some lawyer, which I am not, will assert otherwise). What seems to me different on the net is that there is a presumptive claim of copyright on the storefront.

      In defense of the idea that there could be a legitimate distinction, let me observe that it may be necessary for mere locomotion to get from point A to B in the real world by going through some point C. And to explain or document or comment about that route from A to B through C, pictures make sense for a variety of reasons not related to the purpose of the site. The presence of the storefront is in fact a secondary use of the space, the primary use of which is "for the public to be there". It is the storefront taking advantage of a public thoroughfare, not vice versa. In cyberspace, the "space" occupied by a website is not "in the way of" getting from place to place, and hence it is not a secondary use of some other space. So it makes actual philosophical sense that these should not follow the same rules. (It doesn't, of course, follow that the rules ought be of any particular nature. It merely doesn't follow that you can show the rules are right in cyberspace by showing they are long-accepted in the real world.)

      --

      Kent M Pitman
      Philosopher, Technologist, Writer

    3. Re:Who has the right right to store store windows? by Artfldgr · · Score: 2, Informative

      the tapes in order to be valid are made with street names recorded and such... so the report would have to include the location.. so your assumption that its not reasonable to search them all (outside of normal maintenance), is correct

    4. Re:Who has the right right to store store windows? by IgnoramusMaximus · · Score: 2, Insightful
      But there is no overwhelming public interest in robbing the original author of at least a period of time during which he/she can seek reasonable compensation.

      There is an overwhelming interest indeed. For many reasons. Such as 90% (or close) of literature, music or movies published are complete, utter, useless, crap which should never seen the light of day and only did because some vast marketing organizations sought to sell donkey manure wrapped in shiney packaging to the sheep known as "consumers". Or that writing is supposed to be art, and not an industrial process, and as such it is supposed to be sponsored by wealthy patrons, voluntary donations and art foundations. If you are a "technical" writer, you are supposed to be doing it under auspicies of academia or technical organisations whose members are financing you. Copyright, Software Patents (and soon storyline patents) and similiar attempts at treating information as if it were capable of being "private property" are perversions of logic, law and artifacts of pure greed. Greed stronger then common sense, science and morality. An all-encompassing greed which threatens to strangle all progress and destroy the humanity itself.

      Your reference to the "starving spouse" with fingers "worked to the bone" is a classic propagandist device designed to evoke sympathy for the "poor writer" who is toiling to manufacture yet another piece of mind-vomit in order to "score it big". Tough luck. If you are an artist and what you do is art, find a way to finance it. World could use far fewer inept amateur "artists" and more dedication and quality from those who remain. In case you did not notice, all the art before the age of copyright and even long after (as copyright did not apply to music and paintings for a long time) was produced this way. I'll take Shakespeare, Plato, Da Vinci and Mozart over the likes of Rolland and Britney any day. If you are in it for the money, screw you, find real work and stop lobbying for laws which attempt to rape us all for your benefit. I already consider anyone who thinks Intellectual Property laws are "beneficial" to be either confused beyond hope or a vicious enemy of humanity whose only agendas are his ego and wealth. In either case a mortal enemy of mine.

  52. Punish the offender by nuggz · · Score: 2, Interesting

    If obtaining evidence illegally is acceptable, what ensures it was properly secured and documented? Why not just forge it?
    Part of legal evidence gathering is ensuring you have a trail to prove it is valid.

    Secondly what rights are trampled in the cause of getting that evidence?
    By making illegally obtained evidence invalid you encourage proper behaviour. If the legal evidence is used to convict a killer the polic will get off with a slap on the wrists despite any crimes they may have committed.
    If illegally obtaining evidence could cause that same killer to get off, you can bet the police would be extremely careful to ensure that the evidence is collected properly.

  53. For some reason all that comes to mind by mcc · · Score: 3, Insightful
    For some reason all that comes to mind when I hear the reasoning behind the filing of this lawsuit is "Liar, Liar".
    JIM CARREY: I object!
    JUDGE: On what grounds?
    JIM CARREY: It's devastating to my case!
  54. Sue this by fluor2 · · Score: 2, Funny

    I demand that Slashdot will remove this comment after 3 days! Or else I'll see you in court!

  55. Did you even read the article you linked? by Safety+Cap · · Score: 3, Informative

    From the article:

    During the case it was discovered that McDonald's required franchises to serve coffee at 180-190 degrees Fahrenheit (82-88 degrees Celsius). At that temperature, the coffee would cause a third-degree burn in two to seven seconds.

    Testimony by witnesses for McDonald's revealed that:

    • consumers were not aware the coffee was so hot that there was a risk of serious burns
    • McDonald's did not warn customers of this risk
    • they could offer no explanation as to why there was no warning
    • McDonald's did not intend to reduce the heat of its coffee

    ~.

    Documents obtained from McDonald's also showed that from 1982 to 1992, more than 700 people were burned by McDonald's coffee with varying degrees of severity.

    [Emphasis mine]

    Frivolous Lawsuit? Hardly.
    Excellent Spin-doctoring on McDonald's Part? Absolutely.

    --
    Yeah, right.
  56. The problem by phorm · · Score: 2, Insightful

    Legal decisions are often based on previous decisions, and at this point they are comparing "similar" (though not the same) situations in the physical world with those in the digital world.

    In terms of public domain, visibility, and various other terms the analogies aren't bad. It's quite similar to patent cases... just because you do something on the internet doesn't make it unique (which makes for a lot of dumb patents), and really taking an electronic 'snapshot' of a publicly visible webpage shouldn't be any different than taking a picture+photocopy of a physical notice/bulletin/sign/etc in a publicly visible location.

    If they'd published the same information on a sign in their front lawn... what's to seperate it from the e-Version other than the fact that one is paint and the other is bytes?

  57. In Other News by Compulsion · · Score: 2, Funny

    The IP law firm Dewey, Cheatem, & Howe is suing Akashic Inc. for 1.436e17000 violations of US and International Copyright Law.

    Akashic Inc is accused of "rampant distribution of every single piece of copyrighted material concievable throughout the history and future of humankind."

    Timothy Leary from Akashic Inc. commented that "I have no doubt that we will emerge triumhant in no less than 99.9% of all possible universes in which the case is currently active."

  58. Usenet parallels by Anonymous+Brave+Guy · · Score: 2, Informative
    I wonder if there have been suits over the Google, formerly DejaNews, archive of Usenet.

    As far as I know, there hasn't been any definitive court case anywhere on the basic concept of Usenet archives. They at least have some kind of defence, because you inevitably expect Usenet messages to propagate beyond your control, and you expect that services providing Usenet feeds will charge money to their customers in exchange for relaying the information. IMHO, for Usenet archives it's more a question of whether permanent storage is a reasonable expectation for which permission is implied by posting, which can at least be argued reasonably either way (e.g., it usually isn't and it's common expectation that messages will disappear after a few days vs. the technical standards not saying anything about necessary expiration and considerations of increased cheap storage space at service providers compared to when Usenet was first running).

    I'm pretty sure that at least one business that reproduced Usenet via the web and added those annoying automated keyword-linked ads on top of someone's posts has been screwed for it in court, though; IIRC, they were found to be publishing a derivative work without permission. I've come across at least one techie forum that was abusing many posts I've made to a programming newsgroup this way, which I did find inappropriate (they are generating ad revenue purely from distorting words I wrote, even advertising compilers in a post whose whole point was that you shouldn't write code depending on a specific compiler!), so I don't have much sympathy. If anyone can remember the case that established this one, I'd appreciate a reference.

    --
    If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
  59. Good! Sue them offline by arrowman · · Score: 2, Funny

    The Internet Archive is the worst copyright infringement in the history of mankind. It's about time someone stops them. (Don't you dare mod me as troll, flamebait or funny! I'm dead serious.)

  60. Outdated, I'm glad we're challenging it. by Paradox · · Score: 3, Insightful
    You can. Just don't distribute it without permission. Don't like it? Tough. That's the law.
    Not to interject the topic into this discussion, but, the plaintiffs in the case are doing this because they want to hide information that is potentially damaging to their current revenue stream in a court of law.

    Not exactly the kind of people you want to be defending. The fact that copyright law can be used this way suggests it is broken. The fact that it was created before our modern information economy was formed also suggests it is broken and in need of revision.

    Does right of first sale apply websites? If so copyright on the internet is meaningless. Which I'm sure would make many people here very happy (although I wonder how many of them make a living from their website).
    Credibility is not something you can easily steal, and the people who "get there first" tend to get the lion's share of credibility, even when competing against companies much larger than themselves.

    So your cheap shot about how people on Slashdot have nothing to lose is just that, a cheap shot with no substance or truth. We're not saying that copyright laws need to be entirely abolished, but they do need to be updated to reflect our modern society and information infrastructure. The fact that we're compelled to lay unenforceable law after unenforceable law down on top of copyright law in a vain attempt to keep it afloat in its current state should be evidence enough that things need to change.

    AFAIK, press releases are made by the author to be redistributed. Websites aren't.
    Excuse me? What? A public website is by its very nature meant to be redistributed. It is replicated on millions of machines per day for many different purposes. If you do not agree to at least some redistribution for your website, then take it offline because it doesn't work without redistribution.

    Sue the caching proxy servers! Sue people who use internet cache! Sue Proximitron users! Sue link-of-the-day sites because they helped people replicate the data. Wait, why not just sue everyone online, because they were party to the crime by using the same routers!

    Does that mean I can abuse your copyright and write my own book and have your characters in it?
    Nice, avoiding his point entirely. Part of copyright law is the intent with which you distribute it. This helps prevent entrapment scenarios. If you place a public site on the internet, your intent is to have it treated like a public site. This means it will be crawled by search engines, cached by proxies, linked to by interested users, downloaded for personal offline browsing, pre-cached by Earthlink and AOL services, and archived by the wayback machine.

    This is the cost of doing business on the internet. This is how it works, and how it's going to continue to work. If you are not willing to express this intent with your website, take it down now.

    Copyright owners have many rights. One of those rights is the ability to say what happens with their work. That work does stop at what someone does in their private home, but it does cover redistribution. Just because YOU don't want it to, doesn't mean it copyright doesn't cover it.
    Bear in mind that the article is about a large insurance corporation trying to deny benefits to a group of people, using copyright law as a club to beat evidence into inadmissibility. We're not talking about chinese knockoffs ruining a poor independent artist. We're talking about Yet Another Way Out for corporate America's scumbags.

    If copyright law allows this, then we need to tear it down and fix it, because I'm not willing to pay such a stiff price for a basic kind of protection that I probably can't afford to fight in court anyways.

    --
    Slashdot. It's Not For Common Sense
  61. Illegal searches and the Fourth Amendment by Brightest+Light · · Score: 2, Insightful
    I think you've just described the same thing twice.
    I respectfully disagree.

    The wrongs that would ensue if illegally obtained evidence is allowed to be used are the same as the wrongs that are created by illegally obtained evidence.

    If by "the wrongs that are created by illegally obtained evidence" you mean the results of a crime (say, the wrong created by a handgun fired at the cashier of a liquor store during the course of an armed robbery, resulting in the cashier's death); then I would argue that the wrongs are not the same at all.

    The idea of not allowing illegally obtained evidence to be used in a criminal trial is to protect 'The People' from abuses by 'The State'. Using evidence that has been illegally obtained may result in the conviction of a guilty person, except that doing so is (usually) found to be a violation of an U.S. citizen's right "...to be secure in their persons, houses, papers, and effects, against unreasonable searches and seizures".

    I believe the theory is that if the police were regularly permitted to use illegally obtained evidence in a trial, it would moot the Fourth Amendment protections of the Constitution, resulting in something awfully similar to a police state. Think of this: if a police officer knows that any "evidince" they find during the course of an investigation will be permitted in a trial, then there is no check upon their power to search and seize (illegal searches are often the reason why evidence is suppressed). If an officer knows they don't need a warrant or probable cause to conduct a search, what's to stop them from randomly searching ANY person, place, or thing, at ANY time, for ANY reason (under the guise of 'conducting an investigation', of course)?

    An innocent person may feel they have nothing to hide, but do you really want the police tossing your home at 3AM because you fit the description of somebody who committed a crime recently? What if that description was merely "black male"? What if you happen to be a black male, and the cops go digging through your home looking for a firearm and find your marijuana stash instead? Oops, now you're gonna go to jail for drug charges, and it doesn't even matter that you were nowhere near that liquor store when it was robbed. The sort of damage that such a system would do to our freedoms is far worse than the damage done to the prosecutor's case by suppressing evidence of a murder/armed robbery that had been illegally obtained.

    Furthermore, what if the police don't find anything during the search of your house, but decide that they need a conviction, so they simply plant evidence instead? Things like warrants and chain of evidence are designed to prevent such abuses, but if police do not have to follow those procedures, you can kiss that notion goodbye.

  62. Re:obvious man question (now, in a 2nd Ed.) by drakaan · · Score: 5, Insightful
    &copy 2005, by Adrian Stovall

    If that's true, we had all better be careful not to visit *too* many pages on a given website during a given day. Either that or make sure that our web browser is set to immediately flush all downloaded content once it has been rendered.

    The argument being made is that copyright is being violated, but the way the archive works might well be considered fair use, since the *only* reason it exists is for archival purposes. If having a copy of website content is illegal, in and of itself, then everyone who uses a web browser (unless they're running knoppix or something that doesn't store anything to the HD) is just as guilty as the Internet Archive.

    I hereby rescind your permission to copy any of my posts, which means that if you're reading this, you're in violation of copyright law.

    Okay, I now release my copyrighted work officially into the public domain. You're safe now.

    --
    "Murphy was an optimist" - O'Toole's commentary on Murphy's Law
  63. If robots.txt were a true access control mechnism by Nom+du+Keyboard · · Score: 2, Insightful
    If robots.txt were a true Access Control Mechnism, then it would need to act something like this:

    Spider: Hi, I'm an Internet Spider, may I access this page?

    RT: No, no, one thousand and twenty-four times NO! I will not give this page to a spider.

    Spider: Okay. How about this other page?

    If this were the case, then the only way of bypassing this mechnism -- and one that would violate the sprirt (IMHO), if not the letter (IANAL), of the DCMA would be for your Spider to not identify itself as a spider. Then it would be trying to trick an Access Control Mechnism.

    BTW, it's my on opinion that once you publish a page on the Internet for public viewing, you cannot complain if they've Time-Shifted that viewing to a later point by recording -- ur, saving -- it on recordable media. Seems to me that the plantifs are totally wrong, got caught at it by their own web-postings, and are now trying to kill the messenger.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  64. Re:The Archive faces a lot of potential problems.. by millennial · · Score: 2, Informative

    In the United States, putting any creative work into a fixed tangible form automatically confers the protection of copyright, effective from that moment. No notice or registration is required.

    That's simply not true. You have to be able to prove that you created it first, and if you want the right to be the sole receiver of royalties from your work, you have to register it with the copyright office. This isn't free, either.

    --
    I am scientifically inaccurate.
  65. This is a frivilous lawsuit by jafiwam · · Score: 2, Informative

    Archive.org has always had a good policy to removing data on request.

    They have an automatic version that allows use of robots.txt, when forbidden to crawl they go back and make the other, older versions unavailable as well. (It only works when the re-crawl happens, though I think you can initate it by going to the site.)

    Furthermore, additional requests can be made via email to remove content. The only "damage" here is that the wrong (in their opinion) law firm got ahold of the data before they could do that.

    The company suing, broke the law, got sued, got fucked, and now wants to sue to recover money due to them breaking the law and getting busted for it by going after archive.org that provided evidence in the original lawsuit. Sorry guys, you got fucked when you first stole trademarked stuff of someone elses web site. The rest of it is just sour apples. They should be charged with intimidating a witness and put in pound me in the ass federal prison for it. It's racketteering like that that gives lawyers such a bad name.

    Had they any brains, they would have employed a geek to go seek out these cached sources and remove them the first time around.

    AND the company suing the original offending company, should have used a simple entry in their HOSTS file to keep from accidentally causing requests to go to the original web server, that's simple data forensics.

    Let me tell you a story about my week in mid September, 2001. After wasting tons of time reading news I got a desparate call from a certain client (soon to be rather). Their web host was in the towers, and both server farms were demolished, along with all the backup tapes. Their site was gone. AND due to other complications they were losing customers left and right.

    I used Archive.org, Google cache and a few bits they had to reassemble the web site and get it back on line. In this case, un-pre-approved caching was critical in keeping this company from going out of business.

    There 1,000s of other systems that cache data and make it available later, Inktomi, Akami, corporate networks, those "high speed dial-up" things, my friggin open source firewall does that (Squid?). It's simply stupid to sue archive.org for that. Caching is part of the web, get fucking used to it.

    It's the webmasters damn job to know or learn all about this stuff (including caching). Slapping HTML up on some server is not the end of web managment. There's a whole lot more to it.

  66. Sue a witness? by Neurotoxic666 · · Score: 4, Informative

    Can you sue a witness because he remembered the facts against you during a trial the same way the Wayback Machine is being sued because it "remembers" old facts and saying and has been used in courts?....

    --
    You are more than the sum of what you consume. Desire is not an occupation.
  67. A better summary by Stankatz · · Score: 3, Insightful
    Here's a diagram of the lawsuits:
    xxxxxxxxxxxxx Internet Archive
    xxxxxxxxxxxxxxxx /\ xxxxx |-
    xxxxxxxxxxxxxxxx II xxxxxx \\(wrong?)
    xxxxxxxxxxxxxxxx II(2b)xxxx \\
    xxxxxxxxxxxxxxxx II xxxxxxxx \\
    xHealthcare Assholes ===> Harding Elmer
    xxx of Philly xxxxxx (2a) Fudd & Frailey
    xxxxxxxx II
    xxxxxxxx II(1)
    xxxxxxxx II
    xxxxxxxx \/
    xx Health Copycat

    I know you're all going to find this shocking, but it looks like the /. summary was wrong. The lawsuit against the IA was brought by Healthcare Associates of Philadelphia. Here's how it went down:

    1. Healthcare Associates of Philly sues Health Advocate. The law firm representing the plaintiff is McCarter & English. The defendant's is Harding et al.
    2. Healthcare Assoc. modifies robots.txt to tell IA not to allow access to older versions of their site. 3. Harding et al. manages to get the IA to give them a peek anyway*.
    4. Healthcare Assoc. sees "rapid fire" requests from Harding in their logs, and a few times the IA slipped up and granted access anyway.
    5. Healthcare Assoc. sues Harding et al. and the IA.

    *My guess is that the IA checks the current robots.txt everytime an archived page is accessed. If the server doesn't respond quickly enough, the assume it's OK to give access to the archived files. Harding et al. might have realized this and requested the pages over and over in rapid succession to slow down Healthcare Assoc.'s servers enough to trick IA into thinking they're not responding. This is all just my speculation, so take it with a grain of salt.

    (Sorry for the crappiness of the diagram. Apparently Slashdot is more concerned with preventing 13-year-olds from posting ASCII art versions of gotse man that will be modded down to -1 in 2 seconds than it is with allowing people to make diagrams to illustrate something. And why the eff doesn't the "ecode" tag work properly?)
  68. MODS: Parent is wrong. by Goobermunch · · Score: 2, Insightful

    You do realize that this is patently incorrect?

    Pardon me if this sounds pedantic, but tort law is so misunderstood that I'd like the opportunity to correct this post.

    In Common Law countries, all people have a duty to act as would a reasonably prudent person in the same or similar circumstances. A person is negligent if they breach that duty and cause injury to another.

    In other words, the city is negligent if it fails to repair a sidewalk that a reasonably prudent person would have repaired. In situations where a party lacks notice of a defect, the same analysis applies: should a reasonably prudent person responsible for the maintenance of a sidewalk have been aware of the defect.

    Here's an example: 10 minutes ago, NYC experienced a minor, highly localized earthquake that fractured the sidewalk outside my apartment. 7 minutes ago, I realized I was out of milk. 4 minutes ago, I stepped out of the building, tripped on the damaged sidewalk, and broke my leg in 12 places.

    A reasonably prudent city probably wouldn't be able to repair the damage in the six minutes between earthquake and injury. Probably, the city government wouldn't even be aware of it by then. Thus, the city couldn't be negligent.

    A more likely explanation is that the law firms are videotaping the sidewalks and sending them to the city government to put them on notice of serious flaws in the sidewalk. Then they can argue that the city was on notice of the defects and failed to act reasonably by not repairing the damage.

    But negligence does not require evidence of willful conduct. Negligence is merely a failure to act as a reasonably prudent person under the circumstances.

    --AC

    Now that you've modded the parent down, you should mod me down too.

  69. Re:MODS: Parent is wrong. by techno-vampire · · Score: 2, Informative
    But negligence does not require evidence of willful conduct. Negligence is merely a failure to act as a reasonably prudent person under the circumstances.

    That's true, now. There was a time that a specific act had to be shown, and the person specified. Over a hundred years ago, a man was injured when a loose barrel came flying out of a brewery and hit him. He sued for negligence, and won, even though nobody could be shown to have caused it. This was because the incident was so outrageous that there was no possible explanation without assuming negligence, and it established a new legal priciple: res ipse loquitur, the act speaks for itself.

    --
    Good, inexpensive web hosting
  70. RTFA--the Internet Archive is absolvable by Nukenin · · Score: 2, Informative

    From the FA (emphasis mine):

    But on at least two dates in July 2003, the suit states, Web logs at Healthcare Advocates indicated that someone at Harding Earley, using the Wayback Machine, made hundreds of rapid-fire requests for the old versions of the Web site. In most cases, the robot.txt blocked the request. But in 92 instances, the suit states, it appears to have failed, allowing access to the archived pages.

    I'm fairly certain that the Internet Archive has no control over access to Healthcare Advocates own webserver(s). I'm also fairly certain that the Internet Archive would not log access to archived web content back to the "Web logs at Healthcare Advocates". So someone at Healthcare Advocates or its legal firm is really, really grasping at nonexistent straws here, or just plain stupid/ignorant. Suing the Internet Archive because Healthcare Advocates' own webserver(s) served up outdated content that they themselves left accessible? (robots.txt is no substitute for simply removing the old files from the webserver(s)' document tree or otherwise restricting access at the server side.)

    Hopefully sanity prevails and this lawsuit is dropped. Either that or Healthcare Advocates and/or its legal representation is made a laughingstock in the courtroom.

  71. Re:The Archive faces a lot of potential problems.. by ubernostrum · · Score: 2, Informative

    That's simply not true.

    IANAL, but... actually, it is. Since 1978, copyright has been granted automatically on the creation of the work, with registration required only to exercise certain legal options such as recovering statutory damages. See Title 17 USC Chapter 3, Section 302 (a) for this, and Chapter 4, Sections 411 and 412 for the limitation on what you can do without registering. The exclusive rights granted by copyright (Chapter 1, Section 106) remain in effect regardless of registration.

  72. Re:Completely beside the point.... by eeyore-on-thorazine · · Score: 3, Interesting

    | That's akin to advocating free speech, even if it means shouting "Fire" in a crowded theatre

    The correct quote is "The most stringent protection of free speech would not protect a man falsely shouting fire in a theater and causing a panic." and it has absolutely nothing to do with the issues at play in this case. The statement is a limitation where it endangers public health and or safety, and I dont think anyone would apply those standards to this case. Similarly, Defamation has no relation to this case, and Breach of Contract in the form of robots.txt is already a bone of contention in the case.

    I'm not having this discussion about free speech. I'm having this discussion about the scope of current copyright arguments. I'm not arguing that this suit is trying to muzzle the IA in violation of the first ammendment. That would be a flawed argument anyway - the IA is an organization, not an individual, and thus not subject to constitutional protection in most cases (including this one). I'm arguing that the value proposition of copyright does not extend to preventing organizations like the IA from documenting the state and history of any publicly accessible website. It is a documentarian, and educational role, and as such, is every bit as appropriate as the library system.

    The Internet Archive is, in essence, cataloging the evolution of one of the most significant cultural phenomenons of the last century. That IS of tremendous historical significance - most especially because 80% of the content that is there now will be lost forever within the next year if they DONT do it.

    The internet is evolving at a breakneck pace. As new business models, new buisnesses, new display models, new UI elements and new languages crop up, older sites are redesigned, replaced or relegated to non-existence by darwinian market forces. What is there now bears little resemblance to what was there a year ago, and likely little resemblance to what will be there a year from now. This evolution is of tremendous social, scientific and cultural significance.

    Do I believe that documenting this growth and evolution trumps the overextended copyright argument presented by the plaintiff in this case? Yes, I do. The plaintiff had no reasonable expectation of privacy where the public website was concerned. The information was placed in what amounts to a public space, with no access controls and no barriers to public navigation. To assert then, that the IA was acting inapproriately to document the state of the public portion of that site is patently absurd.

    Now, to your other points...

    We aren't discussing Joe User's website, we are discussing a website developed and deployed by a corporation at some expense with the express purpose of giving them a 'web presence'. Many jurisdictions will now permit the argument of publication where an agency has spent time and money to make information available to an audience. The expense and resources expended to make it available to the public presuppose an intent and desire that it be seen and consumed by the public. Further, the case in question references the DMCA, and specifically, a means to bypass security measures to prevent copying. That clause of the DMCA (and arguably the whole the whole law) applies _specifically_ to published media.

    Regardless, flyers distributed in a parking lot or on a public bulletin board also carry all of the assumptions needed to carry the rest of my argument. The 'provider' of such information has no control over the future use or reference to those fliers in an educational, historical or editorial context.

    Again, free speech is only a peripheral issue here. Fair use doctrine is as much, or more a limitation of copyright as it is a defense of the first ammendent. While first ammendment protections have been used to DEFEND fair use in the past, it does not necessarily follow that all questions relating to fair use are also first ammendment challenges. At question here is the IA's right to

  73. Truth by Scroatzilla · · Score: 2, Interesting

    I've been wondering if the issue isn't simpler than all of this legal wrangling? What I mean is that, whatever has happened all throughout history, we only have 1) evidence of things through artifacts, interpreted by those who find and study them, and 2) the written word, by those who research, then try to wrap up "facts" in a coherant package.

    Technology such as the internet archive now exists to automatically, systematically, and rather thoroughly store very specific artifacts (old web pages). These artifacts happen to also be the written word. The complication is that much of that written word (that the legal system and corporations care about) is propaganda which, by its very nature, is not 100% true. What is true from a historical perspective, though, is that it existed as a part of the Internet/WWW which, in turn, is a huge part of our society and culture.

    So do we view it in the context of an accurate historical representation of a body of knowledge that existed at a given snapshot of time, which is a decent encapsulated version of "truth," which is theoretically what a good justice system should be rooted in? Or, do we blatantly use outmoded, weasel-ish legal wranglings to suppress what is, indeed, truth that is relevant to deciding a given court case?

    If we choose the latter, what does that say about the integrity of our justice system?