Slashdot Mirror


Wayback Archives as a Law Tool

Carl Bialik from the WSJ writes "The Wayback Machine's internet archive and Google's cached pages are becoming indispensable tools for some lawyers, especially specialists in intellectual-property law. Dell has used copies of expired websites to get the domain name DellComputersSuck.com transferred to it, the Wall Street Journal reports. EchoStar used Wayback in a case against a Polish TV company. Playboy checks Wayback to look for infringers of its trademark bunny or other images. And Wayback was even used to discredit a witness and reach a mistrial in a Canada murder case."

149 of 198 comments (clear)

  1. Text (Yes) Images (not always) by bigwavejas · · Score: 3, Informative

    WBM works great for pulling up historical text content, but I've noticed it tends to be hit-and-miss with images. Try pulling up a website and chances are you'll see broken image links.

    --
    "Simplify, simplify, simplify!" Thoreau
    1. Re:Text (Yes) Images (not always) by el_gordo101 · · Score: 3, Informative

      That's because they don't save the image files, as far as I can tell. The images actually point back to the site that was archived through some sort of re-direct on the WBM site. If the images files no longer exist on the original site, they will not display on the WBM archived page.

      --
      TODO: Insert witty sig
    2. Re:Text (Yes) Images (not always) by Adult+film+producer · · Score: 2, Informative

      That's because they don't save the image files, as far as I can tell. The images actually point back to the site that was archived through some sort of re-direct on the WBM site.

      I don't think that's the case, I don't think it even tries to grab the images from the server. At least with my webpage, I just clicked on the Nov 17th, 2001 archive... it had all the old images that I've long since deleted... and the server logs show no hits/404's for those images...

      Maybe that's not always how it operates though, *shrug*

    3. Re:Text (Yes) Images (not always) by Myko · · Score: 1

      And heaven forbid they actually archive and serve up externally linked css files...

    4. Re:Text (Yes) Images (not always) by cybersaga · · Score: 2, Funny

      They seemed to have captured Goatse just fine.

    5. Re:Text (Yes) Images (not always) by el_gordo101 · · Score: 3, Informative

      Interesting. Our sites date back to 1999. The images files from the older versions of sites do not show up, as these files were deleted long ago. The newer versions that use the images files that are still resident in our /images directory work fine. I am not sure how they handle images.

      --
      TODO: Insert witty sig
    6. Re:Text (Yes) Images (not always) by Adult+film+producer · · Score: 1

      would the size of the image file have anything to do with it ? Sheesh, I bet there's a simple faq on their website that answers this heh

    7. Re:Text (Yes) Images (not always) by el_gordo101 · · Score: 2, Interesting
      From the FAQ:

      Why am I getting broken or gray images on a site? Broken images (when there is a small red "x" where the image should be) occur when the images are not available on our servers. Usually this means that we did not archive them. Gray images are the result of robots.txt exclusions. The site in question may have blocked robot access to their images directory.
      --
      TODO: Insert witty sig
    8. Re:Text (Yes) Images (not always) by the_true_cirrus · · Score: 2, Insightful

      As far as I could tell from experimenting with an old site of mine WBM tends to have multiple copies of sites from different times. If an image, page or any other object is missing from one of those archived copies it will redirect to the next most recent copy of that file in the WBM and so on. If the latest archived copy still doesn't have the file THEN it redirects to the actual website.

      I've often noticed that WBM pages load very slowly and I suspect that this is partly due to all the chained redirects your browser goes through before it finally reaches a copy of an image or a definitive 404 error if it is gone.

    9. Re:Text (Yes) Images (not always) by Virak · · Score: 1

      Ahhh....that bring back memories. :P

    10. Re:Text (Yes) Images (not always) by Caecias · · Score: 1

      I didn't bother keeping a copy of our old website (from 2001), which included pictures of the construction of our new building. It turns out one of our board members was injuried in the construction zone and is suing the company. We were able to find most of the pictures on the Way Back Machine to use in our defense in the lawsuit!

    11. Re:Text (Yes) Images (not always) by coopex · · Score: 1

      You sir, are my hero. I hope I can follow in your footsteps someday by posting an on topic goatse link.

      --
      The road to hell is paved with good intentions.
  2. And this is a big deal why? by Shadow+of+Eternity · · Score: 5, Insightful

    Peaple go to the library and dig through hundreds of old newspapers and records, whats the big deal with using wayback for websites?

    --
    A bullet may have your name on it but splash damage is addressed "To whom it may concern."
    1. Re:And this is a big deal why? by ucahg · · Score: 2, Insightful

      For one its not quite as verifiable. Who is to say, for example, that someone with access to the Wayback servers couldn't put their own content and dates on there, and then use that as "evidence" for some suit?

      I don't know how (if?) its regulated, any insights into this?

    2. Re:And this is a big deal why? by TTK+Ciar · · Score: 5, Interesting

      For one its not quite as verifiable. Who is to say, for example, that someone with access to the Wayback servers couldn't put their own content and dates on there, and then use that as "evidence" for some suit?

      I don't know how (if?) its regulated, any insights into this?

      I work at The Archive. There are only two people, three at most, with the expertise and access to pull something like this off, and if someone tried Brad would almost definitely notice. There are checks in place to detect bitrot in the web archive, and altering older ARCs to include new information would be detected as bitrot and flagged for closer attention. They would then be compared against the copies in our sister organization's data cluster in Europe, and possibly also compared against the copies in the datacenter in Egypt.

      To make it work, you'd pretty much have to get Brad to play along, and he is fanatical about the integrity of the web data. I don't think you could pay him enough to do it, and he doesn't have any sons or daughters you could kidnap for blackmail.

      How one would go about demonstrating all of this in court, though, I do not know. IANAL.

      -- TTK

    3. Re:And this is a big deal why? by Kenja · · Score: 4, Funny

      Your right! We must put a stop to this movable type menace before the serfs use it to spread anti authoritarian pamplets!

      --

      "Have you ever thought about just turning off the TV, sitting down with your kids, and hitting them?"
    4. Re:And this is a big deal why? by iminplaya · · Score: 1

      They have been trying since 1710. In more recent times Sonny Bono and Disney did their part for the cause. And of course now there's DMCA. Looks like they're working pretty hard at it.

      --
      What?
    5. Re:And this is a big deal why? by Anonymous Coward · · Score: 1, Interesting

      Peaple go to the library and dig through hundreds of old newspapers and records, whats the big deal with using wayback for websites?

      Well, copyright infringement for one. Wayback's copyright infringement does lead to damages to some people.

    6. Re:And this is a big deal why? by Thud457 · · Score: 1

      When all else fails, we can post our grievences on Craigslist, print it out and nail it to the bank's front door in the middle of the night.

      --

      the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

    7. Re:And this is a big deal why? by owlstead · · Score: 1

      You could put up a time-signing server somewhere safe and let it sign the secure hashes of content along with the time. That would really restrict the possibility to mess with the archives without somebody noticing. There are even HSM (hardware security modules) which will let you do that while leaving the key in a safe space. I doubt the signature would normally hold up as a legal signature, but it would make it much harder to attack the content.

    8. Re:And this is a big deal why? by geekoid · · Score: 1

      SO the system rely's on the integrity of one man.

      That's fine, assuming he actually has integrity, and that he will live forever.

      For the sake of arguement, I believe that this Brad guy has a lot of integrity.
      Now, does he live forever? Will he be in this position forever?

      if either of those is No, then the system will fall apart.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    9. Re:And this is a big deal why? by BlogPope · · Score: 1

      This implies that the WBM goal is to be a legal and absolute reference point. What about the case where a web servers recognizes the WBM and serves it different content than the rest of the world.

      --
      My other car is a Popemobile
    10. Re:And this is a big deal why? by sconeu · · Score: 1

      All 95 grievances?

      --
      General Relativity: Space-time tells matter where to go; Matter tells space-time what shape to be.
    11. Re:And this is a big deal why? by Shadow+of+Eternity · · Score: 1

      so does the cache your web browser makes every single time you view a page.

      --
      A bullet may have your name on it but splash damage is addressed "To whom it may concern."
    12. Re:And this is a big deal why? by werdna · · Score: 1

      Maybe not conceptually, but before archive.org, there simply was no one doing this. We don't know today, which website pages today, if we could reconstruct them, would be worth $10M as evidence for a legal case. Wayback gives us the answer, because it captures nearly all the web.

      The point is that in many cases today, the contents of a website over time can be gold. I have cracked many cases this way -- it used to be my secret weapon. Now everybody knows. Darn.

  3. Not everything is archived .... by awacs · · Score: 4, Insightful

    ... WBM respects the site's decision to not allow archiving. Unfortunately, those sites who might be the most interesting know that, and know that they can block archiving.

    1. Re:Not everything is archived .... by pilgrim23 · · Score: 5, Interesting

      Another point of note: Net Nanny and Surf Watch or other such tools blcok the main sites. they do NOT block the WBM archive of goatse.cx or the like. AND THAT IS A GOOD THING!!!

      Example:

      www.copstalk.com used to be the home page for a maker of Macintosh to PC via Appletalk cross platform communications tools. They were later bought out. If you wish to look at documentation on their older products, go to the WBM. www.copstalk.com these days IS A PORN SITE.

      --
      - Minutus cantorum, minutus balorum, minutus carborata descendum pantorum.
    2. Re:Not everything is archived .... by MattHawk · · Score: 1

      Users stuck behind Websense (such as me during the school year), however, are not so lucky. Websense (admittedly, somewhat correctly) classified it as a proxy avoidance service about two and a half years ago, so no WBM for me :(

  4. In Contrast to by goneutt · · Score: 3, Interesting

    A few weeks ago /. had a bit on an Internet Archive that got sued for having material that was ordered withdrawn by a court. How can they have things both ways? I think I'm begining to understand why law school is so hard, you have to learn to think like a lawyer, which is like learning to type with your nose.

    --
    Bacardi + slashdot = negative karma.
    1. Re:In Contrast to by garcia · · Score: 1

      you have to learn to think like a lawyer, which is like learning to type with your nose.

      Well, unless the keyboard is new the keys are dirty you know...

    2. Re:In Contrast to by slashrogue · · Score: 1

      The previous /. story is referenced in the article: " Philadelphia company, Healthcare Advocates Inc., says it used a robots.txt file to block access to older versions of its site. When a law firm used the Wayback Machine to nonetheless access old material from the site, Healthcare Advocates sued, alleging computer fraud and violation of federal copyright law. In its suit, the health-care firm contends the law firm "intentionally circumvented" the robot.txt's blocking mechanism by making repeated search requests. Healthcare Advocates is embroiled in a trademark dispute with a client of the law firm, Harding Earley Follmer & Frailey."

    3. Re:In Contrast to by fritter · · Score: 4, Funny

      OMG D00D you're totally right! Some lawyers argue for one thing, while others argue for another! I just heard about this case the other week where this lawyer made a case that some guy was guilty of murder, but another lawyer was arguing the EXACT SAME GUY was innocent! I cannot believe they tried to have it both ways! Unlike every other profession in the world, where everyone thinks exactly alike!

      Seriously, how did this get modded "Interesting"?

    4. Re:In Contrast to by goneutt · · Score: 1

      If you're going to use the wayback as an archive you can sue someone for using it. It's like suing someone for reading an old phone book to get your number and address that you've since made unlisted. (Although that might be stalking) Or going the library to look up an old magazine where you get an article that embarasses someone.

      BTW, your bit isn't that funny.

      --
      Bacardi + slashdot = negative karma.
    5. Re:In Contrast to by Macadamizer · · Score: 2, Insightful

      Lawyers dont have this long term morality thing that we do

      Has nothing to do with morality -- being a lawyer is a job. A lawyer can argue one day about poor quality tools and the next day that those tools are high quality (as long as it is in a different case, that is!) because that's his or her job -- to zealously represent the interests of his or her client. Lawyers don't deal with "truth" in that way -- the truth is for the jury to determine -- so its not a moral issue of lying one day and telling the truth the next. Besides, maybe, based on the particular facts of the case, in the context of the facts the Sear's tools ARE poor quality in the context of one case and ARE high quality in the context of the other case.

      Besides, other than the paycheck and their oath to zealously represent their clients, lawyers typically don't have "a dog in the fight" -- civil cases are typically not personal, so again, it's all just another job, and you do your best for your boss, whoever that might be on a particular day...

      --

      "That's not even wrong..." -- Wolfgang Pauli
    6. Re:In Contrast to by goneutt · · Score: 1

      Civil cases are very personal. The lawyer is saying whatever will get him his cut. Just because its a job doesn't mean you do whatever it takes. It is about twisted moratlity. I know in engineering you have a code of ethics, but I'm not sure about lawyers. If their code of ethics is to argue whatever side your on today, it's not a moral code of ethics.

      Also, since IANAL, I wonder if the other side would be able to use the lawyers previous statements in the next case.

      --
      Bacardi + slashdot = negative karma.
    7. Re:In Contrast to by Macadamizer · · Score: 1

      Civil cases are very personal.

      For the parties involved, that's very true. But the attorneys themselves generally do not have a stake in the outcome.

      Just because its a job doesn't mean you do whatever it takes.

      Actually, you do have to do whatever it takes (as long as it is legal and ethical, that is) -- attorneys take an oath to do so. If an attorney didn't do everything legally and ethically possible to win his client's case, then he is not being the "zealous advocate" that he promised the state he would be when it issued him or her a license to practice law.

      I know in engineering you have a code of ethics, but I'm not sure about lawyers.

      Of course lawyers have a code of ethics -- and unlike the code of ethics for engineers, the code of ethics for lawyers is codified into law in every state in the U.S. Lawyers can lose their license or go to jail for violating ethics rules. Few other professionals face jail time for violating ethical, as opposed to legal, rules.
      For lawyers, their ethical obligations ARE legal rules.

      If their code of ethics is to argue whatever side your on today, it's not a moral code of ethics.

      Depends on how you define morals, I guess. If you believe the only way to be ethical is to always tell the truth, then maybe you have a point -- but remember, in a civil case at least, there are two sides to every story -- where the "truth" lies may not be apparent from either side. As I noted earlier, in civil trials, it is the jury, not the lawyers, who determine what the truth is based on the evidence and testimony presented.

      Also, since IANAL, I wonder if the other side would be able to use the lawyers previous statements in the next case.

      Generally not -- but that's because what the lawyer says in court is NOT testimony and usually doesn't have a lot to do with what the "truth" of the matter is. The testimony of witness and evidence presented, however, may, in certain cases, be brought up again in another trial. Usually a participant in a trial may not take a legal position if that same participant took a contrary position in a case where the facts and issues in the case were the same as before.

      However, there are lots of things that many people might not find "ethical" about these aspects of being a lawyer, and I suppose that many might find what lawyers do immoral. But being a lawyer is a job, a job that you are obligated by law to do to the best of your ability.

      --

      "That's not even wrong..." -- Wolfgang Pauli
    8. Re:In Contrast to by geekoid · · Score: 1

      If you had understood his point, you would know why it got marked interesting.

      --
      The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
    9. Re:In Contrast to by jahudabudy · · Score: 1

      Actually, you do have to do whatever it takes (as long as it is legal and ethical, that is)

      Surely lying to a judge and jury is illegal for lawyers. I mean, it is for witnesses, why would lawyers be allowed? I see your point about tools being low quality for certain situations, but high quality in different situations (MySQL is very good for a high traffic, read only web-database system, but I wouldn't bank at a bank that kept my accounts in a MySQL system), but in the vast majority of cases, completely contradicting an earlier position makes me suspicious you were lying one of the times. Saying that it was necessary to win might make you a good lawyer, but it is pretty hard to pretend it makes you a moral (or is it ethical?) person.

      --
      ...sometimes, in order to hurt someone very badly, you have to tell that person terrible lies. - PA
    10. Re:In Contrast to by InvalidError · · Score: 1

      We live in a world of double standards.

      New Windows vulnerabilities announced here lead to a rehash of all the FOSS vs closed-source jokes and other stuff while tones mysteriously get more serious when FOSS vulnerabilities are found.

      Internet archives are bad and violations of copyright which lawyers will complain loudly about when they have to face incriminating evidence coming from such services... but when internet archives contain evidence needed to close cases, they suddenly become indispensable tools.

      N-tuple standards are annoying but that is the world we are in.

    11. Re:In Contrast to by Macadamizer · · Score: 1

      Surely lying to a judge and jury is illegal for lawyers.

      Of course it is. In fact, here's the relevant statute i California:

      Professional Conduct, Rule 5-200

      In presenting a matter to a tribunal, a member:

      (A) Shall employ, for the purpose of maintaining the causes confided to the member such means only as are consistent with truth;

      (B) Shall not seek to mislead the judge, judicial officer, or jury by an artifice or false statement of fact or law;

      (C) Shall not intentionally misquote to a tribunal the language of a book, statute, or decision;

      (D) Shall not, knowing its invalidity, cite as authority a decision that has been overruled or a statute that has been repealed or declared unconstitutional; and

      (E) Shall not assert personal knowledge of the facts at issue, except when testifying as a witness.

      completely contradicting an earlier position makes me suspicious you were lying one of the times.

      And that would be true, if the two factual situations were identical, or nearly so. However, in the overwhelming majority of cases, there are specific facts that, when viewed in the context of all of the issues and facts in the litigation, would support a position that would appear, on its face, to contradict a position taken in an earlier case. Most cases are distinguishable from another when looked at carefully, even if they look the same after a brief glance.

      --

      "That's not even wrong..." -- Wolfgang Pauli
  5. Employers are using Google too by October_30th · · Score: 4, Insightful

    Maybe this is a bit off-topic, but employers are also known to use Google and web archives to check up on the past of a potential employee. So be careful what kind of statements you make on the net using your real name.

    --
    The owls are not what they seem
    1. Re:Employers are using Google too by op12 · · Score: 4, Informative

      It's like that CNN article (I think it was posted here) from a few days back:

      Bloggers learn the price of telling too much

    2. Re:Employers are using Google too by mattsucks · · Score: 1
      It's like that CNN article (I think it was posted here) from a few days back:
      Please don't let that stop you from posting it again, this _is_ /. after all.
    3. Re:Employers are using Google too by op12 · · Score: 1

      Well then maybe I should submit it for the main page :)

    4. Re:Employers are using Google too by Ratbert42 · · Score: 1
      When we hired a few months ago, I Googled (and looked at past employers, domain registries, etc.) for the 5 serious candidates and got significant results for two of them. They both looked much worse as a result of their online rants and other activities.

      This is exactly why I use this and other disjoint usernames. I have managed to get almost all of my old usenet postings pulled from DejaNews / Google Groups. Only one of my 3 blogs has my name on it and my website's essentially blank, though I'm close to putting up a nice vanilla inoffensive site just to promote my resume.

      Of course, you could turn the tables and use it to plant information. Make blog posts about how hard you worked this quarter, the big sale you closed, etc. Or even fake a background like making Usenet posts with a forged email address. Or whole fake company web sites listing you as CTO.

  6. I wonder... by Bryansix · · Score: 2, Insightful

    I wonder if people will try to sue website owners for content that they already pulled off of thier website. I mean I would hope not but I could see how this could happen. A person realizes that certain content is copyrighted and then pulls it and later on some lawyer for the owner of this content sues and uses google cache or WBM as a tool to prove it posted copyrighted material.

    1. Re:I wonder... by Peyna · · Score: 3, Insightful

      Just because you stopped doing something, doesn't mean it wasn't illegal while you were doing it.

      See, if I am beating the crap out of you, but stop before the police get there and witness it, that doesn't mean I wasn't beating the crap out of you and therefore guilty of battery.

      It's a weird example, but it works.

      If you've ever read some of the RIAA threat letters you'll notice they specifically state that just because you listen and pull down the offending material doesn't mean they're giving up their right to sue you for posting it in the first place.

      --
      What?
    2. Re:I wonder... by Bryansix · · Score: 1

      Right, I know that the website commited a crime in the past but they didn't know it was a crime at the time so it's not the same as beating the crap out of someone. I now the RIAA puts that clause in thier threat/cease and desist letters but I doubt they sue unless they have non-compliance or they know the people knew it was illegal to do what they were doing.

    3. Re:I wonder... by Iriel · · Score: 1

      In a sense, it's already happened.

      http://yro.slashdot.org/article.pl?sid=05/07/01/13 55234&tid=123&tid=95&tid=155

      This one just relies on taking an old post out of context to 'shed light' on a current situation.

      --
      Perfecting Discordia
      www.stevenvansickle.com
    4. Re:I wonder... by Peyna · · Score: 2, Informative

      Right, I know that the website commited a crime in the past but they didn't know it was a crime at the time so it's not the same as beating the crap out of someone.

      Ignorance of the law is no excuse.

      I can't believe I actually have to point that out to someone.

      --
      What?
    5. Re:I wonder... by Bryansix · · Score: 1

      It's not an ignorance of the law but rather a lack of due diligence to make sure the material was not copyrighted. This could happen fairly easily especially if other people have copied the material before and even claim it to be thier own but then say it is public property.

      Plus I would hope that most companies would be more concerned about removing the content then for seeking damages which usually would be very limited and subjective.

    6. Re:I wonder... by walt-sjc · · Score: 1

      Lets say person A posted copyrighted info on your web site. You didn't know it was copyrighted at the time. Person B tells you that it is and you pull the content. Copyright owner (C) later sues your ass since they found it on the WBM.

      Make sense now? Even if you are NOT ignorant of the law, you can still be screwed.

      Just the process of defending yourself can bankrupt you (that is if you had enough $$$ to mount any kind of defense at all in the first place.)

    7. Re:I wonder... by Peyna · · Score: 2, Interesting

      Lets say person A posted copyrighted info on your web site. You didn't know it was copyrighted at the time. Person B tells you that it is and you pull the content. Copyright owner (C) later sues your ass since they found it on the WBM.


      It's copyright infringement, whether or not you know the material is copyrighted, or have reason to know.

      Copyright law does provide for reduced penalties in the case where the "infringer was not aware and had no reason to believe that his or her acts constituted an infringement of copyright."

      So, the law has in essence created a duty to users of documents to determine if they are copyrighted prior to distributing.

      --
      What?
    8. Re:I wonder... by Peyna · · Score: 1

      I don't think it would be fair to say that someone should be allowed to record everything you say out loud and then provide it to someone to use it against you in a court of law.

      If you said it in a public place, it's fair game. The Internet is a public place for the most part.

      The rules are a little different from criminal liability, but if you say something in public, it's public, and you can't do much about stopping others from using it, even in court.

      --
      What?
    9. Re:I wonder... by finkployd · · Score: 1

      It's not an ignorance of the law but rather a lack of due diligence to make sure the material was not copyrighted.

      Yup, and you can get in just as much trouble for that.

      Plus I would hope that most companies would be more concerned about removing the content then for seeking damages which usually would be very limited and subjective.

      I would hope that too. Sadly, it rarely works out the way we hope. Most content companies these days are more concerned about "making an example" and hoping that example will act as a deterrent.

      Finkployd

  7. It also got wayback sued. by Peyna · · Score: 4, Interesting
    --
    What?
  8. Infinite Loop by kensai · · Score: 4, Funny

    If Google cached pages from the WBM and the WBM archived Google cached pages, wouldn't that cause an infinite loop. j/k

  9. One easy workaround... by DJ+Rubbie · · Score: 5, Informative

    $ cat robots.txt
    User-agent: ia_archiver
    Disallow: /

    My site is not archived there, problem solved.

    (Of course, if another of these service pops up...)

    --
    Please direct all bug reports to /dev/null
    1. Re:One easy workaround... by HyperChicken · · Score: 1
      (Of course, if another of these service pops up...)
      Isn't that what <META NAME="ROBOTS" CONTENT="NOARCHIVE"> is for?
      --
      Free of Flash! Free of Flash!
    2. Re:One easy workaround... by baadger · · Score: 2, Interesting

      What about the HTTP/1.1 header "Cache-Control: no-store"? Although the rfc description of what it does seems rather confusing to me.

    3. Re:One easy workaround... by Motherfucking+Shit · · Score: 1

      You both miss the OP's point. Archive.org respects robots.txt files that ask them not to spider a site. Other archival services may not be so friendly, regardless of whether you're using a META tag, an HTTP header, or a robots.txt file.

      --
      "BSD: Free as in speech. Linux: Free as in beer. Windows 10: Free as in herpes." --Man On Pink Corner in #52607549.
  10. How do you think he does it? by zippthorne · · Score: 1

    Is his name Tommy?

    --
    Can you be Even More Awesome?!
    1. Re:How do you think he does it? by Donut2099 · · Score: 1

      He has crazy flipper fingers.

  11. So perhaps censoring the archive is wrong? Signed? by expro · · Score: 4, Informative

    Destroying evidence? If I don't want to be caught and ask for older web pages to be removed, that may contain incriminating evidence such as illegal copies of things or illegal links, is this different from a request by any other copyright holder to have his pages removed, and can it be punished? What are the archives retention policies, and have legal orders been served to prevent destruction of evidence?

    What would be even better would be if the archives digitally signed their archives and kept signatures even of those things that had been asked to remove so that the validity of a copy could be established if made for legal purposes (SCO, Scientologists, and other things come to mind) even if later censored.

  12. Hey Mr. Peabody! by DigitalReverend · · Score: 4, Interesting

    Stand back Sherman as we set the dials on the wayback machine to 1845....

    Seriously, theres been many times I would want to kiss the person running wayback. I lost my home a few years back and had several websites that I lost because I hosted out of my house. I have been able to rebuild, or come fairly close to duplicating those original sites.

    As for lawyers, if there wasn't somebody already archiving all these sights, they'd get someone to do it for them and then it would not be accessible to the public. I guess we need to take the good with the bad on this.

    --
    I read Slashdot for the headlines, because the headlines, unlike the articles, are usually original and never duplicated
    1. Re:Hey Mr. Peabody! by Thud457 · · Score: 1

      Jorn is that you?!!!

      --

      the preceding comment is my own and in no way reflects the opinion of the Joint Chiefs of Staff

    2. Re:Hey Mr. Peabody! by Jherek+Carnelian · · Score: 1

      I lost my home a few years back

      You should have tried harder to find it, whenever you lose something, it is always in the last place you look.

  13. What about copyrights? by donleyp · · Score: 1

    I wonder how this relates to copyright laws. If I write a disertation on my blog, copyright it, and publish it, wouldn't it be a copyright violation for WBM to republish it for profit?

    --
    You got any karma man? I really neeed it. Just a little hit! Come on!
    1. Re:What about copyrights? by jeepnut · · Score: 1
      WBM doesn't charge for this use if I am not mistaken (haven't taken a look at WBM for a while). Look at WBM as a sort of library in a sense. Although are doing is keeping an available archive for public use. The same thing as someone holding a million old editions of a newspaper (sort of like microfilm).

      If WBM starting charging for the service, it might get messy, but considering they could attempt to justify it as charging for the service not the information, they could probably avoid copyright infringement. Essentially, copyrights and the like protect the owners interest in profiting from the information among other things. If in fact, WBM does not profit but is merely maintaining an archive, they will survive a threat of copyright infringement much the way a library would.

    2. Re:What about copyrights? by walt-sjc · · Score: 1

      OK, you may look at WBM as a library, but IS it? Even libraries don't have the right to violate copyright AFAIK. Google also got itself into trouble for caching things from the Church of Scientology and others IIRC...

    3. Re:What about copyrights? by drooling-dog · · Score: 1
      wouldn't it be a copyright violation for WBM to republish it for profit?

      Not so sure. Is it a violation for a library to circulate a book after it's gone out of print? Or for a bookstore to sell one?

    4. Re:What about copyrights? by donleyp · · Score: 1

      Actually, that is not the case. Copyright laws protect the IP from any copying even when you're giving it away.

      And to answer another poster's question regarding libraries keeping out-of-print books in their collection: In the case of a book no copy has taken place. You can even resell the book without infringement. It is the act of copying that violates the copyright.

      --
      You got any karma man? I really neeed it. Just a little hit! Come on!
    5. Re:What about copyrights? by Ngwenya · · Score: 5, Informative

      OK, you may look at WBM as a library, but IS it?

      Yes. It really is. It's a registered member of the American Library Association. Details on http://www.archive.org/about/about.php

      It's an honest to God library, which also means that Section 108 of the USC on Copyright applies. Public libraries in the US (and here in the UK) have some pertinent exemptions to the copyright restrictions that bind us mere mortals.

      --Ng

    6. Re:What about copyrights? by donleyp · · Score: 1

      But aren't Libraries still required to obtain the license (e.g. buying a book) legally before they can excersize thier unholy rights under "Section 108 fo the USC"?

      --
      You got any karma man? I really neeed it. Just a little hit! Come on!
    7. Re:What about copyrights? by Jherek+Carnelian · · Score: 1

      But aren't Libraries still required to obtain the license (e.g. buying a book)

      If downloading information specifically made public for the purpose of downloading isn't "obtaining the license" then I don't know what could be.

  14. My favorite use of Archive.org by hackwrench · · Score: 1

    http://web.archive.org/web/*/http://sonicblue.com/ > before Dec 2, 2000. I miss that site! Slashdot totally messes up the url though.

    http://sonicblue.com/>http://web.archive.org/web/* /http://sonicblue.com/

    The first is by the url method, the second a href. Get them on it pronto.

  15. But see, they signed a peice of paper by Mr+Guy · · Score: 4, Insightful

    In the article, it mentions one of the archive's technicians signing an affidavit saying they think it's a true archive. No one would ever lie about that for a big corporate payout.

    1. Re:But see, they signed a peice of paper by sholden · · Score: 2, Informative

      They might. But if the other side can provide some evidence that it isn't a true archive than said technician is in deep shit.

    2. Re:But see, they signed a peice of paper by sholden · · Score: 2, Interesting

      And?

      That the priginal point isn't a problem was my point - that it was sarcasm is irrelevant to what was expressed.

      Determining who to believe is what courts do. The technician swears under oath that the archive supplied hasn't been tampered with. The other side can argue the testimony is untrustworthy or that the technician can't know. It's no different than a doctor saying "yes those X-rays are of the person in question and were taken on this date".

      Or a police officer saying "He ran a red light".

      Yes it's not foolproof. People can lie. People can be paid off. But it's the system as it stands - and the opposing side gets a chance to discredit the person making the claims and also to provide evidence that contradicts the claims.

    3. Re:But see, they signed a peice of paper by PakProtector · · Score: 1

      No, they're not. The Technician is merely saying, to the best of his knowledge, it is a true archive.

      In court, a witness can not tell the truth and still not be commiting perjury. Haven't you ever seen or heard or read a transcript of a trial where one attorney will ask a witness, "And to the best of your knowledge?"

      --

      Edward@Tomato - /home/Edward/ man woman
      man: no entry for woman in the manual.
      "Qua!?"

  16. Huh? by Seumas · · Score: 1

    Most companies send a cease and disist. So if they have to use the wayback machine to find infringements (because people are not CURRENTLY infringing), then there's no order to cease and disist to be sent, yes? Or are they going to drag your ass to court and sue you for something that you stopped doing already, even though the Internet Archive is CURRENTLY doing it (with your material AND the infringed party's material).

    1. Re:Huh? by SilentShriek · · Score: 1

      A good point. I would think that if copyrighted content is to be archived for retrieval, they would need express permission from the owner.

  17. Childish Grudges by SilentShriek · · Score: 2, Insightful

    The childish nature of these corporations is ridiculous. Looking through archives of up to nine years just to point out: "Hey, you said we suck!" Who cares.

    If Dell did not suck, they would not have to be so defensive.

    1. Re:Childish Grudges by droptone · · Score: 1

      I was with you until the last part. That "If Dell did not suck, they would not have to be so defensive" reasoning, well...sucks. Just because you respond to someone making an accusation does not mean you're admitting that they are speaking the truth. Yes, it is more mature to not respond but as a business I don't think they would consider this an option. I agree that the laws need to be reformed so DellComputersSuck.com is not able to be sued by DellComputers, but that is another issue.

      It is actually this train of thought to means Dell HAS to respond to this sort of thing. If you can make outrageous claims against something and have people believe the claims merely because you made the claim...well, then we are a sad lot indeed.

      --
      Every post I make begins with the assumption P=~P.
    2. Re:Childish Grudges by SilentShriek · · Score: 1

      If you can make outrageous claims against something and have people believe the claims merely because you made the claim...well, then we are a sad lot indeed.

      Well put. My final statement was a bit sarcastic--no actual reasoning involved.

      I do think it is unfortunate that resources are spent chasing after such ventures. If your product is decent and fits into a certain market, it shouldn't matter what anyone has to say about it, although I understand it does--I would hope that people would see an outrageous claim and find out the facts for themselves, if they give it any credence at all.

    3. Re:Childish Grudges by shawb · · Score: 2, Informative

      Well, the DellComputersSuck.com website actually DESERVED to be sued. Is this a free speach issue? Not really. They were selling another brand of computers, so trademark infringement would actually come into play here.

      --
      I'll never make that mistake again, reading the experts' opinions. - Feynman
    4. Re:Childish Grudges by droptone · · Score: 1

      My apologies. I simply assumed that the website was like BestBuySucks, not another vendor trying to be cute. Thanks for the clarification.

      --
      Every post I make begins with the assumption P=~P.
  18. Sweet History! by Shadow+Wrought · · Score: 1

    Is there nothing it can't do?

    --
    If brevity is the soul of wit, then how does one explain Twitter?
  19. Proof and Evidence : The 'secret' to wining a case by markpapadakis · · Score: 3, Interesting

    Sometimes being able to see in the past is more valuable than being able to see in the future. It makes sense for lawyers to try to find ways to look back, for evidence and proof can only be found in the past.

    If you have evidence, you can prove your claims. If you can prove your claims, you win a dispute. If you win the dispute in favor of your client, that makes you one good lawyer.

    --
    Technology ramblings : Simple is Beautiful
  20. False info being planted by up2ng · · Score: 3, Interesting

    In the future what will stop someone or some entity from falsifying information knowing that the legal system will use this.
    How can the info in wayback.org or google be trusted ? You can make redirect pages based on googlebot or wayback that have nothing to do with what is really on the site.

    In the article it is mentioned that vodaphone.com was taken by a squatter and they used wayback to show that her intentions were "intended to misleadingly attract consumers" ,what if she had WBM's bot goto a "nice" page and not the real site that case could have gone differently.
    I think that if someone wants to they can plan ahead and use this in a nefarious fashion.

    --
    Success is not the result of spontaneous combustion, you must set yourself on fire.
    1. Re:False info being planted by Krach42 · · Score: 1

      That's why you can selectively enter evidence.

      Their case would have gone forward just the same, except that they would have avoided showing that page.

      If the person were to bring it up in an attempt to say, "hey, this wasn't nefarious!" then they would attempt to prove it discredible.

      The same person was able to do the same to the WBM. Show us that the page that it's claiming that you were displaying at this date isn't what you really had up there, and the jury can decide.

      --

      I am unamerican, and proud of it!
    2. Re:False info being planted by shawb · · Score: 2, Informative

      That's why the archive's administrators signed an affidavit stating that the information was, to the best of their knowledge, not tampered with. And it would be up to the other lawyers to prove that you were falsifying the information, which could lead you into further trouble or at the very least remove any doubt of your guilt from the jury and show that you were indeed acting in bad faith.

      --
      I'll never make that mistake again, reading the experts' opinions. - Feynman
  21. The question is... by cached · · Score: 1, Funny

    Can the wayback archive be used as a tool for showing admins on /. that this article is basically a dupe?

    --
    +1 funny, -2 overrated. Life isn't fair.
    1. Re:The question is... by Dunbal · · Score: 1

      Actually that's quite dangerous. If you use the wayback machine to look for slashdot dupes, you will essentially be causing dupes of dupes, of dupes of dupes...this process will accelerate exponentially until there is a huge rift in the space time continuum somewhere in the near future...

      --
      Seven puppies were harmed during the making of this post.
    2. Re:The question is... by toddestan · · Score: 1

      Can the wayback archive be used as a tool for showing admins on /. that this article is basically a dupe?

      Well Mr. "cached", why don't you tell us?

  22. robots.txt by Cyburbia · · Score: 3, Informative
    Even if the Wayback Machine archived your site, adding an appropriate robots.txt file to your Web site's root directory will make _all_ previous archives inaccessible to the public. I discovered this by accident, after I blocked the Wayback Machine robot by accident in an attempt to control malicious spiders. After I modified robots.txt, all the old archives reappeared after a few weeks.

    I used the Wayback machine to grab thousands of messages from an old WWWBoard-based message board that I ran, for eventual conversion to vBulletin. Some years, the Wayback Machine crawled every month; others it didn't even visit. Probably 80% of the messages that were posted before 2000 are lost to the ether of cyberspace. Guess you can't expect it to archive everything.

  23. Re:Proof and Evidence : The 'secret' to wining a c by (A)*(B)!0_- · · Score: 3, Funny
    Thank you Captain Obvious.

  24. Re:School by Saeed+al-Sahaf · · Score: 1

    Many organizations that block access to sites also block the Wayback Machine. Here in DoD where I work, this is the case.

    --
    "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
  25. I like hardware sites... by Momoru · · Score: 2, Insightful

    From Apple.com circa 1996:

                New PowerBook Family
    Addressing the needs of customers in small offices, home offices, business and education, Apple announced the Macintosh PowerBook 1400 series, combining 117MHz PowerPC speed with a removable CD-ROM drive and expansion options.

    Ah 1996 was how long ago? I remember lusting after those 117Mhz.

  26. Old Drivers by TheSeventh · · Score: 4, Informative

    I was able to use the wbm last year to find some old device drivers for a no-name motherboard I had from '97. The company went out of business, and their remaining stockpiles were bought by some other Chinese company, but the wbm actually had old copies of the drivers, and even a bios update for the board. Now, I always check there when I am having a hard time finding stuff that I knew used to be around.

    /bq

    --
    Just because you're paranoid, it doesn't mean that they're not out to get you.
  27. Re:School by awacs · · Score: 1

    And when they block archive.org, what will you do then? Even stupid admins tend to grow clues from time to time.

  28. I don't know by drooling-dog · · Score: 1

    What makes him so good?

  29. Transposition by ndansmith · · Score: 1
    Wait, shouldn't it be "Internet Archive's Wayback Machine" instead of "The Wayback Machine's internet archive"?

    Or maybe this was intended: "Internet Acrhive's Wayback Machine's internet archive."

  30. time to buy google stock again by infonography · · Score: 1

    Lawyers rarely google or do research, Paralegals do it. Pretty soon it's going to trickle up to the actual lawyers who use this stuff where it's coming from. Then being rich, they will go buy loads of Google stock. If for no other reason then to keep these goodies flowing.

    --
    Sorry about the writing. Robot fingers, you know? Cliff Steele in DOOM PATROL #23
  31. Playboy by floppy+ears · · Score: 4, Funny

    Playboy checks Wayback to look for infringers of its trademark bunny or other images.

    So they're basically just sitting around surfing porn too, eh?

    --

    "If I could live to be several hundred
    I could take a walk and really wander, really wonder."
  32. Re:Put another way... by SilentShriek · · Score: 1

    Dear Mr. FlameBait:
    No need to defend myself against imbeciles who have no idea how to speak to a lady.

  33. Re:Proof and Evidence : The 'secret' to wining a c by Paradise+Pete · · Score: 1
    If you have evidence, you can prove your claims. If you can prove your claims, you win a dispute.

    +1, Duh

  34. Re:School by LnxAddct · · Score: 4, Informative

    Its been a bit since I was in highschool and had to get around filters, but the most sure fire way is to run an ssh server at home on a port that your school's firewall lets through (most let 22, but to be less suspicious choose like 25 or 443 or something) and then carry putty around on a pen drive. Whenever you need unrestricted access, pop open putty, connect to your server and create a dynamic tunnel, on that pen drive you can also have firefox and have a socks proxy set up to use port like 1080 or whatever port you choose for a dynamic proxy. There you go, unlimited, encrypted surfing all bypassing your school's filters and beening tunneled through your house. This all assumes that your school's firewall doesn't block based on protocol, but rather ports.
    Regards,
    Steve

  35. Carbon Dating by Anonymous Coward · · Score: 1, Insightful

    It would be extraordinarily difficult to forge old yellowed hardcopy of newspapers or microfiche copies that are duplicated across numerous library archives .

    Thank goodness its equally difficult to forge a few html pages and file time stamps. No-one could ever do that.

    Virtual reality indeed. I just shake my head and wonder.

  36. Someone had to do it by jolande · · Score: 3, Funny

    yuhuhpoiu sauick

  37. If only there was a Firefox extension by Rurik · · Score: 3, Informative

    Google Caching and Wayback lookups. You could easily look URLs up by right-clicking on them.

    Oh, wait, there is one.

    /shameful plug

  38. A recent situation by dangrover · · Score: 1

    One time, the administrator of WrongPlanet.net that I co-founded it with (I'm no longer running the place) put up a "mirror" of one autism site that went down. It was actually the Archive.org mirror displayed in a frame. The owner of the site was irate and rude about the whole thing, and wanted to sue for "copyright infringement". My friend changed it to a text link in a forums post to the archive.org link, and they still persisted, talking about how horrible it was, and how much pain it caused them, how we were "exploiting" them.

    A lot of people really don't understand the internet, it's crazy.

  39. disclaimer by TTK+Ciar · · Score: 3, Informative

    I do not speak for The Archive. The above post should not be considered to reflect the official position of The Archive. It is purely my own personal opinion, and it was uttered under the influence of painkillers (I had my wisdom teeth yanked out of my jaw Wednesday, qv my Slashdot journal entry). Else I probably would have refrained -- talking about this at all while there's a court case pending was probably a really stupid idea, and I (usually) know better.

    -- TTK

  40. has anyone used this for finding terrorists by asscroft · · Score: 3, Interesting

    I imagine if you searched for sites talking about 9-11 pre-9-11-2001 you'd find some interesting things. Post 9-11 there were a zillion references, but pre-9-11 there couldn't have been that many.

    Same with the london bomings, no?

    --
    because I have been enjoined by this Holy Office to abandon the false opinion which maintains that the Sun is the centre
    1. Re:has anyone used this for finding terrorists by gatkinso · · Score: 3, Funny

      I would imagine that pre 9/11/2001 very few web sites mentioned the September 11th attacks...

      --
      I am very small, utmostly microscopic.
    2. Re:has anyone used this for finding terrorists by hkfczrqj · · Score: 1

      I know you can't use WBM to read USENET posts (google is the only choice I know of)...

      I recall that on 9/11, many news outlets pointed to alt.nostradamus (or something like that). There were a couple of posts of the sort "the world is coming to an end". Of course, being this alt.nostradamus, nobody paid attention to them. The second post was in the fisrt week of september, in reply to replies to the first post. The purpose of the second post was to correct the date from "11 days from now" to "september 11". Again, nobody took serious note of that. Until 9/11. This guy replied something like "I told you so". Thousands of replies ensued: "We'll nuke your terrorist asses"...

      Funny that I can't find those posts in Google Groups. Probably I should make a new tinfoil hat.

    3. Re:has anyone used this for finding terrorists by Rurik · · Score: 1

      There actually was a kid who claimed that the world had better watch out for 9/11. He made the claim about a week prior to the incident. A lot of people were up in arms, and reported his posts, etc.

      However, it came out that he made the same claim for 9/1. He was basically combinations for 911, and when 9/1/1 nothing happened, he tried for 9/11, and something did happen.

      From what I heard he shit a brick when the attacks came. I wish I could find the posts now.

    4. Re:has anyone used this for finding terrorists by hkfczrqj · · Score: 1

      I think we're talking about the same kid. BTW, here is the ~1000 post thread:
      Google Groups Link

    5. Re:has anyone used this for finding terrorists by gatkinso · · Score: 1

      But then they would have to refer to the attacks as the "11-9 attacks"....

      --
      I am very small, utmostly microscopic.
  41. [OT: DoD] Re:School by Ngwenya · · Score: 1

    Here in DoD where I work, this is the case.

    And they don't mind you posting with the nym of the ex-Info Minister of Iraq? Hooray! A little rose of humour blooms in the darkest depths of the Pentagon's labyrinths!

    --Ng

    1. Re:[OT: DoD] Re:School by Saeed+al-Sahaf · · Score: 1

      I've often found it interesting that while many forums are blocked, Slashdot isn't (neither is K5)...

      --
      "Who are in control, they are not in control of anything - they don't even control themselves!" - Glen Beck
    2. Re:[OT: DoD] Re:School by cpghost · · Score: 1

      A little rose of humour blooms in the darkest depths of the Pentagon's labyrinths!

      What about Shub-Internet?

      --
      cpghost at Cordula's Web.
  42. logos and other images by notnAP · · Score: 3, Funny

    Playboy checks Wayback to look for infringers of its trademark bunny or other images.

    In a related story, managers at Playboy have taken note of productivity differences between John Salem, who was tasked with finding instances of people illegally using the playboy logo, and Henry Waxman, who has been looking for instances of "other images," but has been observed taking frequent bathroom breaks.

  43. To beat a dead horse... by SilentShriek · · Score: 1

    Well, why should anyone care what someone may or may not have said about a company in the past?

    Furthermore, there was no logic involved in saying Dell sucks--merely a statement of preferential opinion that is absolutely meaningless in the broader scope of things; does this mean Dell should sue me, too? Therein lies the point.

  44. I almost cried ... by rahuja · · Score: 1

    I never knew that such a thing (Wayback Machine) existed! I almost cried tears of joy to find that while I had to take down (when the hosts started charging) the sites I developed during my college years long back, almost all of it, including all changes, is there! Bwahaha!!! http://web.archive.org/web/*/http://www.thomso.net />

  45. Re:School by Spy+der+Mann · · Score: 1

    And when they block archive.org, what will you do then?

    www.archive.org.nyud.net:8090. Ta-da! :D No one will be able to block you then! Oh, wait...

  46. new Slashdot poll: by GojiraDeMonstah · · Score: 1

    How many people immediately went to dellcomputerssuck.com to see what was there?

    --
    "Stop throwing the Constitution in my face, it's just a goddamned piece of paper!" - George W. Bush Nov. 2005
    1. Re:new Slashdot poll: by Jens_UK · · Score: 1
      The answer seems to be: enough.

      Safari can't open the page "http://dellcomputerssuck.com/" because it can't find the server "dellcomputerssuck.com".

  47. Re:So perhaps censoring the archive is wrong? Sign by hankwang · · Score: 1
    What would be even better would be if the archives digitally signed their archives and kept signatures even of those things that had been asked to remove so that the validity of a copy could be established if made for legal purposes

    But how can you force the submitter of the removal request to store a copy, let alone an exact copy, from which the checksum can be calculated?

    If the submitter keeps the page it is not necessarily the same set of bytes as the removed one (think dynamic pages).

  48. So the archives are still kept even w / robots.txt by eric777 · · Score: 1
    If this guy's info is accurate, that means robots.txt just hides the archived data, it doesn't remove it.

    So if some criminal tries to use robots.txt to hide information, the lawyers can still get it, either by subpoena/discovery to wayback, or an injunction to get the guy to remove his robots.txt, at which point the data returns to wayback!

    cool!

  49. Re:Put another way... by SilentShriek · · Score: 1

    Thank you for adding some intelligence to this inane debate. Cheers.

  50. they should fund it by akb · · Score: 1

    Wow, who would have thunk that an archive of history could be useful to anyone, let alone for rich companies to use to sue?

    I think its widely recognized that the Internet Archive is general societal good, it should be funded as such.

  51. close by geekoid · · Score: 1

    you have to learn to think like a lawyer, which is like learning to type with your nose, and look good while you're doing it.

    --
    The Kruger Dunning explains most post on /. http://en.wikipedia.org/wiki/Dunning%E2%80%93Kruger_effect
  52. Quick! Someone tell Karl Rove... by Lester67 · · Score: 1
  53. One of those IANAL questions by Nom+du+Keyboard · · Score: 1

    So they're checking for past infringments. Is that the same as current infringment? I.e. even if you get a takedown notice and obey it, are you still equally guilty that once upon a time you in violation? That's what it's soundling like here.

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
  54. Canada murder case? by djKing · · Score: 2, Funny

    They killed Canada! Those bastards!

    - Peace

    --
    Free as in "the Truth shall set you..."
  55. There ought to be a Law by Nom+du+Keyboard · · Score: 2, Informative
    Requests from third parties to remove information are generally denied. The Wayback Machine makes exceptions in certain circumstances, for example if the Web pages contain personal information provided in confidence, such as medical data.

    I bet the (so-called) Church of Scientology gets everything they want pulled.

    In addition, Web-site operators can prevent material from remaining in the public domain by using a piece of computer code, known as a robots.txt file, which stops bots belonging to the Wayback Machine and regular search engines from copying pages.

    This is pretty bogus because it only works if there is still a current web-site at the spidered address that is on-line and can deliver a robots.txt file saying DON'T! It has already been proven in another case that rapid-fire multiple requests to WBM will cause it to give up pages even when robots.txt says not to.

    I see two ways to fix this problem of misuse of a valuable archive:

    1: Federal law PROHIBITING the use of evidence from the Wayback Machine in court trials. This is a valuable historical archive that will be less valuable if people worry that it can be used against them in the future in unforeseen ways, and block contributing to it. How many sites already block the WBM TCI/IP address range?

    2: WBM could simply announce that they refuse to cooperate in any future trials -- AND THEN DO EXACTLY THAT! Without them to attest to the accuracy of the retrieved data, many cases relying on that data would fall flat on their faces.

    Think for a moment. The WBM was not created to make lawyer's lives easier, and their law firms richer!

    --
    "It's the height of ridiculousness to say for those 9 lines you get hundreds of millions."
    1. Re:There ought to be a Law by mousse-man · · Score: 2, Insightful

      What I'd do is to force WBM to either become legally compliant (ie archive their data on WORM media) or not being admissible in court.

      Problem solved since WBM can impossibly keep that much data on revision-proof media.

  56. Re:School by imboboage0 · · Score: 1

    BS. How can I be modded Offtopic? Offtopic would be if I had said something about paving streets. THAT is offtopic. Referring to the Wayback machine IS NOT OFFTOPIC IN THIS ARTICLE. try to RTFA next time you mod me down.

    --
    Honesty may be the best policy, but by process of elimination, dishonesty is the second best policy.
  57. Re:So the archives are still kept even w / robots. by Cyburbia · · Score: 2, Interesting

    Here's what happens, I think: 1) At first, no reference to Wayback machine in robots.txt. Site is spidered and the archive placed on line. 2) Add Wayback Machine to robots.txt. The site is no longer spidered, and old archives are hidden from public view. 3) Remove Wayback Machine from robots.txt again. Spidering resumes, and all the old archives of the site reappear. However, there is no archive of your site from the time robots.txt was up; remember, it wasn't spidered then.

  58. Re:School by Fallen_Knight · · Score: 1

    when i was in highschool i just plugged in a laptop, used the right static IP and could do whatever i wanted:)

    district office REALLY didn't like encrypted traffic going in and out via ssh thou..

  59. Re:So the archives are still kept even w / robots. by cpghost · · Score: 1

    Here's what happens, I think:

    This is absolutely correct and consistent with previous experiences with WBM.

    --
    cpghost at Cordula's Web.
  60. Usenet and the Internet Archive by yppiz · · Score: 1
    Much of Google's older Usenet archive content was originally collected by DejaNews. At the time Deja went under, I and the other Archive folks were interested in getting it before it went to the great bit bucket in the sky. But I believe Google made a better or quicker offer for the data.

    For what it's worth, the Internet Archive has at least at one point in its history collected Usenet posts. This isn't in the Wayback machine, though.

    http://www.archive.org/about/faqs.php#The_Wayback_ Machine

    Do you archive email? Chat?

    No, we do not collect or archive chat systems or personal email messages that have not been posted to Usenet bulletin boards or publicly accessible online message boards.

    --Pat / zippy@cs.brandeis.edu / blog / pics

  61. Re:Quick! Someone tell Karl Rove... by Netssansfrontieres · · Score: 1

    Rove is still a lyingscumbagcrook. The WBM reference states: "He is married to the former Valerie Plame and has two sons and two daughters." (he = Joe Wilson)
    Rove, may he burn in hell, added the little extra: 'VP work in covert ops at the firm'.

  62. The WBM by Netssansfrontieres · · Score: 1

    Not mentioned in the WSJ article: this is run as a not-for-profit. It is privately funded.

    Unlike Google/cache, where the 'do no evil' motto is looking more & more as though it comes second to ... 'unless we can take over the world by actually doing evil', WBM has no objective other than, say, making all data & knowledge on earth available to everyone for free.

    Little known fact: the Internet Archive also has a search engine ... as it must in order to find all those things to cache. It searches more of the Internet than Google.

  63. Re:Put another way... by HardCase · · Score: 1

    Sorry to offend your delicate senses. However, the conclusion that you drew in your original post was terribly flawed. If you can't see that from the overly dramatic response that I posted, then I guess it's hopeless.

    Incidentally, whether or not I know how to speak to a lady has no bearing on the serious flaw (both logically and morally) in your original post. But it's a nice way of shifting attention away from my point.

    -h-

  64. Re:Put another way... by SilentShriek · · Score: 1

    Having an opinion about a machine does not necessarily involve so much logic, and especially not morals. I own a Dell and I can say it sucks if I feel like it. If that's immoral, off to Hell I go. :-P

    You work for Dell or something?

  65. Wayback has its problems by tuxlove · · Score: 1

    I was involved in a lawsuit over contents of a web page, and we used the Wayback machine to gather evidence of a web page over time. The only problem was that the dates the page was archived in the Wayback database were inconsistent and sometimes too far apart. Some significant contents of the site were missed because they were only available for a few weeks, and were skipped over by their spider. In the end, it didn't help much, it only hurt our case.

  66. Proof of facts is straightforward . . . by werdna · · Score: 1

    Usually, you don't have to make a big proof of facts. During a deposition, you ask a witness whether the website said such-and-such at a given time. You show them the document, and ask again. Since there is high probability the archive.org content is unadulterated, the witness is usually pretty dumb to deny under oath, and more often will simply admit and authenticate the document.

    Sometimes, they may shrug their shoulders, and the best you can get is a "dunno," and acknowledgment that they can't deny whether it said what it said at that time, but don't know that it did. Many witnesses cannot do this credibly, without diminishing their credibility when they later testify they are sure about something else, but the facts dictate the likelihood that depositions and admission can get you all the evidence you need, and the document becomes more of a demonstrative exhibit than principal evidence.

    If that doesn't work, you have two threshold evidentiary issues: (1) authenticating the document as legit; and (2) overcoming the hearsay rule.

    Authentication isn't hard. You get a declaration from archive.org saying it is a legit business record, and the other side rarely has any evidence to the contrary. That will be enough to get it in, and probably enough for the jury to give it credence.

    Then you have the hearsay problem, which may be one-way or two-way hearsay. The document is a record of Archive saying that susie.com said "Jimmy eats rice" on a particular date, and is being offered for the truth of that statement. It may also be offered for the truth of the proposition that "Jimmy eats rice," as opposed to the mere proposition that "Suzie said 'Jimmy eats rice'" in which case it is double hearsay.

    There are a host of exceptions that may be applicable in every particular case. The primary hearsay issue is routinely overcome using the business records exception, and other exceptions may or may not apply depending upon the nature of the statement, the party making the statment, independent indicia of credibility and so forth. It is both easier and harder than it seems to solve these problems, and this kind of stuff is why you pay lawyers the big bucks.