Slashdot Mirror


Deep Linking 2.0 At NYTimes

Gentleman Goat writes: "The NY Times has a well-written article exploring the recent court decision about Deep Linking in closer detail. " Free registration required. This one goes deeper and talks about Web crawling bots and other issues related to deep linking. Honestly I think the spider problem is a separate issue. I think people should be able to say, "Please don't spider this page" (robots.txt for example, but it gets stickier with copyrighted content) but I don't think anyone should ever be able to say, "You may not link this page" since that is fundamentally the anti-point of the Web. Check out the ruling from Japan that linking, in some cases, is illegal.

157 comments

  1. Not really... (Was:Similarity to "piracy"?) by Anonymous Coward · · Score: 1

    This paragraph of the article seems to me somewhat like the software industry's claims of damages resulting from piracy. Given that certain people would never have purchased a product due to various factors, simply downloading a pirated version doesn't really cost them any money.

    Sigh. This is nonsense, and you know it. If it's easy to d/l a pirated version of software, there is no incentive to buy it. I will agree that there is an element that will *never* buy a piece of software, but there is a percentage of people who probably would, but because of the availability of a pirated version, don't. It's the people who could buy the software and who don't because it's "free", that the software companies want to address. And rightly so.

    ... posted anonymously to avoid needless loss of karma to idiotic moderators who actually believe that pirated software doesn't cost money.

    1. Re:Not really... (Was:Similarity to "piracy"?) by Shadowlion · · Score: 1

      Lest we want to start an all-out war here, I think you're both right and wrong.

      You're right - there is some amount of people that would not buy software because there is a free, pirated version available.

      The flip side, however, does exist - there are some amount of people that would not buy software even if they were unable to obtain a pirated version.

      When the software associations release their figures, they automatically assume that every pirated release constitutes a lost sale. You've demonstrated that in some cases, that is true. In others, however, it's not. That means that some subset of the figures that the piracy associations use are incorrect - at best, exaggerated, and at worst, purposefully misleading in order to make the problem seem worse than it is.

      In summary, while I don't think you're completely wrong (in fact, I think you're mostly right), I also think that saying the blanket equation of:

      monetary loss = (copies pirated) * (price)

      is not entirely accurate. Unauthorized software duplication is certainly something that shouldn't be done; on the other hand, saying that everyone who steals would've bought a copy is untrue as well.

  2. Re:I happen to think.... by Anonymous Coward · · Score: 1

    ...that if you put something up on the web, you've made it publicly available for people to link to. There is clearly a limit to this. Just because my financial institutions make it possible or me to conduct business via the web does NOT make it ok to deep link to my bank account information. Neither is it ok to deep link to sites that provide content on a fee paid basis. People providing content that are paid through advertisements rightly view some types of deep linking as a danger to their economic life.

  3. ... by mosch · · Score: 1

    as if we don't all know the magic l/p which always works, anyway.
    ----------------------------

  4. Re:rant - sorry. by Eccles · · Score: 1

    Now, ideally, they can make up the ways that their information is spread, and they finally have control under their own terms.

    But they *do* have that option via various techniques. It's just that the default for the web was meant to be "link to whatever you are able to". Putting the onus on the linker of making sure he has permission to link goes against the grain of the web.

    Now as I understand it, web servers can determine where the request came from, but it is possible to forge that information. I think that forgery is rather more questionable. Did the court decision touch on this?

    --
    Ooh, a sarcasm detector. Oh, that's a real useful invention.
  5. Re:Play nice by kip3f · · Score: 1

    You are an old skool internet hippie. Information wants to be paid for!

    --
    ****Gfx Scrollbar Special case hit!!*****
  6. Linking, pointing by nerdin · · Score: 1


    A guy is driving. Stops and asks directions:

    Simple link:

    Guy: Excuse me sir ... Where's Eighteenth National Bank?

    Joe Linker: (Pointing with finger) Right over there. They
    are an excellent bank, highly recommended...

    Guy: Thank you, sir.

    Five minutes later, the bank is robbed.


    Deep Linking:

    Guy: Excuse me sir ... Where's Eighteenth National Bank?

    Joe Linker: Next corner. And watch out: there's
    a guard who carries both a automatic .45 gun and a shotgun. He lunches
    at 12:08 everyday, and finishes at 12:32 and is sleepy after that. Now
    its 12:12. There are four cameras but cashier #5 is not well covered and
    images are blurry. There's the automatic alarm if cashier takes out
    all the money. Money arrives Mondays (today!) at 10:30. They've
    been robbed twice in the last 3 months so everyone is scared as hell...
    it can be robbed any day.

    Guy: Thank you, sir.

    Bank is robbed at 12:34:30

    Copyright infringement:

    Guy: Excuse me sir ... Where's Eighteenth National Bank?

    Joe Linker: Open that door, give me that shotgun and let's
    take all fsking the money from there, it's a piece of cake.

    Guy: C'mon buddy!

    Bank is robbed at 12:33:32


    I think that ruling on deep linking should be based on intention of
    linker. Freedom of speech doesn't mean taking others work for profit
    and that's what tickets.com did.


    If your friends forgot you, your enemies never will.

  7. Re:I happen to think.... by Exanter · · Score: 1
    There should be NO limit to that. If your financial institutions make it possible for you to conduct business on the web, then they should take the necessary precautions with said data as well. We don't need the courts/government/whoever, telling us that we can't link anywhere. If you put the data up, it is your responsibility to manage it however you wish. If you don't want to do that, then don't put your info up for all to see on a website. That same argument goes for the people living and dying on banner ads...

    The web is basically a free-for-all. A freely accessible library useable by anyone with a web-broswer and a net connection. If you put data up, but want it visible by only a select few, take the time to manage your data that way. Otherwise, you can't bitch.

    Besides, I can't see why people would bitch about it. They are still getting hits, and by others linking to them (deep or not), getting more exposure. This is what the web SHOULD be. It's sad that we may need the courts to decide that for us. In ticketmaster's case, I can't really believe that they are bitching because they still got the sale. Now misrepresentation is another matter entirely, but that is already illegal, web or no. That shouldn't be a determination of whether deep linking is legal or not.

  8. It works both ways by perfecto · · Score: 1
    I basically wrote this page after a company called Asimba told me I could no longer link to their pages:

    http://w3.nai.net/~perfecto/ejercisio/asimba.html

    They told me i couldn't link to them since I had pornography on another page. The funny thing is that it's probably one of the lamest pornography pages on the 'net. But anyway, I removed the link not because they threatened me but because a company this clueless doesn't deserve the benefits associated with being on the net. The net is the ultimate in free customer referral. If they don't want customers passed on to them, fine. Let them rot in hell. I do just fine with my healthy quarterly Amazon checks!



    --
    J Perry Fecteau, 5-time Mr. Internet
    Ejercisio Perfecto: from Geek to GOD in WEEKS!

  9. DOH! Re:Do not link to this page ... by bpdlr · · Score: 1
    So what's the fscking point of him putting this Web page up in the first place, unless he wants people to look at it! If 500,000 people look at his site in the first day, he may get charged an extra $1k, the site may go down, but he's just had more interest in his site than most commercial sites get in a year. Mission accomplished, as far as I can see. He can now take down the site and get on with the project.

    --
    Barry de la Rosa,
    public[at]bpdlr.orgASM,
    tel. +44 (0)7092 005700

    --

    --
    Barry de la Rosa,
    public[at]bpdlr.org
    My /. ID is lower than Bruce Perens'!

  10. Deny based on Referrer by Kozz · · Score: 1

    You can deny based on referer using mod_rewrite. Read some suggestions on it here:
    http://bugs.apache.org/index.cgi/full/968


    Quidquid latine dictum sit, altum viditur.

    --
    I only post comments when someone on the internet is wrong.
  11. Re:thoughts from a bot builder. by Mongoose · · Score: 1

    Greetings brother spiderer,

    I don't bother with robots.txt at all. I asume the user of my bot knows what is right or wrong -- If I was to sell a gun to a guy I wouldn't feel responisble with what (s)he did with it. You just worry about doing what you do, since you can't expect every j(ohn|ane) doe to be an upright citizen.

    When it comes to data mining you don't want to limit yourself - that's why user agent spoofing is so widely used... I wonder how many tracking sites out there count spoofing bots on http as either mozilla or IE.

    Of course I'm an evil basterd that makes a spider lib with an example simple minded mp3 spider...

    *** Information wants to be free ***

  12. Re:the real issue here by cgori · · Score: 1
    That would be fine and all, but the problem here is that tickets.com was using ticketmaster's content as part of their business strategy.

    (Good lord, I am defending ticketmaster. I think I am going to be sick!)

    I kinda recall people being up in arms last year, or two years ago, about one of the "Learn Perl in 21 days" books that copied blatantly and heavily from the perl FAQ's, unattributed. That's because it's called plagiarism. Now, if the FAQ is under "Open Content Licensing", probably you would be OK if you at least attributed the source. But to take someone else's work and use it as your own, probably isn't OK.

    (Good lord, I am defending ticketmaster. I think I am going to be sick!)

    And yes, ticketmaster probably should use a more "secure" means of allowing access to those pages (the "lion at the gate" referred by others), but should they have to?

    (Good lord, I am defending ticketmaster. I think I am going to be sick!)

    The primary problem is that we are all suspect of ticketmaster's ability to "play nice" because they are a monopoly, and they are accustomed to their artificially created power over all tickets in the world. They don't like it when some upstart draws back the curtain and proves that there is nothing magical about what they are doing.

    I can't come with an alternate example that would placate the slashdot masses, because very few businesses work like ticketmaster, but I have the sinking feeling that they might have a right to sue for what they are suing for. Now, the deep linking, as an academic point, well that clearly can't be illegal. I think we will need a legal standard similar to the one in academia, that unattributed copying is bad, n'kay?

  13. Solution by nathanm · · Score: 1

    Copying somebody else's material is copyright infringement, plain and simple. On the other hand, linking should be kept legal and mostly unrestricted. The only exception being that you can't deliberately mislead people to believe the link is your material, i.e. when linking another page in a frame on your site.

  14. then expiring user id in url by just+someone · · Score: 1

    then program in an expiring tracking ID in the url. sid pid pud pig, whatever.

  15. Technology is the solution. Check the headers by just+someone · · Score: 1

    All these people spending money of fucking laywers when they could have just spent the money and built the capability into thier web site to check the header. If it did not come from your site, reject it, send people to the front page, or redirect with _top to make sure that you kill any framing.

    Someone should write companies whose web masters allowed laywers to be used, instead of a simple basic technology solution. Of course, since it's not a standard IIS feature, then it does not exist ;)

  16. Google/Copyright by philj · · Score: 1

    If they brought copyright issues up then google would be up shit-creek without a paddle. Storing cached copies of pages that users can view? Suits you sir.....

  17. Re:A contract won't go away just because it's evil by Tim+C · · Score: 1

    Speaking of terms and conditions on sites...

    I work for a web hosting/design agency here in the UK, and a site I recently did some of the coding for has, in its terms and conditions, a section that specifically forbids the caching of pages by proxy servers.

    Now I very much doubt that they could make it stick in a court, but if nothing else, it shows you the sort of mentality some people coming to the web have these days (not to mentoin showing you the sort of people I have to deal with sometimes :-( )

    Cheers,

    Tim

  18. Re:Similarity to "piracy"? by Cool+Hand+Luke · · Score: 1

    This paragraph of the article seems to me somewhat like the software industry's claims of damages resulting from piracy. Given that certain people would never have purchased a product due to various factors, simply downloading a pirated version doesn't really cost them any money.

    Let's not forget those who *would have* bought the product, but don't have to because they have a free, pirated version.

    I understand that ads generate revenue and that ticketmaster would be upset by people bypassing ads. However, the "offending" deep linking still takes the user to a page containing a banner, and ticketmaster will still receive a service charge, so what are they complaining about? Perhaps they never would have made that sale had the user not gone through the site doing the deep linking. Just a thought...

    I'm sure Ticketmaster wants people to travel through *their* pages with *their* banner ads to get more impressions. Of course, these deep linking lawsuits are nothing but lousy alternatives to hiring a savy web head to design these sites correctly so people can't deep link.


    George Lee

  19. Re:Biting your nose. by Shadowlion · · Score: 1

    I didn't say it made sense. :) I simply said that's what Ticketmaster was complaining about.

    I agree, Ticketmaster does make money on the sales. But when have you ever known a demonstrably greedy corporation like Ticketmaster to not pursue their attempts to squeeze the last dime out of the revenue stream through any means necessary, up to and including suing the living blazes out of anyone that gets in their way?

  20. Re:Huh??? by Shadowlion · · Score: 1

    It's not that Ticketmaster is bitching about people not buying from them. Ticketmaster is bitching because people using Ticket.com don't hit the pages with banner advertisements, and as a result Ticketmaster gets lowered revenues from click-throughs and impressions.

  21. Re:its not about deep linking ... by Kaa · · Score: 1

    The widgets model doesn't work any more, because you are using their widgets through the deep link on your site, not your widgets.

    Well, do you think it is OK for me to stand in front of their store with a big placard saying "You can find widget XYZZY in the middle of aisle 15, left side, third shelf from the bottom"? After all the store wants their customers to wander around since they are more likely to buy something else...

    Kaa

    --

    Kaa
    Kaa's Law: In any sufficiently large group of people most are idiots.
  22. Dynamic web page? by sugarbomb · · Score: 1

    Couldn't Ticketmaster.com generate pages dynamically, thus preventing any page from having a fixed location that could be linked to? This way, pages would be created each time, based on what the user clicks on the first set of pages.

  23. Re:Similarity to "piracy"? by Quack1701 · · Score: 1

    I understand that ads generate revenue and that ticketmaster would be upset by people bypassing ads. However, the "offending" deep linking still takes the user to a page containing a banner, and ticketmaster will still receive a service charge, so what are they complaining about?

    This one is easy. If they make you travel a maze of 5 web pages to find the "deep-link" your looking for, they can place numerous ads on each page. Since they get pay by the number of people who see each ad, they want you to visit every page so you can see every ad 5 times so they can charge 5 times as much. The problem here is, if you make to hard for me to find what I'm looking for, I will stop looking. Also, if I don't click on the banner ad the first time I see it, what makes you think I will the 5th time I see it?

    Quack

  24. Re:I happen to think.... by sklib · · Score: 1

    Linking directly to a zip is no problem, because if you wave your mouse over the link, you see the URL at the bottom. The real problem, I believe, is linking to another's frame, such that all the site's identification is lost. It's not linking that's the problem, it's direct usage (of frames, tables, images, what have you) that is the problem.
    I'm sure Ticketmaster would not have done squat if tickets.com said "Buy these here" and linked to somewhere deep in their hierarchy. It's all about attempts at misrepresentation. Linking to a zip does not misrepresent like that, although apparently linking to warez/mp3s is illegal, or soemthing.

    --
    -S
  25. Solve this problem with technology, not law. by jtgold · · Score: 1

    Joe Developer should know better than to make a deal like this with his ISP because traffic on the web is fundamentally unpredictable. Blaming Taco is preposterous. This is a law of nature. A court can meddle with it, but only at the expense of creating strange social anomalies that would present a real danger to the economics of the web.

    The deep linking case is disguistingly easy to solve with technology. Simply create a web-server that allows access to the super-secret deep-directory only when the referer field comes from the same site. Apache probably already does this with one module or another. Compare this to the bevy of lawsuits and legal terrorism required to enforce this in court.

    The law should give people an incentive to protect their own business interests, because this is feasable, rather than to protect those of everyone else, which is not. Taco can't be expected to keep track of every wacky deal offered by every internet provider, and neither can you. The New York Times, on the other hand, can be expected to find a business model that doesn't require changing the nature of the web. Joe Developer should find an ISP that charges a flat rate. Caveat emptor.

    1. Re:Solve this problem with technology, not law. by milph · · Score: 1
      > The deep linking case is disguistingly easy
      > to solve with technology. Simply create a
      > web-server that allows access to the
      > super-secret deep-directory only when the
      > referer field comes from the same site

      I suspect that they would still like customers to be able to bookmark particular items, and come back to spend money. Requiring you to hit intermediary pages first would probably lose them [Ticketmaster] rather a lot of money. It would also irritate consumers. Repeat customers are important too.

      --
      -- Chapman's Observation #1: Nothing is ever simple
  26. Re:I happen to think.... by schon · · Score: 1

    Just because my financial institutions make it possible or me to conduct business via the web does NOT make it ok to deep link to my bank account information.

    If your bank account publishes your financial information as a page on it's web site, without using some form of access control, I think it's time you changed financial institution.

    Similarly, anyone providing fee-based content, who doesn't understand .htaccess DESERVES to get deep-linked.

    Speaking from experience (I designed a pay-per-porn site a few months ago) it's not rocket science; the first thing you realise is this: every piece of content _MUST_ be protected. I think it's pretty naieve to to say "Please do not bookmark this page, because once your subscription has expired, we can't stop you from viewing all our content." and expect people to actually do as you ask.

  27. Deep linking can be prevented anyways. by Restil · · Score: 1

    If a website does not want any outsiders linking to any page other than the main page, this CAN be prevented. The webserver knows the referring webpage and can therefore refuse any request from any source other than an internal page or a trusted site. All that is required is a little CGI work (if that even), and the problem is completely solved.

    Of course, instead of spending a few hours to properly configure a website, they'd rather make a legal issue out of it. Seems to be the trend these days.

    -Restil

    --
    Play with my webcams and lights here
  28. Re:its not about deep linking ... by thrig · · Score: 1

    The widgets model doesn't work any more, because you are using their widgets through the deep link on your site, not your widgets.

  29. Re:Then use a firewall. by nevets · · Score: 1

    First off, this happened years ago, but the consequences are still with us. We are even a new company and the rules still apply.

    I believe the story went that the person that took the information was actually a vendor working for another project. There was an NDA for the information that the person was working on but not for the document he read. Although the laws may have changed since then, I believe you are responsible for keeping proprietary information locked up, otherwise you risk having to give the cleaning staff a NDA.

    Steven Rostedt

    --
    Steven Rostedt
    -- Nevermind
  30. Then use a firewall. by nevets · · Score: 1

    A good example agains this is internal coporate information.

    I agree with the first poster. If you put it on the web then it is like posting it on the outside of your building. The Internet is a public forum, and all information (like it or not) on the Internet is public. If you want security, then use ssh and other secure utilities. Rules against deep linking is not sufficient to secure documents. If you need an internal way to communicate in your company, then set up an internal internet and hide it with a firewall. This is what we do at our company, as well as other companies.

    We were nailed in court that even documents that are left out on the desk is open for other employees to use if they leave the company. Someone actually read proprietary documents that they were not responsible for and when they left the company they used the information that they gathered. When this was taken to court, the judge ruled that the documents where not secured and thus the person was free to look at them. Now we have to lock all proprietary documents up when they are not in use or it is a security violation. It is different in court if someone breaks into a desk and reads documents then if someone just reads the documents on top of your desk.

    So I may contradict myself a little here. I believe that if you don't take any measures to secure your web pages, then they are free to be linked to by others. If you take "reasonable" steps (now that term could take lots of explaining itself) then those that try to link to the secured pages (via cgi or what not) are in violation.

    Steven Rostedt

    --
    Steven Rostedt
    -- Nevermind
  31. Simple Solution & DEEP linking by CentrX · · Score: 1
    He puts up a banner ad, this would pay for the ISP bill and he might even make a little money. In the end it's much better to be linked to. Even if you don't make any money, your ideas are being disseminated.

    This article is regarding DEEP linking, not simple linking. It's not really about the number of hits, but about bypassing any crap that might be on the frontpage, etc.

    Chris Hagar

    --

    "The price of freedom is eternal vigilance." - Thomas Jefferson
    1. Re:Simple Solution & DEEP linking by |DaBuzz| · · Score: 2

      He puts up a banner ad, this would pay for the ISP bill and he might even make a little money.

      So now to be able to utilize the web as a publisher and still control how your content is accessed, you have to become a commercial entity? I don't buy this argument one bit.

      What if Joe's site is about why internet advertising is the downfall of the internet itself? Wouldn't quite work then now would it?

      My point is, the only simple answer is to respect those who wish to have their content not linked to, all other "solutions" avoid the overall issue here which is regardless of the "open" nature of the web or the "public" aspect of the internet, a publisher of content deserves to not be trampled over by millions if he doesn't want to be.

      And if you read the original post, you'd see I wasn't addressing DEEP linking, I was addressing Taco's statement that he feels you shouldn't be able to stop people from linking to any pages, not just DEEP linking.

  32. Re:irrelevant by Delusion_ · · Score: 1

    You honestly thought I was flaming or trying to discredit you? If I gave you that impression, I apologize. I'm not discrediting you, I'm disagreeing with you and giving you an idea why. If you go quote McLuhan, however, I might not be able to maintain my composure for long ;)

    I'm not saying Tim is not disassociated from the web, but rather that the W3C followed Netscape and the other folks into a designed-web environment, rather than lead them to it.

    As such, the importance of W3C has been diminished to a degree, but that's not really the issue for me. It seems to em no more relevant to ask Tim his opinion on the copyright and liability issues of deep linking as it does to ask the designer of the first skyscrapers his opinions on NYC zoning laws: certainly he has some, but that's not his field of authority or expertise.

  33. Workarounds for deep linking by big-c · · Score: 1

    At ay place of work, we have a site that is prone to having other sites deep link to the content. We have two workarounds for this.

    1) Server side script which checks the refering URL against the real domain name. If the request is from www.domain.com show the page; if not then redirect to site home page.

    2) Javascript: add a "jump-out" of frames option. This is a way to remove the frameset, and present your info in 100% of the browser. We have found that people were more app to drop the "other guys" frameset and stick with our content 100%.

    Some might view deep linking as a problem, but there are many ways to workaround the actions of others on the internet.

  34. Hmmm...... by AppyPappy · · Score: 1

    Isn't this the same company who fired a whole bunch of people for swapping emails?

    --

    If you aren't part of the solution, there is good money to be made prolonging the problem

  35. Security software to prevent deep linking by silvexis · · Score: 1

    While this may go against the grain of the general consensus that feels that deep linking is something that should be fundamentally allowed, there is now software that will provide the ability to control the entry points into your web site as well as secure the entire site as well (AppShield from www.perfectotech.com). Legally, I don't believe there should be any controls in place that say what you do on the web, but if sites want to come up with a technical means to control who and how people access their site, then the more power to them. I think the point I am trying to make here is that a web site is not public property, that your access to that site is not a divine right, and that regardless of how one feels on the spirit of the web, companies do not and will not provide information to the public unless it serves in their best interest. Otherwise why would they? Give sites the power to control things like deep linking, but lets not make laws about it.

  36. Re:Huh??? by Cuthalion · · Score: 1

    Well, you can persue both. There is a way around each method
    A way around each method, for tickets.com, or for the user? If 40% of their customers can't use their ticket sales service because they are using a browser that honors the HTTP_REFERER field, then they need a different transaction model.

    Huh?? Why would Ticketmaster want to stop anyone from buying with them.
    Ticketmaster wants people to buy tickets from www.ticketmaster.com. If people buy tickets from www.tickets.com, thier 'brand loyalty' goes to tickets.com, which is fine for NOW while tickets.com is using ticketmaster, but if tickets.com becomes successful, one of the things they would be able to do is drop ticketmaster. Basically what it ammounts to is they are using ticketmaster's infrastructure (and giving ticketmaster a bit of money for it), without any sort of permission at all.

    There's also the more straightforward issue of lost ad revenue.

    --
    Trees can't go dancing
    So do them a big favor
    Pretend dancing stinks!
  37. Legal vs technical solutions? by Cuthalion · · Score: 1

    I don't understand why they feel it is more convenient to persue a legal solution to this than a technical one. In the case in question, tickets.com is making forms whose SUBMIT buttons send the data to Ticketmaster to be processed - well, Ticketmaster is already doing CGI why is it more than a 2 minute hack to disallow purchases from people who have the "referrer" variable set to Tickets.com? (or even NOT ticketmaster.com)

    --
    Trees can't go dancing
    So do them a big favor
    Pretend dancing stinks!
  38. Re:The funny thing about deep linking by wass · · Score: 1

    Wow, that was totally ironically awesome. Thanks for figuring that link out.

    --

    make world, not war

  39. Re:WWTBLD: What Would Tim Berners-Lee Do? by Jeffk67 · · Score: 1

    At least in the early days the purpose of the internet and the web was the disemination of information. Any decision that forces us to choose between the free flow of information and the abililty of someone to make a profit should be decided in favor of the former. Deep linking is an issue mainly because of the commercialization of the web. Maybe there should be a separate network designed for commercial interests but I would hate to see the surrent internet go any further in that direction.

  40. Re:Spiders, copyright, and resumes by Phrogman · · Score: 1

    If you don't want your data copied by spiders, use a robots.txt. By definition spiders are expected to obey the instructions in the robots.txt file. You can direct which directories and files are okay to index, and which are not. That is the entire purpose of robots.txt - it is not the spider's fault if you did not know to use it. Now if you did use one and it ignored it, thats another matter.

    As for the methods used by the spiders, it is simply more efficient in most cases for a spider to make a local copy of the file and do its indexing on that copy rather than to index directly over the web. In either case it has to actually download the entire page to do so, its simply a matter of whether or not it keeps the copy it downloaded in its database.

    The spider I helped to create had 2 parts, one of which visited the site and built up a list of links on the server after reading the robots.txt to determine which were legitimate targets for indexing. It saved the files locally on our HD. The second process read the files locally and indexed the results, storing them in a database. The reason for this was simply to avoid having to visit a site twice - and avoid problems occuring when a page was changed in between the first visit and the indexing (grabbing links is much quicker than indexing so the indexer tends to fall behind over time).Also indexing from local files is much faster and more generally efficient.

    It is basically impossible to make a spider that does not download the page (after all that is exactly what you do when you view it with a browser), and keeping a copy on the HD is also entirely practical. Otherwise you end up with twice the traffic when the spider hits your site.

    --
    "The first time I got drunk, I got married. The second time I bought a chimpanzee, after that I stayed sober" Arian Seid
  41. Re:Why sue? by ryleyb · · Score: 1

    Well, I'm no guru when it comes to this stuff, but in my experience, the HTTP-Referer field isn't always filled. I haven't researched it too far, but I think some versions of Netscape don't send it at all. And cookies... turn them off, and boom, no more security. I know there are other ways to do it (my site uses one of them :) but ummm... yeah.

  42. Re: ? by Money__ · · Score: 1

    junk micros~1 tags. boooooooo
    _______________

  43. Re:A contract won't go away just because it's evil by tkr · · Score: 1

    Site publishers have some information you want. They don't owe it to you. In an ostensibly-free society, they are entitled to decide under what conditions they're willing to share what they've created. You, in turn are free to decide to accept the conditions and access the information, or reject them and do without. So say you, Exquire! They put it on the Web, they have shared it, and thanks to 'em. It's what the Web is. If they don't want to receive phone calls, they shouldn't pick up when it rings.

  44. Re:The deep linking devil's advocate argument by tkr · · Score: 1

    Go ahead, do it! That's what Yahoo is doing, profiting from the work of others. We should all benefit from each other's work. The Web is the place where this can happen. Lawyerland is where this cannot happen.

  45. Re:What's the Prob With Deep Linking? by tkr · · Score: 1

    So, again, why are these corp people upset about deeplinking when it effectively refers customers to them?

    Well, Prof, it's not the corp people so much as it is the special monopoly people like Ticketbastard. Exempt from the law of supply and demand, they don't think like normal folks.

  46. Re:I happen to think.... by Eruantalon · · Score: 1

    I've two thoughts on this topic. Linking serves a useful purpose for any website. It allows readers to gain information that you may not deal with directory on your site, but they are interested in nonetheless. Therefore, it's a good thing. The problems lie in how you link, IMO.

    Linking to someone else's homepage, I see no problem with whatsoever. It's the homepage, any banner ads will be viewed, and the reader knows that the information contained there is not on your site. There is no reason why this type of linking should be illegal. It's the purpose of the Internet.

    Deep-linking, however, can be problematic. This is where webmasters have to be cautious. Every time I've deep-linked on my website, I've first dropped an email to the webmaster of the site I wanted to link to asking permission to do so. In every case, I've been granted permission to use a deep-link. Now, I've never tried to link to any commercial sites such as Apple or ZDNet, but I assume you won't get so quick and kind a reply as you would from an individual. On the whole, however, people (and companies, I assume) aren't really bothered by deep-linking, as long as they know who's doing so and what.

    I like the idea of using a CGI to protect specific materials from being freely distributed net-wide. I do not, however, think that most people should worry about doing this. Most of the information on the Internet isn't the type of information that needs to be protected from viewers. If so, I'd think a secure site would be the better choice. Articles that are copyrighted by a site, however, should be protected to a point. No one should be able to take your article and post it on their site as their own work. That's plagiarism. What I do and have seen done is to place a copyright notice below each article saying it's my work, and if you want to use it, email and ask me. This usually serves its purpose - either people don't link to/copy your article, or they ask you for permission. If they do, you can nail them for breaking a copyright. That's how I think the Internet should operate, anyways.

    Eruantalon

  47. Re:Great idea, but... by Eruantalon · · Score: 1

    The sad truth is that most people, when not being actively watched, are assholes.

    Well, I want to agree and disagree with you here.... I think I'll have to disagree in relation to linking, though. I don't think most people "don't give a damn about whether or not you want them to link to a particular page" - I think most people don't think about possible repercussions of doing so. Sure, there are plenty of assholes out there who don't give a damn. However, I think that most people, after seeing that you have a copyright notice on a page, or a notice saying please email me if you want to use/link to this page, will email you and ask permission. I've never had a problem with my site, nor any sites I've linked to (granted, I don't get /. hit quantity, but I get enough visitors that it could be a problem).

    America, the land of the frivolous lawsuit, the land of laws against driving without a seatbelt or selling alcohol on Sunday, the land where morality and intelligence are expected to come from legislators, not ourselves. I love this country!!

    I hear you there. If I may say so, I think this country needs to get its head out of its own collective ass. That's all I'll say about that subject, however.

    Eruantalon

  48. Re:Why sue? by Eruantalon · · Score: 1

    But if you don't want people deep-linking to your website, why not use technical means to keep them out?

    Well, I figure that most people/companies suing the hell out of each other over stuff like this either don't know or care about using technical means to serve their ends. They'd rather sue and get both publicity and money from the suit. The best means of protecting a site's contents is the Referrer info. (OK, so maybe not protecting contents, but at least it's the best means of protecting against deep-linking, if you consider that harmful to your site/advertisers. People can always go through your site the long way & steal your articles. They just won't be able to deep-link to them on your site.) If this is used, then the person/company will not have to go through with expensive suits. They won't get as much publicity, either.... Stupid fuckin corporate USA. Whatever happened to the time when people running this country knew things?

    Eruantalon

  49. "You may not read this line!" by try67 · · Score: 1

    You're quite right, there is no way to view a page without creating a copy of it on your local machine, either by cache or just in the RAM until you move on to another page.
    Such a restriction would be unfeasible in today's net, and would render all on-line communications illegal.

    --

    To the fool, he who speaks wisdom will sound foolish. ---Euripides
  50. Re:The funny thing about deep linking by mcrandello · · Score: 1

    That is a good point, however given the subject matter of the article, it wouldn't be very sporting of them ;^) Maybe one day someone there will realize that we simply don't *want* another damn user/pass.


    ...5 years from now everyone will be running free GNU on their 200 MIPS, 64M SPARCstation-5.

  51. A sample robots.txt by Tenement · · Score: 1

    A sample robots.txt file from someone who loves to message those darn robots that like to tickle domains:

    [CENSORED] -- cat robots.txt
    We are the borg. Disable your weapons and surrender your ships. Resistance is futile.
    [CENSORED] --


    Cheers Tenement
    --

  52. The Public Threshold by MedBob · · Score: 1

    This is part of a larger debate that cries out to be settled. I propose a new doctrine known as "The Public Threshold". Anyone who pushes information beyond "The Public Threshold" is making the choice to lose certain rights with regard to that information. Deep linking falls within this area. If you expose the interface for public use, you lose the right to cry "foul" if someone sidesteps your ads, or login, or personal picture pages or... (ad nauseum). Some rights should be protected of course, but the information/content/link is fair game for (perhaps the old doctrine of..) "fair use". This applies as well to broadcast television. If a program crosses the "Public Threshold" you lose the right to beat other local/remote stations and cable providers with the "Exclusive Rights" club. We need to simplify our classification into 3 types, Public Domain, Public Copyrighted, and Propriatary.

  53. They're just too cheap by cprincipe · · Score: 1

    As others have said, a little CGI or HTTP-REFERRER could solve this problem - but that assumes Ticketmaster hires the best people in the business and pays them accordingly.

    The reason why they are suing is that they have lawyers who have the expertise to contest the case but not the technical talent to remedy it.

    --

    bun-fhuinneog agam!

  54. Re:I happen to think.... by pak21 · · Score: 1

    ..that if you put something up on the web, you've made it publicly available for people to link to.

    Yes, but there are right ways and wrong ways to do it - to take an example, the fantastic World of Spectrum has thousands of ZX Spectrum games available. Some people then link directly to the games (say here), trying to make it look like they've put the work in to build up this collection (I've seen sites with "here are some of my games for you to download" and then linking to WoS).

    Phil

  55. Re:Implicit copying of a web page by borzwazie · · Score: 1
    No, your lawyer could subpoena that standard. What is destroyed is not the scientific research, but rather the balloting and voting results of who voted how on the standard. All the people involved on the standard are engineers (whom are quite keenly aware of liability, I might add). The only things that are destroyed are the records of the processes by which descisions are made, not the decisions themselves, nor the research. As I'm not an engineer myself, I'm not totally qualified to give you all the gory details of the process. Perhaps some other /. readers do?

    As for the information being public information, how do you propose funding of standards development, if not by sale? This is not sarcasm, but a genuine question.

    --

    "We apologize for the inconvenience."

  56. Re:Seeing a contract != agreeing to it by RickHunter · · Score: 1

    Looks like a couple of mentors think that your contract really is binding. ;)


    -RickHunter
  57. Re:Seeing a contract != agreeing to it by RickHunter · · Score: 1

    Moderators, even! I knew I should've previewed! :-(


    -RickHunter
  58. this is not going the right way by jonnywadd · · Score: 1

    doesn't it seem wrong that a lot of people who are not computer scientists are judging the web and forcing us to act in a certain way on the web? In this case it was not a bad thing but still, what the hell does a judge who spent his life studying Roe v. Wade know about the web? Really, who is he to judge anything, and why are we allowing the future of our toy to be decided by an outsider?

  59. Re:I happen to think.... by b_pretender · · Score: 1

    I don't agree with you that all material on the web should be publicly available.

    A good example agains this is internal coporate information. Putting this on the web reaps the benefits of being easily available to the employees of the company, while not being public information.

    As you pointed out, CGI, is a good way to do this. But like I said it isn't public information.

    -

  60. Re:The funny thing about deep linking by b_pretender · · Score: 1

    Great! :-(

    Now you are going to cause a /. effect at nytimes partners section, and then they will get rid of "partners" deeplinking.

    How's that for irony?

    -

  61. Re:Play nice by festers · · Score: 1

    Since (almost) all stories posted to slashdot are submitted by people in their own words, personal opinion is going to be a part of *every* story. Since slashdot authors choose to post a particular story, there's another example of bias. It's time to face reality: there is no such thing as "unbiased" reporting. The best we can do is read from more than one source and see what other people are saying (slashdot comments are great for this).

    In light of this, I have no problem with /. authors commenting on a story.


    --------

    --


    -------
    "Every artist is a cannibal, every poet is a thief."
  62. Re:The funny thing about deep linking by ScottMaxwell · · Score: 1
    When accessing the nytimes site, you can use this username/password combination, which was posted to Slashdot by someone else several months ago (searching didn't turn up the responsible individual, so I can't give credit where due, sorry):

    User: wheredoyou
    Password: wanttogotoday

    Let 'em save this info with a cookie, and you don't need to log in any more -- it's as transparent for you as using partners.nytimes.com is, but it keeps nytimes happy. And since there are lots of us using the same username/password combination, they don't really know who's who.

    --

    --

    ``Life results from the non-random survival of randomly varying replicators.'' -- Richard Dawkins
  63. Re:Play nice by Trombone8vb · · Score: 1
    Why don't people just play nice. I mean if you are going to link to a page and you are not sure if the people want you to do so, ask them out of politeness. We shouldn't be making a law about this. I don't understand why people don't just respect other people's wishes. We don't have to make a law about this. Just because it is there is no law against it doesn't mean it is right. Respect peoples wishes when it comes to these issues. Why does our society feel obligated to determing what is o.k. to do and what is not o.k. to do by making a law for everything. Simply place nice with each other. If they don't want you to link to their site, don't out of decency. Not because there is some law against it.

    We already have a law about this, it's called the DMCA! It allows copyright holders to Control access to a copyrighted work which is exactly what the deep linking argument is about. The difference is that these websites are not trying to control the access by technological means, but by suing.

  64. Umm... by lacinyc · · Score: 1

    So sooner or later google will be illegal

    Ticketmaster shouldn't win, or who knows what they will come up with next

    give a mouse a cookie and he will want a glass of milk (or something like that)

    Oh and doesn't slashdot link to blockstacker without telling you that is what it is doing?

    I've only been awake for 28~32 hours so the above may not be completely lucid, I'll hit my second wind soon I think..

    --
    -- "My dad used to play sports with me... I don't like sports" -Tim
  65. Re:Quashing Deep linking.... by Fillup · · Score: 1

    hypertext

    A term coined by Ted Nelson around 1965 for a collection of documents (or "nodes") containing cross-references or "links" which, with the aid of an interactive browser program, allow the reader to move easily from one document to another. See also hypermedia.

    It's called HTtp for a reason.

    HYPERTEXT transfer protocol....not LOGIN transfer protocol, not PAY TO PLAY protocol....the entire system is HYPERTEXT.


    --
    --
    "I think there is a world market for, maybe, five computers." __ IBM Chairman, 1943 __
  66. Technical Solution vs Legal Solution by infra-red · · Score: 1
    I know what I would prefer to be the case, but if there is a technical method to enforce a rule on content (ie no deep linking) is the technical methods presence enough to prevent the legal solution from being adapted?

    I'm wondering if ticketmaster is using the legal method to hurt/destroy tickets.com rather then to protect their content. If they implemented a technical solution, they really wouldn't have a case against tickets.com, and wouldn't that be a shame.

    1. Re:Technical Solution vs Legal Solution by DrgnDancer · · Score: 1

      This arguement make some sense when used in this case, but Ticketmaster used this same tchnique against Microsoft, and they did not have a lot of chance to hurt/destroy that behemoth. On the other hand, it worked so maybe I am worng.

      --
      I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
  67. Re:Similarity to "piracy"? by LaoK · · Score: 1

    Indeed, it can fairly be said that Judge Hupp left the door open for a link-averse Web operator to ban linking via a contract that a Web surfer is forced to agree to before being allowed to enter a site. He implied that those who deep link in violation of this conspicuous and assented to "agreement" would have a potential breach of contract problem on their hands.

    Oh, great... so now we're going to have "shrink wrap licensing" on web sites?

    (I mean, other than pr0n sites... which give "deep linking" a whole different meaning.)

    LaoK

  68. Re:The funny thing about deep linking by LaoK · · Score: 1

    Ahh... the irony!

    So, if in the future anyone wants to post a link to a NY Times story, please
    use the deep link, and not the annoying "free registration required" link.

    Information w/o registration!

    LaoK

  69. Re:The funny thing about deep linking by yerricde · · Score: 1

    User: wheredoyou

    So what happens when www.nytimes.com is slash-DoS'd, and 53% of the hits are from one user `wheredoyou'? That account gets rm'ed. Hard.

    --
    Will I retire or break 10K?
  70. Re:deep linking to non-html by graikor · · Score: 1

    A fair question.

    Generally speaking, most images are copyrighted unless specifically released to the public domain, but many graphic artists were dismayed to see their hard work illegally appropriated by thieving web designers.

    If you knew for a fact that the image had been explicitly released to the public domain, I don't see that they'd have any reason or right to protest. I find that quite unlikely, though.

    I imagine you could legally use a copyrighted linked image (but not necessarily a background image) on a not-for-profit site if you included a caption to the image attributing the source and acknowledging copyright (I'd suggest an informative ALT tag, as well), but some of the issues in this case might put even that in jeopardy, depending on the purpose of the websites involved.

  71. Re:terms and conditions by RalphSlate · · Score: 1
    This is an extremely subtle but very important issue: Can a "Terms and Conditions clause" supersede things like fair use or the concept that facts cannot be copyrighted?

    For example, can a web site that publishes the official daily average temperature in Tucson have a clause that says "by using this site you agree not to use any of the data herein for any purpose"?

    Can an online book have a clause that says "No part of this book may be used in any way, including reviews of the work"?

    Both of these events are legal under copyright law, but can a contract make these actions in violation of the contract?

    Ralph
    http://www.hockeydb.com

  72. WWTBLD: What Would Tim Berners-Lee Do? by xee · · Score: 1

    How come, in all of the hoopla, controversy, mahem, and mainstream news articles, no one asks Tim Berners-Lee (inventor of the Web) what his opinion is. IMHO, his thoughts should play a major role in the legal judgments made about linking in general, and deep-linking inparticular. If I were a judge, this would be the first question to ask.

    The issue of linking - it's original design, is well described in "Weaving The Web", Tim's book about the web. The sort of links we're dealing with in this issue are more formally described as "Soft Links" because there's no hard relationship between the two documents, just an informal pointer: Check this out too kinda thing.

    The idea of illegalizing this is as absurd as the notion of outlawing citations, or bibliographic entries in a book. Or, more specifically, forcing you to say "World Book Encyc. 1999" in place of "World Book Encyc. 1999 Vol. 13 Pg. 231 Par. 4"

    --
    Oh shit! I forgot to click "Post Anonymously"...
    1. Re:WWTBLD: What Would Tim Berners-Lee Do? by xee · · Score: 1

      Fortunately, there is a project working on the antithesis of that. Internet2 is (currently) limited to educational institutions and (I think, could be wrong about this) a few commercial organizations. I don't know the facts here, so don't quote me.

      just trying to give some hope for the future.

      --
      Oh shit! I forgot to click "Post Anonymously"...
  73. Re:irrelevant by xee · · Score: 1

    I believe what you're referring to is the "Symantec Web" (no relation to the company). Berners-Lee describes this toward the end of the book, as part of his ideas for the future.

    Besides that, my argument is that as the original creator of the web, and Chairman of the W3C his opinion counts. You say "If we were still using Tim Berners-Lee's web" as if he died. Although I recognise that the reccomendations are designed by third parties, he IS the chariman of the W3C, he's not totally abstracted from the current state of affairs.

    And another thing, going off on a tangant to try and discredit my ideas just to give you a feeling of self worth is an aweful way to win the fight for our side. It doesn't take a good actor to spot a bad one, thus, discrediting me does not make you any smarter than I am. Remember this: An enemy of my enemy is my friend. We're on the same side here, we don't need to be fighting each other.

    --
    Oh shit! I forgot to click "Post Anonymously"...
  74. Speaking of linking... by HiyaPower · · Score: 1

    This is slightly off topic (sorry), but on the topic of the legal status of links, there is a story over at ZDNet about our friends suing over links to DCSS. It don't seem to matter about right or wrong no more, just the size of your legal department...

  75. Re:Seeing a contract != agreeing to it by HiyaPower · · Score: 1
    It appears that someone read the contract. Now lets try for the home run:

    By viewing this text you agree to be bound by the terms and conditions of this contract. This contract stipulates that you may not view this post with any money remaining in your pocket without immediately putting it in an envelope and mailing it to me.

    Contracts imply the exchage of consideration. When no consideration is given, no contract exists...

  76. It's all in the context... by Locked · · Score: 1

    A hypothetical situation...

    You maintain a website about, say, gay politics. You have on your website an essay extolling the virtues of tolerance and acceptance towards other people. It doesn't actually mention homosexuality, but people who read it via your index page would understand what it's about.

    Now imagine that NAMBLA (North American Man/Boy Love Association) linked directly to your essay. Regardless of whether or not they explain what the link is about (or why they link to it), it might cause people to think that your article is defending paedophilia (or that there is a link between paedophilia and homosexuality, etc).

    When deep-linking somewhere, be considerate of the possible inferences that may be drawn by the surfer. In many cases no such thing will happen, but it can happen.

    Locked

  77. Re:I happen to think.... by Ron+Harwood · · Score: 1

    Then they need to do a better job of identifying the site in a header/footer if they want to make sure that it isn't mis-represented as someone else's site.

  78. Re:"Please don't" vs "You may not" by DrgnDancer · · Score: 1

    Does it seem to anyone else that this case is about being lazy? I am not the most masterful coder in the world, but it seems relatively trivial to me to write a CGI script that protects the pages in question from deep linking if you don't want them linked. Instead of linking to the .html that you want protected, link to a cgi script that looks at where you just came from (I know this is possible) and based on that either sends you to the protected html or to an error page. You could spoof the script, but that is more than 99.9% of the world would know how to do, and would be clearly illegal, thus eliminating the need to set new precedent. Like I said, I am not a great programmer, if anyone sees a reason that this would not work, please respond.

    --
    I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
  79. Irony here by DrgnDancer · · Score: 1

    There is an irony here. In order to read the "Licence Agreement" for a site I would have to go through it's homepage (at least I assume that they are not going to post several KB's of legal text on every page in the site). If I was "deep linked" into the site, I can honestly say that I was unaware of the licence that prevents me from deep linking.

    --
    I don't need a million points of light, just two points of multi-mode fiber and a 10 Gig-E router.
  80. Disaster by r-jae · · Score: 1

    If the judge in this case rules against the defendant and establishes a precedent, it will have devestating long-reaching effects on the WWW as a whole.

    Why be so dramatic I hear you ask? Well, if webmasters can't link to a specific page in a site, it will be a navigators nightmare. Average users will have a hard time finding what they are looking for if they are thrust onto the front page of a site.

    Even slashdot will be greatly effected. In articles posted on slashdot, the author won't be able to link straight to the news story, rather they will have to link to msnbc.com, or cnn.com, or nytimes.com for example. Quite often, the news article is a few days old, and won't appear on the front page. Most articles discussed on slashdot aren't (in the webmasters eyes) front page worthy, so they place them in a subsection. And us ./'ers will have to roam the sites and use the often inadequate search pages to find what was mentioned on slashdot.

    Is this what we want for the already ailing web? My fear is that this judge has no idea what the implications of his judgement will be.

  81. What's the Prob With Deep Linking? by Prof_Dagoski · · Score: 1

    When this debate first surfaced, it had me scratching my head. The sites I run, I'm real glad to have someone hit 'em. I don't really care how they get there, just that the web statistics keep justifying the salary I get. I'd think the same thing would apply to commerce sites like E-Bay. Who care if someone out there indexes your site? It brings people to your site who will click on your banner ads, and use any fee based services you might have. It seems like the deeplinkers are actually providing a service to a lot of sites. In the real world, people make money and get perks for refering someone to a company. Why shouldn't the same thing apply on the web? If your site does have stuff you don't want people linking to, then I imagine you can protect that with a little creative javascript or something else like cgi. I've even done this on my own sites--mainly to make sure people didn't fill out a huge application form out of sequence. So, again, why are these corp people upset about deeplinking when it effectively refers customers to them?

  82. Re:this is not going the right way(flamebait) by Prof_Dagoski · · Score: 1

    Wake up folks! The NSF net was turned off about six years ago. The Internet is an instrument of the public now. It has been for a long time. There are no longer such things as outsiders. You want an exclusive little playground, fine, run a private ip network behind a firewall. That said, the newcomers need a serious education in the concepts behind references. Us academics love references because in terms of a print document it tells you where the author got his/her information and points the way to related works. The original idea on the web was to make to make this kind of research work instaneous. Want to check a reference to get more information? Click on the link. Much quicker than journal searches and inter-library loans. Now, we've gone from the Net being the playground of a few weird scientists to a medium where money changes hands. Now we have to make distinction between private and public. Fortunately, the technology exists to accomplish that. Having all the capitalists and other such unwashed barbarians on the net doesn't destroy it by any means, but it does mean we need think about things like property.

  83. Re:"Please don't" vs "You may not" by Fishstick · · Score: 1
    >Without deep linking, Slashdot wouldn't exist, now would it? :)

    I must really mis-understand this issue then. What slash does is publish hyperlinks to stories and articles all over the web. What I thought the ticketmaster thing was about was someone else linking to their content out of ticketmaster's site and having it render in a single browser window with their own content so as to appear to be coming from their own site and not ticketmaster.

    Seems like an important issue is how the law distinguishes the difference between the two. I agree with the common-sense arguments that once you publish something on the web, it's out there for the public to use and you should just deal with it. If you have a web page that is accessible through a hyperlink, you are way off if you claim that no one should link to your page but you.

    But if you have a web site, and someone uses individual elements such as graphics by simply referencing them on your server from their own site, I see that differently.

    When it's an evil company who is using the web as their storefront and some other company comes along and "borrows" some of their content by referring to it on their page in a way that makes it look like it is all their own page, the evil company cries foul and we have little pity. "Too bad, that's the way the internet works, get used to it!"

    What would be /. reaction if the NY Times were to find a really interesting comment by Signal 11 and render a page that hauls the text out of slash's database and presents it with NYT frames and banners and hides any hint that it came from another site? :-)

    If it is a hyperlink on a NYT page, its fine. If it's a fully rendered page that looks like NYT with /. content?

    --

    There is much cruelty in the universe, John.
    Yeah, we seem to have the tour map.

  84. Re:Do not link to this page ... by gms · · Score: 1

    I Don't think it would be wrong for Taco to keep the link active.
    Joe could Temporarily take down his server, remove the stuff getting slashdotted, write a CGI that sends a small error message and apology to 99/100 requests that have slashdot as the referer.
    Joe has Lots of Options here.
    I think that if you can't stop people from doing something (linking, copying a DVD, whatever) in Code (or otherwise technologically) Maybe you shouldd't be trying to stop them.(As opposed to having the courts back up your useless code)

  85. Deep Linking & Personal Information by JCMay · · Score: 1
    ...that if you put something up on the web, you've made it publicly available for people to link to.

    There is clearly a limit to this. Just because my financial institutions make it possible or me to conduct business via the web does NOT make it ok to deep link to my bank account information.

    I would imagine that these types of pages are not static HTML, but are instead built on demand by a CGI script or something. I seriously doubt that my bank (which does have a web presence), has a page for each of its members (it's really a credit union).

    Neither is it ok to deep link to sites that provide content on a fee paid basis. People providing content that are paid through advertisements rightly view some types of deep linking as a danger to their economic life.

    Most of these sites have access controls that don't allow this. I imagine that people running these kinds of places have thought of that "loophole," and have taken measures to make sure it's not open.

  86. Take reasonable precautions by Dhericean · · Score: 1

    I agree with this and believe that a site that does not wish deep linking should check where the system has navigated from and actually substitute the top page, a login screen, or at least a warning/copyright front page.

    Part of the problem is that a lot sites are created by people who are more adept at glitz than at the nuts and bolts of secure navigation and http. If those in positions of responsibility were more aware of the possibilities then hopefully they would insist that their developers use these methods where appropriate.

    If the courts were aware of the measures that could be taken they they would be less likely to uphold a complaint made where such measures had not been taken.

    There is a need for education of the people in positions of responsibility so that appropriate measures are taken which do not involve excessive legal involvement (Why give money to the lawyers?).

    --

    Gamma Testing - Where testing is extended to the full user community (AKA Shipping the Program)
  87. Deep Linking by mfinke · · Score: 1

    Fortunatly the deep linking ruling went as it did. However had it gone the other way, wouldn't that cause problems for sites such as /. Many articles here link to their source item, wich is rarely the front page of a site. It could alos have had major ramifications on search engines, which link you directly to the relevant page. I know that the ruling was only regarding deep linking and not telling the viewer that they are changing sites. But court rulings have the potential to be applied rather liberally.

    Thank god someone knows a frivilous lawsuit when they see it.

    --
    The following statement is true. The preceding statement is false.
  88. The squeakiest link gets everybody greased? by Robogeek · · Score: 1

    The Japanese ruling bothers me quite a bit. How far do they take this? If Site A carries a link to Site B (which is a movie review site, but carries no illegal material itself), and Site B carries a link to Site X (an adult movie review site, which carries material considered illegal in Japan), can Site A be prosecuted under the current interpretation of the law? It has, after all, "increased the number of ways to access obscene sites", which seems to be the basis for finding the defendant guilty under their "Article 62" (Aiding and abetting crimes involving the distribution of pornographic material.) If so, how deep does one take this? If there are 3 other sites in the chain of links between you and the offending site, are you still liable? 4? 20? It almost seems as though you could make a case for every site on the Internet being liable, as you could probably find a way to follow links from most any site to an adult site. (Wasn't there something said a couple of years ago that most sites are only 6 links away from pornographic material?) Another problem - what if Site A carries exactly one link, which points to XYZ.COM, a on-line ordering site for children's books, and a year down the road, XYZ.COM is bought by an adult magazine and begins featuring porn, without the knowledge of Site A's owners? I can see the need for sites to be able to request that no links be made to them (as in the example someone posted, wherein a small site is taken down due to links from a high-traffic site), but making one site legally liable for material presented on a site it links to seems absurd to me.

    --
    "What about that time we caught you naked in the kitchen with a bowl of Jello?!?" "Hey-I was HOT and I was HUNGRY!!!"
  89. Re:I happen to think.... by wallyman · · Score: 1

    Let me make a (not so) bold statment and say that deep linking is OK, but only down to the page level. It should not be kosher to link to individual elements within a page. For example, linking directly to an image or directly to an audio/video file. This gives the linker all the credit and the linked all the cost.

  90. terms and conditions by geekpress · · Score: 1
    The NY Times article indicated that it may be the terms and conditions that forbid deep linking. Given that few people ever read the legal mumbo jumbo on a web site, it's somewhat disturbing that it may actually be legally enforcable. I am reluctant to call anything a contact that I haven't signed, let alone read!

    My site, GeekPress deep links to news on other sites, like slashdot. My hope is simply that those links will be regarded as bringing users into the site that might not have otherwise arrived.

    One more thought: Why doesn't ticketmaster block or redirect incoming traffic from tickets.com? Shouldn't technical remedies come before legal ones?

    -- Diana Hsieh

    --

    -- Diana Hsieh
    GeekPress: The Weirder Side of Tech News

  91. Look on the bright side! by Alien+Perspective · · Score: 1
    CmdrTaco wrote: I don't think anyone should ever be able to say, "You may not link this page" since that is fundamentally the anti-point of the Web.

    Okay, everyone...put up your "do not link to pages on this site" notices. Then sue the crap out of Mattel when they stick your URLs in their blocking software.

  92. Should everyone be able to link to you? by Anonymous Coward · · Score: 2

    This brought to mind a post I saw a few years ago from someone who ran a web site providing information about foreign adoptions. They were upset because a pro-pederasty web site was linking to their page. Are their no instances in which links should be forbidden? I can imagine the adoption agency would be quite upset to find their name coming up in a web search describing how to get children to abuse.

  93. His ruling is simple by Gleef · · Score: 2

    Deep linking (and any sort of linking) is not illegal in and of itself. On the other hand, just because it is a link does not protect you from other laws, such as "passing off", in this case, where one company pretends to be tightly connected with another. Similarly, the fact that it's a link should fail to protect you in cases of libel, fraud, and other informational crimes.

    Linking should be free, but that is not a defense against doing things that should not be free, and that's what I see the real issue here as. Finding someone liable for passing off via a link won't have a chilling effect on links, it will just have a chilling effect on passing off, which is way too common on the web.

    ----

    --

    ----
    Open mind, insert foot.
  94. Re:Why sue? by sjames · · Score: 2

    Well, I'm no guru when it comes to this stuff, but in my experience, the HTTP-Referer field isn't always filled.

    That's the case with older browsers. However, if you were Tickets.com would you be willing to have 90% of everyone who clicks the buy link on YOUR site get a simple black on gray page that says "We won't sell it to you because you're a flatulating butthead"? Or perhaps a page that explains that you can't actually serve the customer, but we can! followed by a link to the home page.

    I strongly suspect that that would put a stop to it rather quickly!

  95. The deep linking devil's advocate argument by CaseyB · · Score: 2
    A frame is simply a special case of a link. So, to make some extra cash, I just write:

    <HTML>
    <H1>CaseyB's Amazing Web Links Database</H1>
    <!--miscellaneous banner ads, etc.-->
    <IFRAME SRC="http://www.yahoo.com/">
    </HTML>

    With some clever Javascript, I could probably even size and scroll the frame so that Yahoo's ads never appear on the screen.

    Now I just advertise this to unsuspecting web users, and make some cash.

    Note that this doesn't involve my copying Yahoo's data, nor even accessing their servers myself. Yet I'm making a profit off of their work.

  96. Re: ? by CaseyB · · Score: 2
    junk micros~1 tags. boooooooo

    I'll do that when the anemic little Netscape/Mozilla browser provides such an obvious, useful functionality.

  97. Re:Deep Linking, to and other files by Jac_no_k · · Score: 2

    I always hated people who link directly to the images on my site avoiding all the marketing crap and sucking up my bandwidth. I wonder if there will be a court case where a company gets sued for breaking links from the outside world.

    I implemented a little file name rotator for the images on the website I manage. Every 8 hours or so, the app will rename all the images and update the URLs on the site to match. In the place of the old filenames, I placed message pointing users to the site where they can get the file (and all the marketing shit.) This effectively broke all the links.

    Now I'm just waiting for some idiot to sue my company over this...

    -jack jsuzuki@ix.netcom.SPAMSUCKS.com

  98. the real issue here by Merlinus · · Score: 2

    is that "deep linking" or linking in general is
    no different than a footnote in a book, an
    entry in a bibiliography, or just a conversation
    between two people where one supplies the
    source of a piece of information to another.

    Indicating the source of a piece of information
    is in no way the same thing as supplying that
    information. DUH. Therefore there can be no
    patent violations, threats for linking to
    dangerous/controversial (to some people)
    information, etc...

    It doesn't do this issue justice to say that
    it shouldn't be allowed just because it is
    anti- the purpose of the web.

  99. Deep Linking, Kevin & Kell... by Robotech_Master · · Score: 2

    One of the sites I regularly visit, the Kevin and Kell online comic strip, asks people not to link or inline directly to the comic strip--they've had some trouble with people doing that in the past. I had some concern that this linking decision threw open the door to people to do just that--but from the NYTimes article, I see that it's still open to debate (and legal action).

    --
    Editor Emeritus and Senior Writer, TeleRead.org
  100. Needed: better automated mechanism than robot.txt by orpheus · · Score: 2
    I've had at least one site since 'the early days' and I recall how indexing bots were a real problem long before the web (e.g. gopher, FTP, etc.) Back then we were more worried about server load and bandwidth than content (which was presumed to be open and free)

    ROBOT.TXT has some very real problems. One is that the file must be placed at the root directory of a site (per the original spec) and this is not compatible with some hosting services. Another is that it was a one-stop 'shopping list' of targets for the less-than-scrupulous. And of course, as everyone knows, compliance is voluntary.

    At the very least, allowing robot.txt as a per-directory access restriction would make far more sense today: it would be a little more flexible, and would not provide a shopping list. (it was not adopted in the original, because it was more bandwidth intensive)

    However, we really need a more flexible plan from the ground up, to deal with the needs presented here today. At the very least, it would help the 'cooperating' bot owner to better understand the wishes of the site owner. Today, I suspect that most suites that care about bots at all would allow indexing of some content but not all, and would like to specify access based on use.

    The compliance issue, alas, is unlikely to be resolved anytime soon. It's up there with Direct Marketing dinnertime phone calls and spam.

    __________

    --

    If you can go to bed, knowing you did a valuable thing today, you're very lucky. If you can't... it's not bedtime

  101. Question About Lawyers by SnatMandu · · Score: 2
    This story makes me wonder about how big companies interact with their legal department.

    Here we have a case where there was a cheap, more or less foolproof, technical remedy that could have been implemented in well under one man-day, most likely. Yet instead they go for the legal solution.

    This makes me wonder who's calling the shots? Is the "problem" that suits talk to their lawyers and not to their techs? Or is it that lawyers "sell" themseleves to these clients:

    "Hey, Mr. CEO, I noticed Company X is deep-linking to our site. As your counsel, it is my responsibility to inform you that by doing so they're engaging in blah blah blah, and we should sue the pants off them. " Is anyone out there in a position at work where they deal with corporate lawyers? Is it really the companies that "sick their lawyers on them", or do lawyers "sell themselves" to their clients, by painting pictures of legal doom-and-gloom if they don't sue?

    Obviosly in this case, the problem could have easily been solved without an expensive lawsuit - yet we see not technical solution, and an expensive lawsuit.

    Any anecdotes would be appreciated.

    Don't forget to post as AC

    1. Re:Question About Lawyers by nyet · · Score: 2

      Path of least resistance...

      Its much easier (and cheaper) to get a lawyer to do a PHB's dirty work than it is to find a competent geek who will 1) understand what the PHB is saying and 2) agree to do it.

      Good, competent, amoral technical people who can communicate with non-geeks are hard to find and expensive.

      Pay any old lawyer enough money and he will litigate for you until doomsday, regardless of the cause, and no technical expertise needed.

  102. Re:I happen to think.... by Mike+Schiraldi · · Score: 2

    One of the criteria the judge specified is that deep links must not mislead the user to the point where they don't understand what site they are at. I think that directly linking to a ZIP like that crosses the line.
    --

  103. Re:Do not link to this page ... by woogie · · Score: 2

    mod_rewrite is your friend...

    RewriteEngine on
    RewriteLog logs/rewrite_log
    RewriteLogLevel 0

    RewriteCond %{HTTP_REFERRER} ^[^http://somehost.com/].*
    RewriteRule .* - [F]

    This should reject any access not referred from somehost.com. Of course, this is off the top of my head, so I might have totally blown it.

    Woogie

  104. Re:"Please don't" vs "You may not" by Surak · · Score: 2

    Well, anyone from Slashdot is prejudice in this case, really, with our without CmdrTaco's comment.

    Think about it. Without deep linking, Slashdot wouldn't exist, now would it? :)

    FWIW, I agree with CmdrTaco: the Web is primarily a broadcast medium. One of the advantages of the hypertext nature of the WWW is that hyperlinks, indexing, and the like make it easier for users to find information.

    The WWW was never designed to be a medium where you only go in to Web sites through the "front door" so-to-speak. The fact that I can read something on a topic, click on link to jump to a related page, and then keep following links to find the information I want is the whole idea: this is POWER. And if it weren't for this kind of power, I think a good number of folks who use the Web for serious research wouldn't be using it, because it wouldn't be practical.

    Take away linking and you take away power. Take away power and the users will follow. People quit using the Web, and the minor tech stock correction you saw in the Nasdaq recently will seem like nothing.

  105. Use the referrer! by Stephen · · Score: 2

    If Ticketmaster wanted to prevent deep linking, why didn't they just check the Referer: header instead of calling in the lawyers?

    Of course they'd have to allow requests without a referrer to get through, and one can fake a referrer, but it would stop almost all deep links from a rival site, for people using a regular browser. They could even have redirected them transparently back to their home page!

    --
    11.00100100001111110110101010001000100001011010001 1000010001101001100010011
    1. Re:Use the referrer! by locutus074 · · Score: 2
      Don't most browsers have a preference setting that lets you disable sending a referrer? (Just asking, since mine does.)
      I don't know about "most" browsers. But "most" of the general population would never have the idea of diabling it occur to them. Unless tickets.com published instructions stating "If the ticketmaster.com home page appears instead of the concert ticket page, hit 'Back' in your browser, then go to Edit|Preferences..." :) Yeah, I can definitely see that happening. ;)
      You can never, ever trust a client. All clients should be considered hostile.
      Witness the EverQuest (or whatever it is) fiasco.

      --

      --

      --
      We have fought the AC's, and they have won.

  106. Re:its not about deep linking ... by Kaa · · Score: 2

    .. its about unfair business practices. one of which is deep linking into ticketmaster as a part of tickets.com business process.

    The fact that a company has an unsustainable business model does not impose any obligations on anybody to change the law so that the model becomes sustainable.

    Sure, tickermaster would like everybody to go through the front page where they can be exposed to ads. So what? Ticketmaster would also like to become the sole legal source for all tickets to anywhere.

    If somebody is selling widgets fo $1.00, it's perfectly legal and moral for me to open a store next to him and start selling the same widgets for $0.90. Of course this will make the guy upset, but it's not a good reason to forbid me to sell widgets.

    Kaa

    --

    Kaa
    Kaa's Law: In any sufficiently large group of people most are idiots.
  107. Play nice by Eman · · Score: 2

    I got two comments:

    1. Could the people posting the stories please put their comments with everybody elses instead of with the story. We don't all have the same opinion as you and you are biasing the story.
    2. Why don't people just play nice. I mean if you are going to link to a page and you are not sure if the people want you to do so, ask them out of politeness. We shouldn't be making a law about this. I don't understand why people don't just respect other people's wishes. We don't have to make a law about this. Just because it is there is no law against it doesn't mean it is right. Respect peoples wishes when it comes to these issues. Why does our society feel obligated to determing what is o.k. to do and what is not o.k. to do by making a law for everything. Simply place nice with each other. If they don't want you to link to their site, don't out of decency. Not because there is some law against it.
    --
    Eric Anderson
  108. Re:Do not link to this page ... by brydon · · Score: 2
    Let's say he practices what he preaches (see quote above) and says "You can't tell me not to link it!" and leaves the link up. Joe Developer gets 500,000 hits in one day and goes over his allotted bandwidth by 20GB. At $5 per 10MB over his allotment, he now owes his hosting company a $1000 in overage fees and Joe Developer removes his site and goes bankrupt due to the "fundamental point of the web."

    The fairly obvious solution is for 'Joe' to simply remove his page; maybe replace it with a message saying 'please return next month'. Should reduce the number of bytes transfered.

    If you dont want people to read your pages, the simple solution is to not put them on the web! Surely that is the point of the web.

  109. How about... by Wah · · Score: 2

    ...deep linking DoubleClick ads?

    Practice what you preach /.^H^HAnd^H^H^HVALinux.

    --

    --
    +&x
  110. Two points... by Shotgun · · Score: 2

    Indeed, it can fairly be said that Judge Hupp left the door open for a link-averse Web operator to ban linking via a contract that a Web surfer is forced to agree to before being allowed to enter a site. He implied that those who deep link in violation of this conspicuous and assented to "agreement" would have a potential breach of contract problem on their hands.

    How would such a ruling affect "click-through" licenses? I think this is something to watch.

    The second point is the nature of the net. Everything is public. If I write a story and placard it on a billboard next to a busy section of I-40, can I stop the newspaper from printing a picture of it? I see attempts to stop any sort of linking in much the same light. How much control do I retain over works I publish in such an uncontrollable area?

    --
    Aah, change is good. -- Rafiki
    Yeah, but it ain't easy. -- Simba
  111. Re:Do not link to this page ... by |DaBuzz| · · Score: 2

    I Don't think it would be wrong for Taco to keep the link active. Joe could Temporarily take down his server, remove the stuff getting slashdotted, write a CGI that sends a small error message and apology to 99/100 requests that have slashdot as the referrer.

    And if Joe is out of the country on vacation and only has web and mail access, what then? What if Joe is in the hospital and has no clue about the slashdotting he got until his webhost sends him the $5,000 bill? What if Joe had a mexican lunch and is on the can for the first hour of slashdotting, he's already OVER his bandwidth limit before he can even START to code a CGI solution to his problem. What if his project has NOTHING to do with the web at all, it's simply good "news for nerds" and anything CGI is greek to him ... then he has to find someone to stop the slashdotting for him which will cost both time and money.

    Not everyone is a perl guru, that doesn't mean they don't have the right not to endure a slashdotting against their will.

    A lot of people say if you don't want people showing up, don't put your stuff on the web ... well I put my house on a public street, does that mean I want 500,000 geeks driving by one day to take a look? Hell no, and it's my right to do what I can to stop that since geeks usually leave a trail of Jolt Cola cans and Slim Jims behind them. heh

    Take that to the next level and put those 500,000 geeks in my front yard, than you'll have what a slashdotting is like. If I put up a no trespassing sign, all 500,000 of you would be breaking the law regardless of how public the street (network) I live on is. So yes, the web is public, that does not make every single page a public place where the person who owns the page (pays for the hosting, resources, etc.) has no rights to control how it's accessed.

    To put it in simpler terms ... if I'm a site admin that does not want a certain amount of network traffic, and you continue to send that network traffic to my site after I ask you not to ... that my friends, is a denial of service attack. What's next, script kiddies saying they have a right to smurf you because your server is on the "public internet" and the whole point of the internet is to share data?!

  112. Why sue? by Garpenlov · · Score: 2

    This probably will sound hopelessly naive and uninformed to people who solution to every problem is to sue... But if you don't want people deep-linking to your website, why not use technical means to keep them out? Check the HTTP-Referrer, or only let them in if they have cookies that where set at the top level of your site... I guess by sueing, you don't have to worry about implementing the above methods, and then having people get around them. But I figure that if you don't want people getting to something, password-protect it. Of course, in the curious world of advertising, you can want people to see something, but only if you have control over it..

    --
    --- Where's my X.400 protocol decoder?
  113. Re:Similarity to "piracy"? by thimo · · Score: 2

    The way it is put here, does that mean they'll start fighting the use of junkbuster in the future too? I mean, the result is the same as deeplinking, you avoid the adds!

    What I think of this: Like others have stated before me, HTTP_REFERER isn't just there for the cat to play with! (not that that could help them with junkbuster... :)

    Thimo
    --

    --
    Avoid the Gates of Hell. Use Linux!
  114. "Please don't" vs "You may not" by ebcdic · · Score: 2
    You (CmdrTaco) prejudice the question by comparing "please don't spider this page" with "you may not link this page".

    How about "you may not spider this page" or "please do not link this page"?

    I don't think there's any difference between linking and spidering (ie indexing). If you make something public by publishing it, other people have a right to refer to it whether in a web page or an index. Of course you can ask them not too, but that goes for both cases.

    1. Re:"Please don't" vs "You may not" by locutus074 · · Score: 2
      You (CmdrTaco) prejudice the question by comparing "please don't spider this page" with "you may not link this page".
      Um, actually, no he doesn't. The difference is that Ticketmaster was trying to enforce the former, and the robots.txt "standard" is a convention that is followed by spidering programs. It's perfectly possible to write a spidering program that does whatever the hell it wants regardless of what the robots.txt file says.

      Anyway, what's the big effing deal? Why didn't Ticketmaster just configure their http server to redirect all traffic with a referer [sic] header of *.tickets.com so that instead of seeing http://www.ticketmaster.com/some/really/really/dee p/link.html, it would send you to http://www.ticketmaster.com/ ? That's a much better solution than litigous legal battles. Silly corporations, dirty tricks are for kids!

      --

      --

      --
      We have fought the AC's, and they have won.

  115. HTTP_REFERER dang it! by GMontag · · Score: 2

    The simple implementation of this little CGI variable "HTTP_REFERER" or it's equivelant in your development package, wil control "deep links". If you want to annoy people with your fron NASCAR style page, you can make that come up automatically no matter where someone trys to enter your site.

    Other option, mentioned several times above, have content in the middle of each page and common borders around it.
    <rant>
    This is one of the most annoying non-issues on the web.
    </rant>

  116. irrelevant by Delusion_ · · Score: 2

    If we were still using Tim Berners-Lee's web, we would still be clicking in an environment where content decided appearance rather than the author.

    If you recall, the original incarnation of the web called for the tags to say what text was, not how it should be displayed. The idea was that a tag would define what the content was - a quotation, a mathematical formula, a definition, words to be emphasized, etc - and that a browser written for a college student might display this content quite differently than a web browser written for a grandmother or a scientist or a lawyer, etc.

    Now, in the days where authors fight tooth and nail to get their pages to look the same in Netscape and Internet Explorer, the anti-design contingent has lost - for good or ill.

    So given that we've moved away from the idea of a web where commerce wasn't kosher and design didn't exist, should we really look to the original design spect to address an issue that goes beyond the original scope?

    Another interesting thing to consider was Nelson's Xanadu. In the Xanadu incarnation of internetworked hypertext, "deep linking" was part of the design - the idea is that text simply would not be repeated. If the Associated Press issued a news story and 100 sites quoted from it, in the Xanadu incarnation, that AP quote would be hard linked by design.

    Maybe Nelson's views on copyright and linking might be more relevant than Tim Berners-Lee's. I'm not familiar with them, myself, maybe I should go do some research...

  117. Deep linking html vs images by JPS · · Score: 2

    I'm not too sure what the decision exactly means, but I believe that deep-linking a PAGE should always be allowed. Maybe frame tricks and such should be prohibited though. However, it doesn't seem acceptable to me to allow people to deeplink other content such as images, video clip, sound, etc, even if they mention the deeplinking. Obviously, they would only steal resources without any gain for the deeplinked host.

  118. Re:I happen to think.... by Cuthalion · · Score: 2

    That is the distinction between 'linking' and 'deep linking'.

    Basically deep linking allows me to pass off other people's content as my own. I can do it by using their graphics from my own <img> tags, or by linking to files directly, or perhaps even by including one of their HTML pages in my frameset.

    My opinion on this is that it's only okay to do this normally. If you don't want this to happen, there are plenty of technical ways to avoid it, and it would be appropriate to employ them. (If I feel that you should only be able to go to here the long way, well, it's my server and I have every right to decide which requests I honor. It is CERTAINLY not my responsibility to maintain my site such that without-permission deep links to it continue to work.

    --
    Trees can't go dancing
    So do them a big favor
    Pretend dancing stinks!
  119. A contract won't go away just because it's evil. by XLawyer · · Score: 2
    The only really troubling (to me) point in the analysis is that a site's terms and conditions can prohibit deep linking. The article suggested that sites might then require you to agree explicitly to those terms as a condition of letting you use the sites. Those terms would then become part of an enforceable contract.

    The first troubling thing is how much more cluttered web browsing would become if sites got serious about that. It's tedious enough with a 56k modem; they don't need to make it worse by making you download extra Javascript and legalese before letting you use the site. I don't think anyone wants to see it become harder to get good stuff out of the net.

    The second troubling thing, though, is the attitude that some /. readers seem to have, which is that these restrictions don't bind us if they annoy us, or frustrate us, or make no sense from any perspective we can see. You don't have to justify a contract in terms of public policy or the common good.

    Site publishers have some information you want. They don't owe it to you. In an ostensibly-free society, they are entitled to decide under what conditions they're willing to share what they've created. You, in turn are free to decide to accept the conditions and access the information, or reject them and do without.

    This works both ways: no one needs to convince Mattel that the GPL attached to cp4break (I think that's the software I mean) is a socially beneficial way to distribute software--they're stuck with it. But a site doesn't have to justify a contractual prohibition on deep linking. If you accept it by visiting the site, you're stuck with it.

  120. Simple solutions for simple problems by gad_zuki! · · Score: 2

    Joe should get a banner ad.

    or at the very least a non-gouge web server.

  121. Liability, Abstraction, Interpretation, etc. by rakslice · · Score: 2

    Who is liable according to US law when a machine "commits" a crime? I don't know this, and it would be interesting (and useful!) to know. =)

    I'd like to think that liability is based at least remotely on responsibility... But, some interesting issues come up when assigning responsibility for supposed illegal links and the "unauthorized derivative works created through framing", a case which is very similar.

    I'll examine the frameset problem a bit further first (there are more comments that are specific to the linking problem below). Your PC, created by company A with hardware designed and manufactured by companies B, C, D, and E, is running instructions of a web browser made by company F, which is acquiring a frameset-containing-webpage written by person G from a server owned by H through networks owned by I, J, and K, all at your direction. The browser then assembles all the relevant data, passes it through to the display architecture of an operating system designed by L and distributed by M, to a monitor built by company N. I could add much more detail here, but this should suffice to demonstrate that there are a lot of people and objects involved.

    Now, by general consensus, it is equally correct to speak at any level of abstraction when describing an event. That it, it is equally true to say, when picking up a penny with my hand, that "I am picking up the penny", or "My hand is picking up the penny", or "The force of friction produced on the penny by the atoms of my hand is picking up the penny".

    So, in the case of the framing problem, the creator of the web page is producing the derivative work, your web browser is producing the derivative work, your computer is producing the derivative work, your monitor is producing the derivative work, you are producing the derivative work, etc. Who is responsible? All of the above? Some of the above? None of the above? Those on the highest level of abstraction? Those on the lowest? I believe that the assignment of responsibility in such cases is somewhat arbitrary.

    So while the issue of who is responsible for creating a link is similarly hard to deal with, there is another issue that's more specific to unethical linking: If you ask those complaining about the links what part of the source code they are against, they'll probably tell you that it's not specifically the A tag or the URL, but both together in the order that they are in. So if you remove the A tag, for example, they will stop complaining, because the link that was formerly present is no longer present when the page is displayed in a normal web browser. That is to say, in the normal interpretation of the HTML code, a link is no longer present.

    However, suppose someone writes a browser that automatically interprets text fragments starting with "http://" as URLs and presents them as hyperlinks. In other words, in this new interpretation of HTML code, any URL is a link. Does that mean that the forbidden URLs are now illegal links and need to be removed (and by the same token, if I write a browser which presents all occurences of the letter E as some forbidden link, the letter E be banned from the web)? Or, will the URLs be considered legal as long as the current W3C HTML standard or some other "normal" legal standard doesn't interpret them as links (leaving people free to write software which does)?

    Godel comes to mind for some reason. Hmm... (Note: If you haven't read _Godel, Escher, Bach: an Eternal Golden Braid_, please buy / rent / borrow / steal a copy if you don't have one, and read it RIGHT NOW!)

    Anyway, that's enough food for thought for one message. =)

    Just my $0.02 [a conservative estimate given the amount of time that I blathered on for...].

    -rak

  122. In Patent news... by mcrandello · · Score: 2

    I heard Ticketmaster just applied for a patent on their "25 click" technology...

    (Rimshot)


    ...5 years from now everyone will be running free GNU on their 200 MIPS, 64M SPARCstation-5.

  123. Re:Implicit copying of a web page by borzwazie · · Score: 2
    I don't think copying of the page is what's at question here. What this is saying is: "We want to be the sole distibution point of this information." Is this legitimate?

    I would argue that it is a point. Case in point: I work for a standards institution. We develop many of the standards that are used to develop products that you use every day. The development of a standard is not a trivial process, and is a committee affair which lasts some time. The actual proceedings of the committee leading up the release of the standard are destroyed, so as to limit liability.

    Once this standard is released, our organization controls the delivery of these standards. We don't allow people to link to these, since you may only view a standard in its entirety. Why do we do this?

    Again, more liability. A standard MUST be distributed with ALL relevant information. If we were to distribute part of a standard on high-strength aircraft bolts, but neglected to ensure that the reader of the standard read the section on post-heat-treatment, and the bolt design failed in use, then we, as the standards body are liable. Standards bodies are usually not-for-profit agencies, so we can't afford lawsuits.

    Additionally, these standards are copyrighted. Our members purchase the right to view any or all of our standards. It costs LOTS of money to develop a standard. Do we, as an organization, have the right to tell our members, "Use of this information may not be distributed to any other person or organization. Only the purchasing member has rights to this information." I think so. It's like game piracy. Do I, just because I bought some game, have the right to make copies of the game to pass out to all my friends? No. Can I make copies for myself? Sure. But distribution is illegal, and I think that this point is the substance of the argument.

    Do I believe linking is bad? Hell no. The web would not exist without it. Do I believe web sites have the right to demand that you go through them to get the information which THEY cataloged, which THEY invested in, which THEY made available as a service to those who subscribe to their services? Wholeheartedly.

    As an example, should it be legal for me, as a subscriber to some pr0n service, to start my own pr0n site which merely links to the site I belong to? No. How about this: I'm a student at a university. I have a T1 line into my room as a perk of living on-campus. Can I resell that service to others? Not likely. Should I be allowed? You decide.

    --

    "We apologize for the inconvenience."

  124. its not about deep linking ... by alamut · · Score: 2

    ... its about unfair business practices. one of which is deep linking into ticketmaster as a part of tickets.com business process.

    1. Re:its not about deep linking ... by ebh · · Score: 4
      Ticketmaster would also like to become the sole legal source for all tickets to anywhere.

      That's it right there. Ticketmaster does have a virtual monopoly on tickets to events it advertises. Since you have to go through them to get your tickets, they want to leverage that to force you down a clickstream that exposes you to as much paid advertising as possible.

      OTOH, sites like Amazon, which don't have monopolies on the products they sell don't seem to be making any noise over this issue. Why? Because deep-linking gets you to buy from them instead of surfing your way to someplace else.

      With Ticketmaster there is no someplace else, so that's no help to them.

  125. Quashing Deep linking.... by tcd004 · · Score: 2
    would destory the web as a researchers medium. We'd all be relegated to using pay-per use research outlets like Lexus-nexus.

    tcd004
    LostBrain

  126. Extremely Inappropriate by Yardley · · Score: 2

    CmdrTaco should have said 'UPDATE:' before the stuff about the Japanese article. That being said, that article disturbed me greatly. One line in particular.

    The personal opinion of this journalist is that the judge has made an extremely appropriate decision

    The reporter calls himself a journalist and then goes on to editorialize. I truly hope that this is not (though expect it is) a normal part of Japanese journalism. It is completely inappropriate to both act as though you are reporting the facts and then to go on to proffer a very specific opinion as to your view on the facts while at the same time calling yourself a journalist. Such behavior is completely inappropriate and exceptionally unprofessional.

    Of course, the journalist is wrong. Which is perfectly appropriate for me to point out since I am not purporting to report news events but rather candidly giving my opinion on a message board.

    When someone places a link on their web page to another webpage (or any other Internet content outside of their own site), that person has no way of knowing or controlling what exactly it is they are linking to. You may say that they should know because they have placed the link on the page, but at the very instant in which a person places a link on their own page, the content to which the link connects to can change. The change can be (and usually is) without any knowledge on the part of the linker.

    In this particular case, the content linked to had NOT been deemed illegal by court proceedings at the time the link was made, but only after.

    The logical conclusion this Japanese court would want us to believe is that whenever you place an HTML link onto your webpage you are responsible ever afterwards for what it links to whether or not you know it has been changed or deemed illegal. This is absurd.

    Specific to the claim that "the defendant undeniably increased the number of ways of accessing pornographic sites" and has thus run afoul of the law: the defendant has not increased the number of ways of accessing the offending material at all. There is one way to access it (so far as we know), the one URL which links to it. Publishing of the URL does NOT increase the number of ways to access. That remains but one.

    --

    --
    He lives in a world where those who do not run the client software of the omnipresent meme are unacceptable.
  127. Re:deep linking to non-html by locutus074 · · Score: 2
    I think that what he was referring to didn't have anything to do with copyrights.
    Picture this: You run a server at foo.com. That guy over at bar.net has an image you want to use, and it's on one of his servers. So what you do, instead of copying it to your server and linking via <img src="http://images.foo.com/graphics/picture.jpg">, you link it with <img src="http://www.bar.net/graphics/images/picture.jp g">. Of course, this means that you don't use your own bandwidth to serve it up, but the other guy's bandwidth instead.
    Side note: I head a story about a webmaster who was moving around some directories on her server, and started getting 404s in her logs. She discovered that some other site was linking images off of her server in-line on their pages. The graphics were just bullet buttons, but it still pissed her off. She wound up creating some new graphics with the same names as the old ones, and in the same location, only the new ones contained text such as "We are lame" and "We are such losers" and stuff like that. :)

    --

    --

    --
    We have fought the AC's, and they have won.

  128. Not prejudice by www.sorehands.com · · Score: 2
    It's a lead in. Just like headlines, they are meant to generate further interest.

    The case involves both issues.

  129. Huh??? by www.sorehands.com · · Score: 2
    Well, you can persue both. There is a way around each method.

    Huh?? Why would Ticketmaster want to stop anyone from buying with them.

  130. Spiders, copyright, and resumes by www.sorehands.com · · Score: 2
    Some spiders are not at odds with copyright.

    Some spiders will make an analysis of a page (and maybe generate a derivative work. Some will make copies.

    A search engine (or the CyberPatrol spider) may read the page, checking keywords, and building an index or value table of sorts.

    Other spiders will just copy. I had my resume, even though copyrighted and containing:

    Note to recruiters: Do not send requests for more information! I would be interested in valid, open job requests. That means that you may send me information about an actual job opening to see if I would be interested in that job.

    Any general recruiting requests will be treated as SPAM! It is not welcome. This is not an invitation for resume or job bank or any other services.

    A recruiting company put this a database which they sell access for. Then I started getting spam from that company. Their spider made copies and stored in a database, making a copy.

    I suspect that a legal delineation will be made. The type a spider for building an index, or for analysis will be allowed, but the other that just makes copies wil be tightly restricted.

    1. Re:Spiders, copyright, and resumes by www.sorehands.com · · Score: 2
      There is a difference from making a copy on your local hard drive and building a database which access is sold.

      When you view a page, it is usually copied into a cache, but you don't sell copies of the cache.

  131. Can't work! by www.sorehands.com · · Score: 2
    You are forgetting something.

    CyberPatrol does not link to blocked sites!. CyberPatrol checks your site against the list to see if it's been rated as bad.

    If your site is on the CyberNot list, and it should not be, then you should file a lawsuit against Mattel. Having thousands of such lawsuits may give them same feeling that they give people when they file their abusive lawsuits. But this would be legititmate

  132. Biting your nose. by www.sorehands.com · · Score: 2
    It's true that ticket Master will not make as much money, not getting hits on the pages with banner ads. But....

    Ticketmaster makes money from selling tickets. They may not make as much money, but if they lose the sale, they lose more money.

    What they can do, is on the response from the sale, reditrect them to the Ticketmaster homepage and tell them, they are better using Ticketmaster.com, not tickets.com. Or something like that.

  133. How to make your site navigatable, but unlinkable by jerryasher · · Score: 2

    I don't think that asking people to not deep link will lead to death of the web. Why are people offended by being prevented from deep linking, but not offended by sites that require registration, or subscriptions? Look at what's happening at Slate and The Street and let the consumer decide.

    That said, if you want to make your site navigatable but unlinkable, why not:

    Determine which pages can be linked to from outside. Call this class O.

    Determine which pages can only be linked to from inside. Call this class I.

    Have every page set the cookie to their class: I or O.

    If you are asked for an I page, don't return it unless the cookie says the person came from an I page. Otherwise redirect to the homepage.

  134. thoughts from a bot builder. by Pinball+Wizard · · Score: 2
    Here is ticketmaster's robots.txt directive:

    User-agent: *
    Disallow: /

    Now, one of the first things I learned when I started building robots was to build bots that played nice and that respected this file. Obviously they don't want anyone indexing their site. They may not realize that they are shooting themselves in the foot by doing this.(a site that doesn't want bots is turning away vast amounts of potential visitors-are you reading Ticketmaster?) However, I personally will always write bots that obey the robots.txt file.

    Its fine by me if a site does not want to be indexed. I will always relish the story of the store that asked mysimon to stop indexing their site, only to beg them to list them again after experiencing a significant drop in traffic.

    Law or no law, a site that refuses bots will experience the opposite of the slashdot effect. I can hear the wind rustling through the ghost town of ticketmaster.com already.

    --

    No, Thursday's out. How about never - is never good for you?

  135. deep linking to non-html by issachar · · Score: 2

    this may be a bit of a stupid question, but does this mean it's legal for me to link to someone's non-html content.

    Say their webpage contains graphics and I want to use those (assuming they're public domain), but I don't want to put it on my server. (I realise there are technical ways to stop me and it would make be a cheap bastard to boot, but that's not the point)

    Is this legal now? (I don't think it should be)

    --
    . --- If you're looking for free e-mail you won't find it here! http://www.noemailhere.com
  136. Re:I happen to think.... by sjames · · Score: 3

    A good example agains this is internal coporate information. Putting this on the web reaps the benefits of being easily available to the employees of the company, while not being public information.

    That's what .htaccess is for. Otherwise, it's like putting the private information in a folder under the doormat and hoping nobody will stumble over it. The 'normal assumption' for info on the web is that it's public. Requireing auth is a perfectly legitimate way to indicate information that is NOT public.

  137. Re:I happen to think.... by sjames · · Score: 3

    to conduct business via the web does NOT make it ok to deep link to my bank account information. Neither is it ok to deep link to sites that provide content on a fee paid basis.

    The link itself should be a non-issue. Other sites are perfectly free to deep link into my account info, as long as the bank server replys with "You are not authorized to view this content" or some such. A site stealing the user/pass for the info and using that to get to the data is another matter.

    Web servers are like a business establiushment where if the door is unlocked, there is implied permission for the public to enter.

    I understand that some sites get revenue from advertising, and they are free to do that. They are perfectly free to have their server refuse the request if the referred_by is from an outside site. (Or be more creative and send a redirect to their index page). If Ticketmaster had any sense that's what they would have done. The whole lawsuit could have been avoided for $60 - $200 worth of man hours. I'll bet it cost more than that just to ask their lawyer "Can they do that?" As a side benefit, their competition would have ended up with egg all over it's face. (priceless)

  138. Similarity to "piracy"? by Pahroza · · Score: 3

    In addition, Ticketmaster contended that deep linking interfered with its economic relationships with advertisers, who paid handsomely to advertise on the site's home page. Finally, the company said that Tickets.com was guilty of "passing off" and "reverse passing off" -- forms of unfair competition -- because consumers might confusingly conclude that Ticketmaster and Tickets.com were connected in ways detrimental to Ticketmaster and beneficial to Tickets.com

    This paragraph of the article seems to me somewhat like the software industry's claims of damages resulting from piracy. Given that certain people would never have purchased a product due to various factors, simply downloading a pirated version doesn't really cost them any money.

    I understand that ads generate revenue and that ticketmaster would be upset by people bypassing ads. However, the "offending" deep linking still takes the user to a page containing a banner, and ticketmaster will still receive a service charge, so what are they complaining about? Perhaps they never would have made that sale had the user not gone through the site doing the deep linking.

    Just a thought...

  139. Re:Deep Linking, to and other files by Quack1701 · · Score: 3

    I once had someone link their ebay auction to a picture on my server (without my permession.) What I did was wait until he had one bid (so he couldn't modicfy the auction) and then replaced the picture with some pornography. You'd be supprised at how many hits he started to get!

    In retrospect, I spamed my server more by changeing the picture, but I think it was worth it.

    If your afraid someone is deep linking your site, and you don't like it, just change your links. It's not that hard. And if the Japanese ruling holds any water, you may be able to get them into legal trouble depending on what part of they world they/you are from. *smile*

    Quack

  140. Seeing a contract != agreeing to it by xant · · Score: 3
    And although he dismissed the breach of contract claim, he granted Ticketmaster permission to file an amended complaint with facts showing that its "terms and conditions" created an enforceable contract, seen and agreed to by Tickets.com.

    This has come up before, and there is a strong argument against contracts that you agree to by having seen them. Could I create a contract that said:

    By viewing this text you agree to be bound by the terms and conditions of this contract. This contract stipulates that you may not view this post with moderator points remaining without moderating the post up 1 point.
    Well, you'd better hope not. Ticketmaster.com is saying they had contracts on display on the site, and that by using the site you're agreeing to those contracts. Sure . . . and hey, when I change the non-read-only license agreement on Sun's software download pages to "I 0wn j00 Sun Software", that creates a legally binding contract too . . .
    --
    It's rare that you're presented with a knob whose only two positions are Make History and Flee Your Glorious Destiny.
  141. Implicit copying of a web page by tjwhaynes · · Score: 3

    In the /. intro, it says

    I think people should be able to say, "Please don't spider this page" (robots.txt for example, but it gets stickier with copyrighted content) but I don't think anyone should ever be able to say, "You may not link this page" since that is fundamentally the anti-point of the Web.

    I agree. The very nature of the web would suggest that the act of accessing a web page was making a copy of it. Therefore it is difficult to see how anyone could say "You may not copy this page" because by the time you see this message you have already made a copy. Now - can this argument be extended to making links to a web page? If you consider web pages as a broadcast, rather than a published work, then I see no problems with unaltered content being mirrored, as this is merely an extension of the broadcast route. Mirrors and partial mirrors may prove less obvious. If in the process of making a partial copy you imply something derogatory or contrary to the original by changing the context of the page, then this is covered by libel or slander laws anyway if the case is sufficiently serious. Not to say that this doesn't happen already in the newsprint media - quotes are truncated and put out of context all over the place. Of course, the waters are further muddied with trademarks and other such concerns, but I don't believe that changes the base rules. It would be a sad day if the courts stopped linking to other sites - it wouldn't be a web anymore.

    Cheers,

    Toby Haynes

    --
    Anything I post is strictly my own thoughts and doesn't necessarily have anything to do with the opinions of IBM.
  142. When its public, its public... by HiyaPower · · Score: 3
    If you desire to restrict the linking of content to a site, you do what the NYT did, have a login to the site. This keeps out spiders, bots, and other sorts of randoms (not really, but it at least declares the intent to do so), while allowing access to their content. A link past the front door on such a page is dubious.

    However, if you do not impose such a lion at the gate, then you have declared your urls to be available to the web by whatever means folks choose. Its like saying that I can't buy a copy of a newspaper and post an interior page on a (paper and tack) bulliten board. Gee, so many of my book marks are "deep links". Is my bookmark file illegal? O well... When you can 't win by a legit means, litigate...

    Good manners sez you don't link into someones site as part of your content (as opposed to a link to send them there), without some "By your leave". Now if we are going to legislate some manners, I have some manners employed by drivers near the "Big Pig" in Boston that I would like to have included ;-)

  143. Re:The funny thing about deep linking by dattaway · · Score: 4

    So, I'd imagine this handy little trick would work in /etc/hosts

    208.48.26.217 www.nytimes.com

    which means whenever you look up www.nytimes.com, you get partners.nytimes.com instead.

  144. linking. by Signal+11 · · Score: 4
    NY times runs an article on deep linking... which requires registration to view. Anyone else find this ironic?

    Maybe the solution all these up-tight corporate sites (like the NY Times) will be to make even more obnoxious use of cookies, http referer values and more invasive authentication to "protect us".

    Well.. better login to slashdot so I can post this...

  145. Do not link to this page ... by |DaBuzz| · · Score: 4

    but I don't think anyone should ever be able to say, "You may not link this page" since that is fundamentally the anti-point of the Web.

    So let's look at an example:

    Joe Developer has a good idea but not a lot of money, he hosts his site with information about his project on a $9.99/month hosting plan where he gets 200MB of transfer a month.

    Someone submits Joe Developer's page to slashdot because it's a valid "news for nerds" item and Joe gets slashdotted. Within minutes, Joe contacts Taco saying "Don't link to my page!" ... does Taco take it down?

    Let's say he practices what he preaches (see quote above) and says "You can't tell me not to link it!" and leaves the link up. Joe Developer gets 500,000 hits in one day and goes over his allotted bandwidth by 20GB. At $5 per 10MB over his allotment, he now owes his hosting company a $1000 in overage fees and Joe Developer removes his site and goes bankrupt due to the "fundamental point of the web." (Note: I think Taco would remove such a link under these circumstances, this is just an example of why one size does not fit all.)

    So as you can see, there ARE cases where someone should have the right to say DON'T LINK TO THIS PAGE. While much of the web is built to get traffic, some pages are not meant to be slashdotted for a number of different reasons.

    So while I agree that a site that invites enormous amounts of eyeballs shouldn't deny linking (i.e. NYTimes, CNN, etc.), sites that do not aspire to get traffic should be allowed to control how they are linked.

    Now, if only Apache would deny based on referrer like the old NCSA servers did. *sigh*

  146. I happen to think.... by Ron+Harwood · · Score: 4

    ...that if you put something up on the web, you've made it publicly available for people to link to.

    What's the big deal anyway? If you put enough of a header/footer on your page that identifies the site, and show's links to other content (Say like ZD) then people will go to the stuff that interest them, and you've gained a reader from the "deep linking"...

    If you don't want people to link directly, protect your articles/materials behind some CGI that at least makes it more difficult.

    Just my three (Canadian) cents (that add up to 2 US cents).

  147. The funny thing about deep linking by wowbagger · · Score: 5

    Is that it can be applied to the NY Times as well.
    For example, to bypass the login for this article use
    http:// partners.nytimes.com/library/tech/yr/mo/cyber/cybe rlaw/07law.html. In other words, change the www to partners.