Lawsuit Stops Headline Scraping
Stephen Larson alerts us to the out-of-court settlement of Gatehouse v NY Times, a lawsuit that attempted to stop the Boston Globe from linking to headlines and excerpting initial sentences from a competitor's Web site. At issue was the Globe's practice — barely distinguishable from those of Google News, Yahoo, and others — of linking to another news source's coverage of local news. The upshot is that the Boston Globe will stop the linking. No judicial precedent was set, because the case was settled before reaching a judge.
Stephen Larson alerts us to the out-of-court settlement of Gatehouse v NY Times, a lawsuit that attempted to stop the Boston Globe from linking to headlines and excerpting initial sentences from a competitor's Web site. Read more here.
"wahts woring iwth my tyoping?"
Google News
Virtual Betting on Facebook for non-geeks.
Since links are so fundamental to the web, wouldn't it be easier if they just GTF off the internet rather than bother with these lawsuits?
Deeplinking and "stealing" your stories may hurt you int the short term financially. But - let's face it - the real reason of operating a newspaper or site is to make your audience see the world through your goggles. The more your opinionated news are linked or copied in one, the more influence you have on other people's thoughts, decisions etc.
Yes I'm that cynical (in the case of the news industry at least).
- Does it work in law to say "He's doing it too"?
- Isn't it "unconstitutional" (illegal) to have a law that applies to lots and is only enforced on some?
He may be dead, but his spirit will live forever.
FTA, it sounds like Gatehouse see this as a copyright violation but, as several other posters have pointed out, the same thing goes on on news aggregator sites all the time. In fact most stories on Slashdot contain snippets from other sites. It's an unavoidable and very useful facet of the web
This is yet another example of 'old' media not really understanding online practices. Most sites benefit tremendously from others linking to them - look at what happens with Slashdot. That is, unless the 'benefit' is so great that their server turns to dust.
Linking to other media sites is a common feature of many news sites. BBC News has links to other site's reporting for stories. It's just a headline and link, nothing special.
That link boosts the other site's search rankings, and every click-through is a reader that they didn't have before, and an ad-hit, and maybe a repeat visitor.
Taking the headline and the entire article is a different issue altogether, but I don't think that is the situation in this case. It is like all the Belgian (?) newspapers that want to have zero online presence or searchability. It makes no sense! You either participate, or you fade away on the fringes. That's why there is a "web" in "world wide web". Why be a bit of gossamer drifting on the wind when you can be in the web and actually be useful?
ok-geez-so-i'll-stpop-sending-you-traffic
*whoosh*
The ActionStreams plugin for Movable Type technically does something similar. It accesses public web sites for information about your other accounts and aggregates them on your blog. I wonder if this ruling would have affected that if the Globe had lost.
...explicitly allowing anyone to link to any internet page, using the original title or any title of their own creation.
Comment removed based on user account deletion
As it was settled outside the lawsuit, the lawsuit settled nothing. Also no precedent, so this is actually bad news.
Now we still do not know what is and what is not legal. A complete lawsuit would have been better, be it for or against linking.
Don't fight for your country, if your country does not fight for you.
I read about several days ago when this lawsuit was filed. If the paper wanted to stop deep linking (they say they loose revenue if they miss the advertising from navigating from the front page to the deeper layer), can't they just look at the referrer (how do you spell that?) and send sites to the front page? And their other complaint was the short one sentence description of what the article was about. They claimed that many people got all the news they needed from that one sentence, so the newspaper lost advertising dollars. But if they are concerned about this why don't they have a robots.txt file to keep indexing sites away? If you are going to allow indexing then how can you not allow short descriptions?
Copyright law should serve the better interests of the public. The idea of copyright law was to preserve the intellectual property of the author so they could earn enough money on their works to keep writing. In recent years this has been totally lost. I think the current copyright law covers up to 95 years after death, and there are people who want to make copyrights never expire. This does not serve the interests of the public. (But it does serve the interests of corporations, who own the politicians who write the laws.)
FTA:
GateHouse had claimed Boston.comâ(TM)s actions violated copyright and trademark laws. Boston.com provided links that sent readers directly to âoeWicked Localâ stories, meaning readers bypassed ads posted on GateHouse home pages, GateHouse claimed.
I am just wondering who paid money to close this case and avoid precedence. Google? Yahoo? Slashdot? ;D
Comment removed based on user account deletion
Creating newspaper headlines seems like the only job where creating puns and word play is a requirement. I almost feel sorry for interviewers. But are you really getting anything from scraping a headline?
"PATERSON LYIN' KING OF STATE"
"LOON FLIES COOP"
"PETT-Y CASH"
And these were just todays. Bonus point for guessing the actual story.
the real reason of operating a newspaper or site is to make your audience see the world through your goggles.
No it isn't. The real reason is to make money. If your competitor is stealing your work and using it for their own financial gain, I think you have a right to be pissed off and sue.
You give the media too much credit -- its motives are surprisingly shallow. It doesn't really care what you think, you are free to agree or disagree, as long as you are reading/watching/listening, and of course, paying attention to those wonderful advertisers who make the whole thing possible.
Which doesn't make it any less of a copyright violation. "Him too" is not a defence in law.
Actually Laches could be a defense. If the plaintiff did not sue other entities that engaged in this practice and then the defendant on seeing that the plaintiff didn't sue also engaged in that practice but the plaintiff suddenly decided to sue the plaintiff but not the other entities, then the defense could claim a laches defense.
(That is in theory, however the facts of this case probably don't support laches because (1) google/yahoo/etc are not competing with the newspaper but the other newspaper is thus it is a slightly different act and (2) laches requires that the defendant to suffer some harm from the "trick" of not suing for a long time and then suing.)
Regardless of the above they might still have a defense under fair use or at least be able to modify their reporting to make it fair use. Regarding the four tests for fair use, they will most certainly lose on the first test (commercial nature), but on the second (nature of work: news/facts), third (amount: one sentence) and fourth (commercial impact: more viewers on page thus more advertisement revenue) tests they could win.
Unlike Google News, the Boston Globe is, itself, a news-reporting organization. Mixing their own stories with those from competitors can lead to confusion. I didn't manage to see the offending page before they took down their linked stories; but I imagine it was done in such a way as to have the original source difficult to identify.
A pure aggregator service, like Google News, is different because it is rather obvious that ALL it is doing is aggregating. There is no 'new reporting' being done by "Google".
Another non-functioning site was "uncertainty.microsoft.com."
The purpose of that site was not known.
I work for the Democrat and Chronicle, and Gatehouse scrapes our headlines out of our RSS feed for their right-hand sidebar's "Regional News" block.
Bloody hypocrites.
when it gets to the point nobody can "legally" link to you without paying a "fee" .. let em languish in internet hell and die - there's other places to get the same info .. when will they learn?
may stop the headline scraping.
I'm not a coward by any name.
Is this only applicable to scrapers or could it be applied to news compilation sites like commondreams.org and antiwar.com?
The lawsuit didn't "stop headline scraping". The two parties agreed that one would stop scraping the other. It has no wider implications than that. The next company on the receiving end of a lawsuit may choose to fight. Even then, a concrete precedent may not be set, since there are shades of gray between quoting, citing, scraping, etc. In any case, your headline sucked.