Web Caching: Google vs. The New York Times

Free registration by Zog+The+Undeniable · 2003-07-13 22:00 · Score: 5, Insightful

I'd love to see their user database, just to count the number of Mickey Mice and Elmer Fudds on there. Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?

--
When I am king, you will be first against the wall.

Re:Free registration by Anonymous Coward · 2003-07-13 22:06 · Score: 0

free registration sucks.. news should be freely available for all. not just as long as you dont mind viagra and p3n1s extension spam...
Re:Free registration by presroi · 2003-07-13 22:06 · Score: 5, Insightful

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper. Not just for New Yorkers but also for people from Sweden, Japan or New Jersey.

Where would the the limit? How would you feel if you had to register for every web page which is linked to at /. (I confess, I usually click on every /.-story link)?

hmm, to answer your question:
maybe the point in registration is the signing of a contract how to use this contact. Dunno.
Re:Free registration by whm · 2003-07-13 22:06 · Score: 5, Informative

Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?

User tracking. While cookies can do this loosely, requiring a login does this much more effectively. I know I login with my same username each time I visit the site (if it's not cached). There's very little reason not to. This gives the NYT a much better indication of how many active and repeat members they have visitting their site. They can then target ads to users much more effectively, and market their userbase to advertisers much more solidly than they could with more rudimentary user tracking methods.

There may be other purposes, but this seems like a large part of it.
Re:Free registration by Anonymous Coward · 2003-07-13 22:10 · Score: 3, Funny

It's not only spam - they want to be able to build profiles of their users for marketing purposes. They've got my e-mail address. In exchange, their database has got a 98 year old woman who lives in Albania with a PhD, no job and an income of less than a thousand bucks a year. Twice. Once for my work e-mail address. And once for home.
Re:Free registration by cobbaut · 2003-07-13 22:34 · Score: 5, Interesting

Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?

I always use a different address to register online in the form of website@mydomain.
I registered with the NYT in 1999, I never received a single spam on this address.

--
European Linux user, living in Antwerp
Re:Free registration by Anonymous Coward · 2003-07-13 22:41 · Score: 5, Insightful

And on top of everything else, it annoys users more than just about anything else aside from spam. Can't recall exactly how many other people I know who go to see a NYT article, find the rego page, and ignore it to go find a better news source without the hassle.

If they're tracking what their users are do, they're affecting their user pool in a pretty negative way just by using this method.
Re:Free registration by Troed · 2003-07-13 22:41 · Score: 1

I do the same, and it's rare to get spam to those adresses. Most spam I get are to my _actual_ Email-adress which has me a bit confused.

--
it's in my head
Re:Free registration by digitalunity · 2003-07-13 22:41 · Score: 5, Funny

(I confess, I usually click on every /.-story link)

This is *Slashdot*. We don't read articles. Please, either read the article or post a comment; you cannot do both.

--
You can't legislate goodness. Let each to his own destiny, by will of his freely made choices.
Re:Free registration by Anonymous Coward · 2003-07-13 22:47 · Score: 0

Good, now they have the connection between your home and your work address! Fear!
Re:Free registration by MartinB · 2003-07-13 22:53 · Score: 2, Insightful

Their database has got a 98 year old woman who lives in Albania with a PhD, no job and an income of less than a thousand bucks a year.

And you wonder why you get ads that have absolutely no interest for you? And why advertisers have to shout lounder and louder to get through a mass of untargeted ads?

Advertisers would far rather spend less by buying fewer, smaller ad slots that are targeted accurately. Much better return on their spend. Like the guru said I know half my advertising is wasted. I wish I knew which half.

--
The only thing you can accurately describe as "Scotch" is a sticky tape made by 3M. And it's
Re:Free registration by JanneM · 2003-07-13 23:05 · Score: 5, Funny

You gave them an actual, working, email address? How ...quaint.

Me, I'm a 66 year old single woman with no income, no education, and lives in a nonexistant Swedish town with a very rude (in Swedish) name. I figured that any site advertiser that want's to target this person must be desperate enough that their ads may actually be amusing.

--
Trust the Computer. The Computer is your friend.
Re:Free registration by btlzu2 · 2003-07-13 23:12 · Score: 1, Informative

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper.
Ermmm...not really

--
Zed's dead baby. Zed's dead.
Re:Free registration by presroi · 2003-07-13 23:17 · Score: 0

The way the editors *finally* dealt with this scandal is just a sign of professionalism.
Re:Free registration by Zigg · 2003-07-13 23:31 · Score: 3, Funny

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper.

No, sorry, I can't agree with that. Well, maybe interesting.
Re:Free registration by StarFace · 2003-07-13 23:57 · Score: 4, Funny

You say these things as if they are good things. Man, I don't want some newspapaper tracking everything that I read, so that they may serve me custom tailored advertisements! Although, it would be nice if the system actually was intelligent, it would eventually discern that I loath the advertisment philosophy, and stop sending them altogether -- ha.

--
V
Re:Free registration by yelvington · 2003-07-14 00:51 · Score: 4, Informative

NYT doesn't spam. And the percentage of net.morons who register using cartoon names is remarkably low.

I don't work for the New York Times, but for another media company, and I'm in a position to understand the reasons for registration:

1. Metrics. Registration supports the generation of accurate data on demographics and usage (reach, frequency) in a crosstabulated view. This is important in analytical processes to support site management and design as well as in the sale of advertising, which provides the revenue that makes the site possible.

2. Ad targeting. Run-of-site, untargeted Internet advertising is nearly worthless on the open market (supply/demand), but advertising that is highly targeted remains highly valuable. When combined with proper analytical software and usage data, registration data can -- for example -- let me target 25- to 34-year-olds in a particular ZIP code who have been looking at real estate listings. And I can deliver that advertising anywhere on my site, such as on sports pages that otherwise would contain "junk" ad inventory. This is (measurably!) much more efficient and effective, and I can charge fairly high CPM prices. Importantly, this can be accomplished without providing any personal data to the advertiser, protecting the anonymity of the user.

3. Reduction in traffic. Reduction is actually desirable in many cases. Not all customers are good customers, and not all traffic is good traffic.

On the Google issue: I used robots.txt to block Google from indexing the AP content on our 27 newspaper sites, because I have no desire to be the unpaid provider of wire stories for Google News so that they can be read by users outside our markets. Additionally, I have used a router block to prevent several commercial Web clipping services from having access of any sort to any of our sites.
Re:Free registration by Anonymous Coward · 2003-07-14 00:54 · Score: 2, Insightful

Maybe we can agree that the NYT is a well-written, serious

bullshit. Were you paying attention last month when they had to correct all the Jayson Blair stories, or acknowledge that he copied from other sources. Don't say it was one bad apple either. They haven't apologized for front-page stories claiming Iraq was a quagmire after 1 week of invading.
Re:Free registration by cocotoni · 2003-07-14 01:00 · Score: 1

And just what would they expect from an AC? ;->
Re:Free registration by Anonymous Coward · 2003-07-14 01:06 · Score: 0

...I used robots.txt to block Google from indexing the AP content on our 27 newspaper sites, because I have no desire to be the unpaid provider of wire stories for Google News so that they can be read by users outside our markets.

Actually, that's a good point. It doesn't seem google actually PRODUCES any sort of content on their own.
Re:Free registration by pcmills · 2003-07-14 01:06 · Score: 1, Insightful

More of a cya than anything else.

--
Ask Slashdot - google for stupid people.
Re:Free registration by mrd_yaddayadda · 2003-07-14 01:25 · Score: 3, Insightful

You can count me as one of the people that ignore the NYT unless I can get a cached page. I get enough spam as it is...
Re:Free registration by hesiod · 2003-07-14 01:27 · Score: 2, Insightful

> They can then target ads to users much more effectively

How about they advertise according to the content of the article. If it's a tech article they show tech adverts. That's pretty simple, and something they generally don't do (and it wouldn't require logging in)
Re:Free registration by Anonymous Coward · 2003-07-14 01:30 · Score: 1, Insightful

So, regarding 1, 2, and 3 - the advantage to me as a consumer is what? Absolutely fsck all. Reading cached webpages is no different to picking up a used copy of the newspaper off a park bench. The 'paper can get free exposure to a potential new paying customer due to this, so they should actually be pretty grateful to Google (and the park bench).
Re:Free registration by FatAlb3rt · 2003-07-14 01:32 · Score: 4, Insightful

I disagree. Let's imagine for a minute that everyone provides an accurate profile, targeted marketing works, sales increase, and the advertiser gets rich.

You really think that the money they spend on advertising will level off?
Re:Free registration by Anonymous Coward · 2003-07-14 01:32 · Score: 0

I've done this and caught a credit card verification company redhanded in selling off my email address.

There's really no company you can trust online not to give away your email address. Ofcourse, EU-based sites have to comply with privacy regulation, but those limitations mostly don't apply to US sites, so...
Re:Free registration by NexusTw1n · 2003-07-14 01:33 · Score: 5, Insightful

I always find it ironic when people on slashdot complain about being "tracked" on NYTimes webpages or other sites that require registration.

Most people have registered to use /. , and have therefore provided a valid email address. So you can't have a moral objection to giving your email addy to websites you frequent.

Even if you don't register, your IP address is logged and monitored , via the sophisticated anti troll system. Try and post more than 10 times in one day as an AC, or post as an AC in reply to a post you modded and slashcode will react.

So even as an AC you aren't really totally anonymous on slashdot, yet I don't see anyone who complains about NY Times links complaining about that. The only people who complain are the trolls that forced these features to be added to the code.

So why do we have this tedious bitching about the NY times every time a link is posted?

I registered a couple of years ago. I've never recieved a single spam to NYTimes@mydomain.com which was the email addy I used. I've never had to login because the login cookie has remained in Opera since I registered. How hard is it login and then forget about it forever more?

The only reason I haven't forgotten I've registered is the continual complaints on slashdot from people who are obsessed with privacy on the net unless karma is involved. NY Times doesn't spam registered users, and any user tracking is less sophisticated than slashcode's vital anti troll features. So bear that in mind when tommorrow's NY Times story appears and the same old complaints are dragged out yet again.

--
It has become appallingly obvious that our technology has exceeded our humanity. --Albert Einstein
Re:Free registration by Anonymous Coward · 2003-07-14 01:35 · Score: 0

washingtonpost.com manages to track users just fine. and their "registration" page doesn't require your home address.

they ask for year of birth, zip, and gender
Re:Free registration by Anonymous Coward · 2003-07-14 01:36 · Score: 4, Funny

Stop impersonating my grandmother, you insensitive clod!
Re:Free registration by LilMikey · 2003-07-14 01:36 · Score: 3, Insightful

NYT does not let you access their content without logging in. That's nothing like Slashdot's system.

--
LilMikey.com... I'll stop doing it when you sto
Re:Free registration by endoboy · 2003-07-14 01:42 · Score: 3, Insightful

NYT does not let you access their content without logging in
and why should they? NYT spent real $$$ to develop that content, and are under no obligation to give it away.
Don't like it? Go someplace else.
Re:Free registration by D-Killer · 2003-07-14 01:44 · Score: 1

I do not work for a media company, but I do work in data intelligence and I am in a position to tell you that data lies.
NYT doesn't spam. And the percentage of net.morons who register using cartoon names is remarkably low.
Re:Free registration by NexusTw1n · 2003-07-14 01:51 · Score: 2, Interesting

Slashdot allows you to view the content but not post even anonymously without being tracked at IP address level.

The NY Times allows you to view the archive anonymously , and allows you to view the main page with a password you could google for easily.

So yeah, it's nothing like slashdot's system - NY Times intrudes far less into your privacy.

--
It has become appallingly obvious that our technology has exceeded our humanity. --Albert Einstein
Re:Free registration by cerberusss · 2003-07-14 01:52 · Score: 1

it would be nice if the system actually was intelligent
I'm not counting on that. Instead, I use Mozilla, rightclick on any ad and choose Block images from this server. Mozilla rules! :)

--
8 of 13 people found this answer helpful. Did you?
Re:Free registration by sdmartin101 · 2003-07-14 01:52 · Score: 1

There are ads at the New York Times? I thought they did away with those months ago. Oh wait... that's just when I started using adblock. ;-)
Re:Free registration by Anonymous Coward · 2003-07-14 01:55 · Score: 0

Instead of "mickeymouse@disneyland.com", I find it much more fun to go to the company's management page and choose names from that list and always check 'why yes, please sell my list to third party sites'.

For example, for NYTimes, their management page is http://www.nytco.com/company-executives.html, where you can find lots of good names to regester as. If you guess right, they'll see how much junk mail they send.
Re:Free registration by jellomizer · 2003-07-14 01:56 · Score: 1

Well in the real world we have this thing called "expenses". And to offset these "expenses" we need to make this thing called "Revenue" which on Web Sites can be either by Pay for Content or Product and Services or Advertisement. Companies don't give away things for free unless it can give them reward in the future.
So you are going to be stuck with advertisements anyways you might as well get them for stuff that you might buy. If you ignore them fine its your choice. I do believe in privacy but to much privacy sometimes prevents companies from adapting to what the customer wants. So in this case you can get more articles that you want to read on top of advertisements of products that you may looking for.

--
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
Re:Free registration by keithdowsett · 2003-07-14 02:14 · Score: 2, Interesting

I'm continually surprised by how often these free registration programs will accept root@localhost or abuse@localhost in the e-mail field.

This has the pleasing consequence that it unless they employ someone to vet the list they are likely to end up spamming themselves or their provider. Both much more amusing than sending it to a completely fake address.

Naturally, the rest of these forms must be treated as an exercise in creativity - and we should give our creations suitable names. My favourites include Hugh Jorgens and Tess Tickle.

So, treat these forms like you treat the religious nuts who arrive on your doorstep preaching salvation - as a source of amusement.
Re:Free registration by nelsonal · 2003-07-14 02:16 · Score: 1

I registered in 1997 and haven't gottent anything from them besides a confirmation email, and headline mails that I tried but didn't like. If someone offers you a free paper take it, even if they want your name first. The WSJ has been requiring paid subs to their website since they started.

--
Degaussing scares the bad magnetism out of the monitor and fills it with good karma.
Re:Free registration by svenskakocken · 2003-07-14 02:17 · Score: 1

That nonexistant Swedish town wouldn't be Gävle, would it? I used to live in that quaint town, for about 6 months. Maybe their target add would be a vaction away from there!?
Re:Free registration by kien · 2003-07-14 02:18 · Score: 1

And why advertisers have to shout lounder and louder to get through a mass of untargeted ads?

I would prefer that they shut up and listen to me rather than attempt to shout over each other. My money goes to the companies that annoy me the least.

That being said, I don't really have much of a problem with NYT registration...but as the grandparent post reveals, I doubt they're getting as much bang for their buck as they might think (or as their management team is likely being told by hordes of shouting advertising suits).

--K.

--
Sig: Bad people happen. Try to avoid being one of them.
Re:Free registration by JanneM · 2003-07-14 02:23 · Score: 1

Nope. I am Swedish, and am well aware of the existence of Gävle. I just entered an utterly made-up name in the field.

--
Trust the Computer. The Computer is your friend.
Re:Free registration by SlamMan · 2003-07-14 02:46 · Score: 1

No, but they might have ads the interest you. I'd much rather get ads for new processors or MP3 players than hair spray or vaginal ointments.

--
Mod point free since 2001
Re:Free registration by sirshannon · 2003-07-14 02:46 · Score: 1

I do the same thing. The only spam I've gotten because of that was traced to Honda.

--
The truth doesn't care what I think.
Re:Free registration by yelvington · 2003-07-14 02:48 · Score: 2, Interesting

So, regarding 1, 2, and 3 - the advantage to me as a consumer is what?

Relevancy of information. Think about it for a minute. Advertising that is not useful is noise. Advertising that is useful isn't noise. Targeting replaces the noise with utility.

If you're looking for a house, informative advertising from mortgage lenders (real lenders, not the scam artists who clog your email box) is useful. If you're hungry, you might find targeted pizza coupons from the pizza joint around the corner to be useful. And so forth.

You also might consider that making the Web site profitable ensures its survival, which ought to be an advantage to you, assuming that you care to use it.
Re:Free registration by jbottero · 2003-07-14 02:54 · Score: 0

Please. Out of the many, many, many reporters at NYT, one turns out to be a jerk, and for that you say the whole thing is a pile of shit? Based on your arguments, Slashdot must be run by the Devil Herself. And going on with that comparison, at least the NYT does not post the same story on the front page day after day after day....
Re:Free registration by Anonymous Coward · 2003-07-14 03:03 · Score: 0

Some points for honesty, few for consistancy.

We all agree the motive for taking the measures is profit. We expect the data blinding that protects the individual, but it not a sure thing. David Brin ("Transparent Society") suggests we, the collected, should expect the reciprocal transparency- we get to watch you, and see what you really do with the data. Fair Enough?

I found a vast divide between the sentiments in your posting, "I have no desire to be the unpaid provider of wire stories...", and the the content of your web page listed in your sig, where you harvest and display, in a very Google-like way, headlines and links to other wire services. Are you paying them?

It seems to me that the fruit of the Google negotiation with the NYT lies in the point you made; some customers are not good customers. A lot of folks will happily trade "news freshness" for privacy. Even if NYT lets those customers out of their direct add targeting, they may still bring value to the NYT. Cleaving off that readership will simply route them to other providers.
Re:Free registration by fucksl4shd0t · 2003-07-14 03:03 · Score: 3, Insightful

So you can't have a moral objection to giving your email addy to websites you frequent.
It's about trust, actually. Morality has nothing to do with it. I don't trust NYT not to sell my email address or anything like that. I *do* trust slashdot not, but if I ever catch them doing it, well, I just won't tell them it's changed recently. :)
There are quite a few sites that I frequent that I don't trust with personal information. Visiting a site frequently != trust.

--
Like what I said? You might like my music
Re:Free registration by Anonymous Coward · 2003-07-14 03:05 · Score: 0

Most people have registered to use /. , and have therefore provided a valid email address. So you can't have a moral objection to giving your email addy to websites you frequent.

Aye, matey, choice is the key!
Re:Free registration by Fjord · 2003-07-14 03:11 · Score: 1

I always register online with a different address, usually jl-sitename, so that I can track who has sold my information. I have never, in the 1 and a half years since I registered for the NYT, received a spam to the email address I provided them.

--
-no broken link
Re:Free registration by mysticgoat · 2003-07-14 03:26 · Score: 3, Informative
You've brought out some very good information in a well-written way. Thank you. I'll cover much of the same ground from the satisfied user's viewpoint.
1. NYT and spam: there is no relationship between these. That's my experience after years of subscription, and a number of other people on this thread report the same thing. The Yahoo portal news service is also good this way (and gives me Reuters: an excellent supplement to NYT).
2. The metrics thing: I provided NYT with true demographics when I signed up, because I know that will help them deliver product more efficiently and sell their advertising.
  I want that. I like the service NYT provides, and so I want them to succeed. I very much want them to continue to provide me with a free subscription-- and I'm willing to help them hold their costs down and maximize their advertising revenues.
3. Focused advertising: I don't like ads, but I'm willing to put up with their presence in exchange for a service like NYT.
  NYT has done a good job of keeping the impact of the ads low: the ads don't get in the way of reading the stories and they don't slow page loading significantly (since I'm on a slow rural dial-up, that's very important). If NYT starts to charge me, I'll be less tolerant of the ads. If the advertising starts slowing down the page loading, I'll drop my subscription. There are a number of other news services-- CNN, ABC, etc-- that I don't use because the advertising burden slows page loading or otherwise gets in the way.
  As to focused ads-- I'm all for that. I'd rather ignore stuff that's somewhat pertinent to my life than ignore crap I'd never buy. An ad for reading glasses is pertinent to me, but an ad for skateboards is crap-- I was long past skateboarding age before the first ones hit the street. Reading glasses are something me and my cohorts have to live with, and we talk about them. Nobody in my circle of friends has a skateboard and I don't recall ever talking about them. (Of course skateboards would be a problem for me and my neighbors: I don't think they do well on gravel and road apples.)
  And sometimes the advertising actually works-- sometimes it makes me aware of a product or company that I'll want to talk over with my buddies, and maybe try out. That is much more likely with focused ads. As I recall, my first awareness of the existence of fold-up reading glasses in a hard case (suitable for hiking, bicycling, and other hip pocket activities) was from an advertisement. Now I've got a couple of pairs of them. Neat.
About Google's archive, NYT, and slashdot: Something I hope NYT considers is that the Google archive gives it (and at least some of its ads) exposure in demographic groups that it would otherwise never reach. Such as the tinfoil hat superparanoid geek crowd. While there is no way to develop metrics on this, nor any way to market this to advertisers seeking targetted audiences, this exposure is certainly more beneficial than harmful. Besides, every once in a while somebody matures a little and puts away their tinfoil hat-- and then is a likely candidate for the kind of news service NYT provides.
So I think it would be very hard for NYT or Google to assess whether the Google cache is harmful or beneficial.
Re:Free registration by MadAhab · 2003-07-14 03:32 · Score: 1

It was one bad apple.
And I think anyone except perhaps Rummy can see that it's a tad early to state "Iraq is not a quagmire", never mind apologize for wondering if it is.
And I would think the primary responsibility for managing the cache issues falls on the NYTimes. When they've done their part, they can whine to Google, but not before.

--
Expanding a vast wasteland since 1996.
Re:Free registration by gantrep · 2003-07-14 03:38 · Score: 2

In short, yes. The fact that this one was allowed to slip by for so long unnoticed means the whole place is messed up, and they didn't have proper accountability for factual reporting.
Re:Free registration by Kallahar · 2003-07-14 03:40 · Score: 1

Trust.

Do you trust NYT?
Re:Free registration by Anonymous Coward · 2003-07-14 03:41 · Score: 0

I've never gotten any spam either. Because I guessed someone else's user name and password and use that to read stuff at the blue-haired lady. (not at all difficult to do, btw).

I do wonder what the NYT thinks of this "person" logging in from widely (I'm assuming) disparate parts of the country 5 minutes apart.
Re:Free registration by ryen · 2003-07-14 03:44 · Score: 1

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper. I think Jason Blair put an end to that. *long live free speech!* ;)
Re:Free registration by Geeky · 2003-07-14 03:47 · Score: 2

And I would think the primary responsibility for managing the cache issues falls on the NYTimes. When they've done their part, they can whine to Google, but not before.
I'm not so sure - why should the burden of responsibility fall on websites? It's like those checkboxes on forms (both web and real), that basically say "tick this or we'll spam you". Google are saying "add this to your html or we'll cache you". The principle is the same - inaction is taken as approval. And I don't suppose Google are offering to pick up the costs that the NY Times would incur changing their code? Google should at the very least have a "do not cache" list to which sites could be added at their own request - without having to change their sites.

--
Sigs are so 1990s. No way would I be seen dead with one.
Re:Free registration by Abel+Wingnut · 2003-07-14 03:47 · Score: 0, Troll

Literarily, the NYT is in the upper echelon of international syndications. Sadly, this is the only estimable quality of the NYT, and thus uses this to gloss over its horrid attempt to maintain journalistic indifference. I say this not just because of the recent wrongdoings, but mostly of the bias it has towards the Jewish community. In no way am I an Anti-Semite, but it perturbs me to see the drastic propensity towards the Jewish faith and almost complete contempt for the Arabic world. We all know they should strive to create a neutral, completely factual account of matters, but the editors are palpably belying their integrity with nearly every Middle Eastern account. Just my $.02
Re:Free registration by poot_rootbeer · 2003-07-14 04:03 · Score: 0, Flamebait

Can't recall exactly how many other people I know who go to see a NYT article, find the rego page, and ignore it to go find a better news source without the hassle.

A "better news source" than the New York Times?

Jayson Blair scandal aside, I defy you to find a better news source than the NYT in the United States.

If they're tracking what their users are do, they're affecting their user pool in a pretty negative way just by using this method.

They know what they're doing -- if the effect of registration was really that bad, they would have stopped years ago. You and your friends do not represent the average newyorktimes.com user.
Re:Free registration by Swamp · 2003-07-14 04:06 · Score: 1

Back in the days when I registered for slashdot you didn't have to provide an email address.
Re:Free registration by cshark · 2003-07-14 04:17 · Score: 1

I think someone said they could awhile back. And it puts the infrastructure in place for them to debut a paid service like business 2.0 is doing now as well.

--
This signature has Super Cow Powers
Re:Free registration by Rob+Riggs · 2003-07-14 04:20 · Score: 3, Informative

There is a significant difference to logging in to a site in order to participate in conversation and logging in to simply read news. At /., posting requires an identity, since anonymous postings are mostly ignored. However, there is absolutely no requirement that one log in to /. in order to read the stories. Your anology is broken. Privacy should be a choice. At /. one has that choice, with the NYT one does not.
Another point is that anonymity is one of /. greatest strengths. Some of the most insightful and interesting posts have been from "insiders" posting anonymously.
NY Times... user tracking is less sophisticated than slashcode's vital anti troll features.
Care to back this statement up?
...continual complaints on slashdot from people who are obsessed with privacy on the net unless karma is involved
You seem to be quite willing to give up those rights. And that's OK. But there are people here that feel that privacy is a rather important right. That should be respected as well. Enough people actually thought that privacy was a right of such importance that it is enumerated in the Universal Declaration of Human Rights (see Article 12).

--
the growth in cynicism and rebellion has not been without cause
Re:Free registration by bjparker · 2003-07-14 04:30 · Score: 1

and why should they? NYT spent real $$$ to develop that content, and are under no obligation to give it away.

Fair enough, but as a site which provides free articles, I think Slashdot should stop linking to NYT "strings-attached" content.
Re:Free registration by Anonymous Coward · 2003-07-14 04:35 · Score: 0

Most people have registered to use /. , and have therefore provided a valid email address.
Not me!
Re:Free registration by endoboy · 2003-07-14 04:36 · Score: 1

ahh... but then Taco'd have to do some actual work
Re:Free registration by Lord_Dweomer · 2003-07-14 04:39 · Score: 1

"And you wonder why you get ads that have absolutely no interest for you? And why advertisers have to shout lounder and louder to get through a mass of untargeted ads?"
Here's a tip for them then that will help them reach their target better AND save them money.
DON'T GIVE ME ADS!!
I will NEVER buy something that I see in an annoying ad online. I already ignore them all due to psychological conditioning. Now it just pisses me off because it eats up my bandwidth. Seriously, I understand the reasoning of "well, there might be some people who say they don't want the ads but might buy/click anyways". Honestly, I think the advertisers would make more just moving to opt-in for ads. Of course, the people hosting the ads want it to be the opposite because they make their money usually on volume of clickthru's and not necessarily on clickthru's that lead to sales.

--
Buy Steampunk Clothing Online!
Re:Free registration by poot_rootbeer · 2003-07-14 04:40 · Score: 1

Oh, and I invite anyone who disagrees with me to post a comment explaining why, rather than modding me down as Flamebait.

I don't feel that either "the New York Times is a premiere news service" or "NYT Digital knows the impact of registration on their user base better than Joe Slashdot" is a statement that should be considered rude or incendiary.
Re:Free registration by Zan+Zu+from+Eridu · 2003-07-14 04:48 · Score: 1

I trust everyone with my example.com addresses. Hell, the more stupid they are, the more I trust them to do the right thing.
Every registration system which requires an email address but lets the user chose his password without first sending an initial password to that address is stupid and asking for invalid addresses.
I just give them what they're asking for (sometimes even root@localhost works).
Re:Free registration by Becquerel · 2003-07-14 04:51 · Score: 1, Insightful

So why do we have this tedious bitching about the NY times every time a link is posted?

Because lets face it, in jokes (cowboyneal poll option,in soviet russia...,1.xyz 2.????? 3.profit, etc) are funny.

--
My spelling isn't bad, I'm evolving the language
Re:Free registration by ncc74656 · 2003-07-14 04:52 · Score: 1

Advertising that is not useful is noise.

s/that is not useful//

--
20 January 2017: the End of an Error.
Re:Free registration by deuist · 2003-07-14 04:53 · Score: 1

- Maybe we can agree that the NYT is a well-written, serious and interesting newspaper. Not just for New Yorkers but also for people from Sweden, Japan or New Jersey.
Speak for yourself. The New York Times is loaded with so many innacuracies that Slashdot looks like a well researched, well written news hub.
Re:Free registration by Becquerel · 2003-07-14 04:58 · Score: 1

In Soviet Russia Pravda logs in to you

--
My spelling isn't bad, I'm evolving the language
Re:Free registration by HBI · 2003-07-14 05:27 · Score: 2, Informative

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper.

I won't agree with that statement. I will agree it's a well-written, serious, interesting work of fiction.

To back that up, we can point at this, illustrating a bit how a reporter that falsified stories en masse (Jayson Blair) and a managing editor who tolerated same (Harold Raines) were kept on board because of a weird form of affirmative action (in the former case) and a personal friendship with the publisher (Arthur Sulzberger, Jr.) in the latter.

If you want more information on the Jayson Blair-authored stream of fictional articles appearing in the NYT's pages, just Google to your heart's content.

You can trust it as a newspaper again if you like, but i'm certainly not going to.

--
HBI's Law: Frequency of calling others Nazis is directly correlated with the likelihood of the accuser being Communist.
Re:Free registration by squiggleslash · 2003-07-14 06:10 · Score: 1

Wow, so I can post messsages anonymously to the NYT website, and do it without an IP address?
That's quite remarkable. What kind of technology are they using to make that a possibility? I assume not IPv4 or IPv6, but, well, how the devil are they getting it to interact with the rest of the Internet?

--
You are not alone. This is not normal. None of this is normal.
Re:Free registration by The+Mgt · 2003-07-14 06:45 · Score: 1

They can then target ads to users much more effectively, and market their userbase to advertisers much more solidly than they could with more rudimentary user tracking methods.

I love the idea of them targeting ads at all the 108 year old Afghan grannies etc etc.

I would have thought they'd like the idea of Google caching their content because it lessens the number of people poisoning their registration database with thousands of fake identities and email addresses. I must have added a couple of hundred of them myself.
Re:Free registration by GlassHeart · 2003-07-14 06:55 · Score: 1

You really think that the money they spend on advertising will level off?
Yes. As a rule of thumb, advertising costs cannot exceed the actual profits made from sales. Sales can increase with good advertising, but it will not increase infinitely, or proportionally (the last few potential customers will cost an inordinate amount of advertising to reach).
When advertising begins to work efficiently, the medium transporting that advertisement will realize it, and raise prices accordingly. This is why a television commercial during a football game costs a lot more than an ad in the local newspaper. This usually results in an increase of the quality of the ad, making some of them appealing to watch in their own right.
All in all, the system is pretty dynamic. They're not necessarily just going to bombard you with banner ads twice as big.
Re:Free registration by zcat_NZ · 2003-07-14 07:21 · Score: 1

they don't have to change the whole site; they just need to add ONE LINE of text to ONE plaintext file.

How hard is that?

--
455fe10422ca29c4933f95052b792ab2
Re:Free registration by Captain+Pedantic · 2003-07-14 07:21 · Score: 1

Of course, maybe the Grauniad is not the best place for an article about inaccuracies.

--

None are more hopelessly enslaved than those who falsely believe they are free. Johann Wolfgang von Goethe.
Re:Free registration by Jazu · 2003-07-14 07:48 · Score: 1

Wow! You can insult a prestigious organization! See, there was a reason all the cable news people were so happy they had learned the word "shadenfreude" for that story!

--
My joke got modded as Insightful and my insight got modded as Funny.
Re:Free registration by Anonymous Coward · 2003-07-14 07:49 · Score: 0

Anonymous reading is a privilege I don't want to give up. Not for nothing. Certainly not to make things easier on marketers.

Your argument is looking like a classic straw man to me. I don't complain when advertisers target their campaigns. Only when they seek to invade my privacy, or in limit my enjoyment of anonymity. There are other ways to target ads. You know, like Google ads, or the targeted text ads I see in blogtopia. That involves more work for them, like they actually have to know what websites are about, the topics they cover, and how those relate to their product and the audience they want to reach. But hey, the marketing dept. doesn't have to do that on their own. There are companaies that will gladly provide that service for a fee. Me, I don't feel the slightest obligation to do their work. I just want to read anonymously.
Re:Free registration by Fastolfe · 2003-07-14 08:04 · Score: 1

I had the same experience with 'ccbill'. Would that by chance be the one you're talking about? I confronted them and they flat out denied selling my address. They suggested that I might have used the address somewhere else, even though it was clear and obvious that the address was made up specifically for that one transaction.
Re:Free registration by xenocide2 · 2003-07-14 08:08 · Score: 1

Because I'm tired of having yet another username/password combination, and my viewership depends on it?

--
I Browse at +4 Flamebait
Open Source Sysadmin
Re:Free registration by cpeterso · 2003-07-14 08:16 · Score: 1

Most spam I get are to my _actual_ Email-adress which has me a bit confused.

that's because AOL sold your troed102527@aol.com email address to many spam lists. bon appetit!

--
cpeterso
Re:Free registration by AltaMannen · 2003-07-14 09:03 · Score: 1

is "Shadenfreude" the joy of shade?
Re:Free registration by JuggleGeek · 2003-07-14 09:27 · Score: 1

Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?
I've been registered with the NYTimes for several years, using a tagged address. No spam has ever been sent to that address.
And as you complain about registration, I notice that you didn't post to /. as AC, which means you are registered here at /. So am I. I get no spam to that address, either.
Re:Free registration by vTalon · 2003-07-14 10:31 · Score: 2, Interesting

they don't have to change the whole site; they just need to add ONE LINE of text to ONE plaintext file.

How hard is that?

First rule of web design (or programming, or anything having to do with a computer or any other complex system): that little change that you make in five minutes that you know won't affect anything -- the one you don't test because it's so minor -- that change is going to bring the whole page/program/computer/national defense system crashing down around your ears.

Adding one little line of code to every one of the myriad of pages on the New York Times website is not a small deal. It's going to involve a lot of paperwork, testing, and coding on the part of a lot of people.

It's probably simpler for Google to create a registry of "do not cache" pages on their end. And it's more their responsibility, anyway, being the ones who created the cache in the first place.
Re:Free registration by Anonymous Coward · 2003-07-14 10:52 · Score: 0

I am sorry - but this is *not* flamebait. I even disagree with the guy - but "flamebait"? Time to do more meta-moderation....
Re:Free registration by Nutrimentia · 2003-07-14 11:25 · Score: 1

I've been registered with the NYTimes for 4 years and haven't received any spam from them.

Just because you give out an address doesn't mean it will necessarily get spammed. And it isn't difficult to create special accounts (yahoo/ hotmail/ etc) just for this.

--
Etc, etc, ad nauseam, and so on and so forth.
Re:Free registration by Anonymous Coward · 2003-07-14 11:41 · Score: 0

Why not use Squid + adzapper? Or similar products for Windows (AtGuard was one of them).
Re:Free registration by RodgerDodger · 2003-07-14 11:55 · Score: 1

There is a significant difference to logging in to a site in order to participate in conversation and logging in to simply read news. At /., posting requires an identity, since anonymous postings are mostly ignored. However, there is absolutely no requirement that one log in to /. in order to read the stories. Your anology is broken. Privacy should be a choice. At /. one has that choice, with the NYT one does not.

Anyone can read the New York Times anonymously. It involves dropping some cash at your local news agent.

The web site is a bonus, and one you don't even have to pay cash for. If you don't like it, don't use it; support your local newsagency instead.

--
"Software is too expensive to build cheaply"
Re:Free registration by RodgerDodger · 2003-07-14 11:59 · Score: 1

I disagree. Let's imagine for a minute that everyone provides an accurate profile, targeted marketing works, sales increase, and the advertiser gets rich.

You really think that the money they spend on advertising will level off?

Yes. If your product is selling like hotcakes, you don't need to spend money advertising it.

Advertising is an overhead for a business, and businesses like to cut back on overheads when they can.

--
"Software is too expensive to build cheaply"
Re:Free registration by zcat_NZ · 2003-07-14 12:05 · Score: 3, Informative

Adding one little line of code to every one of the myriad of pages on the New York Times website is not a small deal. It's going to involve a lot of paperwork, testing, and coding on the part of a lot of people.

But it's not one line of text on EVERY page. It's one line of text in /robots.txt, a file that is independent of the rest of the site and never even accessed by ordinary browsers.

It's probably simpler for Google to create a registry of "do not cache" pages on their end. And it's more their responsibility, anyway, being the ones who created the cache in the first place.

Google already have exactly such a registry, and they don't even wait for sites to contact them.. Their robots -asks- the site (via the recognised standard '/robots.txt' file) if they object to being indexed and/or cached. Most other search engines look for the same file and handle it the same way.

This is (from my perspective) far better than having to individually register your site with the several hundred search engines that might try to index it..

--
455fe10422ca29c4933f95052b792ab2
Re:Free registration by Anonymous Coward · 2003-07-14 12:09 · Score: 0

One day you will learn.
For now, you know nothing :O
Re:Free registration by Anonymous Coward · 2003-07-14 12:17 · Score: 0

The point is so they can target web advertising. They link your email to demographics and usage data such as what you read or click on, and use that to control ads.
--- Eric
Re:Free registration by JoeBuck · 2003-07-14 12:34 · Score: 1

The NY Times has serious flaws (as well as serious virtues). Conservatives usually scream about Jayson Blair; liberals on the other hand are appalled at Judith Miller, who seemed to be some kind of mole working for Ahmed Chalabi, the convicted embezzler who wants to run Iraq.
But as every other newspaper and television station in the US basically just reprints and rewrites what the the Times writes, you're pretty much stuck with them. Small newspapers and local TV stations don't have the staff to find non-local news on their own, so they just read the Times and the Washington Post and print what they print.
Re:Free registration by Zachary+Kessin · 2003-07-14 13:53 · Score: 1

Sure it does, buy a newspaper. You can walk up to a newsstand give the guy some money and take it home and read it. Hell they will even deliver it to you every morning if you want them to. (Assuming you live somewhere where they deliver)

This is freedom of the press. The editors and publishers of the NYT have decided on a policy for how to distribute content online. It is their content and they can make that choice. If you don't like it, read a different newspaper.

Let us be clear the New York Times uses their ads to pay the bills. Having reporters all over the world and editors and all the other things they do costs a lot of money. The cover price pays for the newsprint and thats about it. Everything else is paid for by ads. No Ads no newspaper.

--
Erlang Developer and podcaster
Re:Free registration by macmurph · 2003-07-14 14:44 · Score: 1

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper.

I can't agree. I was in downtown Damascus, Syria in sept. or oct. of 2000. I was reading a NYT article about massive anti-american protests that had taken place in downtown Damascus, Syria the previous day. I was there, and there were no protests. At this point, I realized that the New York Times is seriously skewed. I attribute this to the jewish editors but thats merely an assumption on my part. Perhaps someone else inside the paper had an agenda. Nevertheless, on matters about the middle east, the NYT cannot be trusted. And I therefore cannot really trust anything I read in it. I do agree that the tripe is well-written, serious, and interesting, but it is still tripe.
Re:Free registration by jbottero · 2003-07-14 15:25 · Score: 0

Thanks for the Troll mod, but you are wrong. My point is that even though most Slashers may have other opinions, some people realize that you must give to recieve. For commercial products like music, this generally means an exchange of money.

Let me ask you. Do YOU work for free? Is it all right if YOUR customers tell you that they would rather not pay you for your work? Does that concept pay YOUR rent?

Sure, mod me to troll, but it's YOUR closed mind that does it.
Re:Free registration by 1u3hr · 2003-07-14 16:28 · Score: 1

they just need to add ONE LINE of text to ONE plaintext file
Specifically, this one.
Here's the current nytimes.com/robots.txt. Notice for instance it allows all robots ("User-agent : *"), but bars them from the "partners" and "archive" sections.
PS
Well, I did have the file pasted here, but Slashdot's fucking lameness filter prevents me from posting it, so use the link above if you're interested.
Re:Free registration by Anonymous Coward · 2003-07-14 19:24 · Score: 0

Thanks for the info on /.'s anti-troll system. That would explain why last week I got a message telling me I had already moderated a thread I was posting to (I don't even have an account - too lazy) - SlashCode doesn't know what a dynamically assigned IP is. Oh well, looks like I wasted some guys hard earned mod points. Nice to see ./ views broadband users as better moderators than the rest of us.
Re:Free registration by stephanruby · 2003-07-14 20:00 · Score: 1

"Adding one little line of code to every one of the myriad of pages on the New York Times website is not a small deal. "
The poster below is absolutely right. Only one file needs to be modified. The parent likes to talk about the "first rule of web design", but he obviously never has published a web site in his life.
Re:Free registration by zo219 · 2003-07-14 20:15 · Score: 1

"Very little reason not to"?

Troll. Poseur.
Re:Free registration by instarx · 2003-07-14 21:36 · Score: 1

The NYTimes is not a source of SPAM. I have been registered on NYTimes website for many years now and I have never received a single piece of spam associated with that account. How do I know? I used a completely different user name than I normally use and a unique email account name when I registered.

The purpose for registration is to get you to state that you are using it for personal reasons and not for commercial purposes.
Re:Free registration by instarx · 2003-07-14 21:43 · Score: 1

Flamebait!? You've got to be kidding.
Re:Free registration by NexusTw1n · 2003-07-14 21:49 · Score: 1

Another point is that anonymity is one of /. greatest strengths.

Except you aren't anonymous. Your IP address is stored for at least 24 hours via slashcode. Some IP addresses are stored for longer.

If slashdot truly respected privacy, trolls would crapflood because slashcode wouldn't record the IP address of posters. Bad moderators could mod posts down while posting on the same thread "anonymously". As it stands, slashcode prevents you from doing that.

Actively participating in slashdot means you relinquish privacy. Even if you don't get an account, slashcode remembers who you are for at least 24 hours.

If privacy while reading the NY times is important to you, read it in a library, pay cash for a newstand copy, or google for the thousands of user names and passwords out there.

To be honest, I've probably been trolled, as anyone bringing up the universal declaration of human rights in relation to free registration of an online newspaper is getting things way out of perspective.

--
It has become appallingly obvious that our technology has exceeded our humanity. --Albert Einstein
Re:Free registration by vTalon · 2003-07-15 14:13 · Score: 1

But it's not one line of text on EVERY page. It's one line of text in /robots.txt, a file that is independent of the rest of the site and never even accessed by ordinary browsers.

I stand corrected. Though I can still see why the Times might be hesitant to go to the (minorish) effort of changing it -- there'll still be paperwork to do, and procedures to be followed in order to make a change that's only necessary because of Google's caching feature -- I can see why somebody in Times' Management might want Google to do the work rather than them.

I'm not saying that the Times is right; I can just see where they're coming from.
Re:Free registration by vTalon · 2003-07-15 14:20 · Score: 1

The parent likes to talk about the "first rule of web design", but he obviously never has published a web site in his life.

A bit snippy, aren't we?

You're actually quite right that I've never done any serious web design in my life. But I have dealt with my share of customer service messes created by overzealous web developers -- those last-minute Friday evening ideas have a small tendency to create a mass of CS e-mail come Monday morning.
Re:Free registration by stephanruby · 2003-07-15 20:11 · Score: 1

Yes, I was a bit snippy.
Re:Free registration by Anonymous Coward · 2003-07-16 01:02 · Score: 0

"And I think anyone except perhaps Rummy can see that it's a tad early to state "Iraq is not a quagmire", never mind apologize for wondering if it is."

Er...thats what the OP was complaining about! Keep up!
Re:Free registration by djmitche · 2003-07-17 10:31 · Score: 1

Come to think of it, the reason I don't like to log into NYT is usually I just want to glance at an article, just like I usually want to glance at the /. main page. My browser holds /.'s cookie for a long, long time, so I never have to log in again. If the NYT had me register once, and that cookie stayed around for long enough that I'd never have to think of it again, that would be fine. But when I'm asked to log in every day, and I have to remember which fake email address I used and which throwaway password, it just gets tiresome, and I don't read NYT articles.
But that's OK, I was losing karma for reading the articles anyway.

Worst result by presroi · 2003-07-13 22:01 · Score: 2, Interesting

The worst outcome would be a google-database which is not representative for the general web. I simply ecspect all results in google to be accessible without registering, paying or doing anything similar.

Re:Worst result by trout_fish · 2003-07-13 22:20 · Score: 1

Surely that would reduce the value of a google search. If you need to find out about something and the best source charges then it has to be good for google to list non-free sites.
Re:Worst result by Bitter+Old+Man · 2003-07-13 22:28 · Score: 3, Funny

Ecspect? Ecspect? ECSPECT? Jesus H mother of god, it's like just when I think slashdot couldn't possibly get any worse... it does.
Re:Worst result by chef_raekwon · 2003-07-14 01:17 · Score: 1

se which is not representative for the general web.

as at has already been stated, many websites already block Google from its content. In fact, it is quite easy to do. So, does this mean that we already don't see a general representation of the web?

it is a simple 'deny' the googlebots from crawling the site. its easy to do, and everyone's doing it.

--
We're like rats, in some experiment! -- George Costanza
Re:Worst result by mog · 2003-07-14 02:46 · Score: 1

First, I'd note that while 'x' is right next to 'c', where else would that 's' have come from had it been a simple typo? Second, I would like to propose the phrase "dvordork" instead of "dvorak dork". Say it a few times. It really just kinda flows right off the tongue.
Re:Worst result by BorgCopyeditor · 2003-07-14 06:59 · Score: 1

Who aksed you? :)

--
Shop as usual. And avoid panic buying.

God damnit... by tangent3 · 2003-07-13 22:03 · Score: 5, Funny

Now we can't karma whore by linking to the google cache?

Re:God damnit... by xChOasx · 2003-07-13 22:14 · Score: 1, Funny

Let me try: Google cache
Re:God damnit... by anonymous+loser · 2003-07-13 22:21 · Score: 4, Informative

The *real* karma whores link to http://archive.nytimes.com anyway.

NYTimes have futzed around with it a bit, but if you play with it, it still gives you registration-free access to their content, it just takes a couple of clicks nowadays.
Re:God damnit... by Duds · 2003-07-13 22:27 · Score: 1, Funny

No, now you have to complain that you would have linked but now can't. You may also may some reference to it being Gates' fault.
Re:God damnit... by toopc · 2003-07-13 23:42 · Score: 2, Funny

You'll have to stick to making lame, obvious jokes.
1. Make lame, obvious joke.
2. ???
3. Profit!
All your lame jokes are belong to us.
In Soviet Russia lame jokes make you!
Re:God damnit... by jdh-22 · 2003-07-14 01:49 · Score: 1

Get the google cache of the article here:

Google cache

--
Every Super Villan uses Linux.

Google - more useless everyday by jkrise · 2003-07-13 22:05 · Score: 2, Insightful

IANTrolling here, but I find Google more and more useless by the day. Sometime back, I pointed out how Google seems to have a soft corner for articles and sites that affect big firms such as Microsoft.

In fact, several of Slashdot's own articles on Microsoft aren't available from Google news, although Slashdot is listed as a 'news' source. Couple of MS related Slashdot articles (on the Oregon bill - March 6th and May) have been removed, but much pro-MS content pre-dating March is still referenced.

Google seems to be aping the other Gorilla, despite all the posturing, and Microsoft's so-called attempts to categorise it as a competitor, when in fact, Google appears to be an ally!

--
If you keep throwing chairs, one day you'll break windows....

Re:Google - more useless everyday by cioxx · 2003-07-13 22:19 · Score: 4, Informative

Sometime back, I pointed out how Google seems to have a soft corner for articles and sites that affect big firms such as Microsoft.

"Google News is highly unusual in that it offers a news service compiled solely by computer algorithms without human intervention. While the sources of the news vary in perspective and editorial approach, their selection for inclusion is done without regard to political viewpoint or ideology. While this may lead to some occasionally unusual and contradictory groupings, it is exactly this variety that makes Google News a valuable source of information on the important issues of the day." source

Remove your tinfoil hat please. There is no conspiracy. Google News features articles from Newsmax, Electronic Intifadah, Islam Online, Al Jazeera, World Net Daily, etc. If there was any filtering going on, these sites would have been off the radar long time ago.

Also, Slashdot is not a professional journalistic site. It's a News-based comment board where people come to share their opinion. In a perfect world Slashdot doesn't even belong on Google News.
Re:Google - more useless everyday by jkrise · 2003-07-13 22:26 · Score: 2, Interesting

Google News is highly unusual in that it offers a news service compiled solely by computer algorithms without human intervention.

A May article was referenced in Google, but the link pointed to a March 6th article. How can computer algorithms cause this?

While this may lead to some occasionally unusual and contradictory groupings, it is exactly this variety that makes Google News a valuable source of information on the important issues of the day.

Just search for Googlewash using Google. Read story in TheRegister (it's not delisted now). Watch hypocrisy in action. Roll up eyes in disbelief. Adjust tin foil hat.

--
If you keep throwing chairs, one day you'll break windows....
Re:Google - more useless everyday by MonTemplar · 2003-07-13 22:29 · Score: 4, Interesting

First off, Google News is still in Beta at the moment.

Second, the Google News database only goes back a month or so, probably by design.

Third, I was able to search for 'site:slashdot.org microsoft oregon' on Google just fine this morning. Got 243 results, and the Google Cache has copies of the first three pages returned, which relate directly to the Oregon bill you use as your example.

So, where is the problem?

--
-MT.
Re:Google - more useless everyday by Troed · 2003-07-13 22:46 · Score: 1

I just searched for "googlewash" using Google. I got no hits on TheRegister - only a bunch of blogs. Searching News didn't help either.

--
it's in my head
Re:Google - more useless everyday by dhodell · 2003-07-13 22:51 · Score: 5, Interesting

Just FYI, this behavior is due to the fact that Googlebot has a sort of "built-in" mechanism to ignore (or at least rank lower) forum-type sites. Since /. is primarily a "news headline and discussion" site, Google will not rank it as highly as one that seems to be more "on-topic". This is because there is no guarantee that any URLs or email addresses within the page have anything to do with the actual page content itself.

Outside of user posts, /. has little genuine unique content. It summarizes a lot of headlines; this content is not unique.

Other (large) factors determine the way Google ranks pages, including the "PageRank" feature. There are lots of documents about the way Google ranks sites, I suggest to check them out. The best way is probably to Google for it :).

Anyway, this is a bit more on-topic:

I highly appreciate Google's caching feature, and don't see how it can be taken as "bad".

This is what's "bad" about Google and what I expect that, at some point, will come to haunt them. For instance, if I want to get serial numbers without porn popups, I can usually search for something like "Office XP Serial Number Serialz Warez" or something similar. Within the first couple of pages, I will probably find my serial number in the text of the page description.

If not, it's on the page, oftentimes without a popup, since the serial/crack page itself is the one linked.

Want to find X-Win32? How about doing "* * * * * * xwin32*.exe" - lets get some directory listings containing this filename.

No doubt this proves that Google is more than just a search machine... but I think that their superior techniques will definitely come back to haunt them in the future. NYT is way off target with bitching about their caching features... you can turn this off easily, and there are a plethora of scripts one can use to break out of Google's cache and send someone to the main site (or, perhaps, login area in the case of NYT).

But, in other news, Google might need to watch out...

--
Kind regards, Devon H. O'Dell
Re:Google - more useless everyday by jkrise · 2003-07-13 22:54 · Score: 1

The article is here. The word Googlewash was in fact coined by TheRegister.

The article contains a reference to the original article as well. Interesting reading.

--
If you keep throwing chairs, one day you'll break windows....
Re:Google - more useless everyday by Rhubarb+Crumble · 2003-07-14 01:21 · Score: 1

I just searched for "googlewash" using Google. I got no hits on TheRegister - only a bunch of blogs. Searching News didn't help either.
It's there, at link #36 (of 999) or so. Or try searching for "googlewash site:theregister.co.uk".
Re:Google - more useless everyday by Dratman · 2003-07-14 01:39 · Score: 1

Google is sooo dying. It will join *BSD in doggy heaven.

--
Sigmund
Re:Google - more useless everyday by Anonymous Coward · 2003-07-14 01:47 · Score: 0

Google filters Indymedia.

I'll remove my tinfoil hat if you remove your blinkers.
Re:Google - more useless everyday by sketerpot · 2003-07-14 01:55 · Score: 1

It's ironic that with all the talk going aroung about Google allegedly censoring the web, you think that not censoring it is going to haunt them. :-)
I think that Google can probably claim that it's too hard for anyone to accurately classify warez sites and such. But remember, you don't have to suffer porn popups!
Re:Google - more useless everyday by Anonymous Coward · 2003-07-14 03:39 · Score: 0

I don't know anything about Googlewash, but read this article.
Re:Google - more useless everyday by Anonymous Coward · 2003-07-14 08:08 · Score: 0

Out of curiousity, who indexes indymedia?

daypop, blogdex, altavista news?

Personally I have mixed feeling about Google news dropping Indymedia. At the time of the decision, it seemed like there were a lot of hate-filled antisemitic posters on Indymedia. It wasn't news. It was like reading slashdot at -1 without any difference between positive and negative posts.

Furthermore, I believe Google news was consitent in dropping all blogs. That's kind of like blinders, because you get 103 reiterations of the latest AP or Reuters story, but you don't get the stories that really do break on blogs. But there are other search engines.

Yes. by Naikrovek · 2003-07-13 22:06 · Score: 4, Interesting

Shouldn't the NY Times simply tell Google not to cache their site?

You do realize that this is probably the basis of the "talks" that are going on, right? C|Net (as per usual for them and every news agency) is making a big deal of it to get themselves and their advertisers that tiny wee bit more of attention. Every little bit helps i guess.

Check http://nytimes.com/robots.txt in a week.

Re:Yes. by Zocalo · 2003-07-13 22:15 · Score: 2, Insightful

Actually, the link to "robots.txt" raises an interesting point. Why is NY Times even in "discussions" for this, other than to gain some column inches? It's entirely upto the NYT whether to let Google's robots to index their site, isn't it? I would have thought that Google's robots would be well behaved in this respect and simply move onto the next site if they were told to go away by robots.txt.

--
UNIX? They're not even circumcised! Savages!
Re:Yes. by Anonymous Coward · 2003-07-13 22:25 · Score: 1, Interesting

AFAIK, there is no standard for allowing index robots to scan a site, but disallowing cache robots at the same time. In this case, it shouldn't be a problem because the Google bot obviously gets the VIP treatment anyway (or does the Google bot log in to the NYT webserver?). But in the general case, do we need a better robot exclusion standard to distinguish between certain types of robots?
Re:Yes. by Kingpin · 2003-07-13 22:27 · Score: 1

It is a big deal. Why should the "user" turn a facility off, rather than on?

--
Unable to read configuration file '/bigassraid/htdig//conf/14229.conf'
Geocrawler error message.
Re:Yes. by TCM · 2003-07-13 22:45 · Score: 1

Because it's the nature of the web to not differentiate by user agents or the way they access your free (it's http on port 80, you get no 403 or otherwise, so it's free. "Content" providers and your fucked up ideas bite my ass) information? I find it logical to make it an extra step to prevent automatic crawlers from automatic crawling.

--
Of course it runs NetBSD. BTC: 1NT7QvbetmANwaMzhpVL6
Re:Yes. by gibodean · 2003-07-13 23:23 · Score: 2, Insightful

Actually, from the text of the article, they say that they want it so that when you click on a link in google, you get the registration page of the NYT.

A robots.txt would stop google from indexing the site altogether. They don't want that to happen. They want a google search to show NYT web pages, but they just want to make sure that when the user tries to view it, they have to register with NYT first. That means that google must still index the page, but not allow access through the cache. Plus, it must direct to a sign-on page rather than the page itself, but that is something that I'm sure the NYT itself could handle, like it think it does now anyway.
Re:Yes. by AyeRoxor! · 2003-07-13 23:31 · Score: 2, Interesting

"It's entirely upto the NYT whether to let Google's robots to index their site, isn't it?"

I personally think the NYTimes wants Google to continue to cache their stories.

If they use robots.txt, no NYT articles will come up in Google. However, if they *do* succeed in these talks, I presume the articles will still come up, but uncached, and linking to a signup/login screen. It makes pretty good business sense.
Re:Yes. by Arker · 2003-07-13 23:33 · Score: 2, Interesting

A robots.txt would stop google from indexing the site altogether. They don't want that to happen. They want a google search to show NYT web pages, but they just want to make sure that when the user tries to view it, they have to register with NYT first. That means that google must still index the page, but not allow access through the cache. Plus, it must direct to a sign-on page rather than the page itself, but that is something that I'm sure the NYT itself could handle, like it think it does now anyway.

I sincerely hope google shows some balls and tells them to f right off.

They can't have it both ways, either they're on the web or they're not. They've been trying for years to subvert things so they can have their cake and eat it too, and they need to get told no by someone they'll listen to.

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
Re:Yes. by Arker · 2003-07-13 23:38 · Score: 1

AFAIK, there is no standard for allowing index robots to scan a site, but disallowing cache robots at the same time.

That's right, and that's a feature not a bug.

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
Re:Yes. by StarFace · 2003-07-13 23:40 · Score: 4, Interesting

True, there is no standard, but Google's method of allowing indexing and caching as independently selectably features is well documented and extremely easy to do. You can even tell Google specifically to stop caching, if you don't mind smaller engines caching.

So, it isn't a standard, but it is a piece of cake for NYT to figure out, and indeed, they already have. As the person above said, this is just C|Net trying to be a real news source. The article even says that the method I just described is the focus of their "discussions."

I imagine the discussions, if anything, were intially a friendly lawyer call (if even that,) which was quickly diverted to a tech issue and ended up with some webmaster at NYT getting the specifics of how to set up the Times so that Google will still index their pages and bring them up with searches, but not cache them.

--
V
Re:Yes. by StarFace · 2003-07-13 23:46 · Score: 1

Actually, click the second link the poster put it. It describes how you can do just that, have Google index the page, but not cache it. I'm sure you've even seen pages that use that. It just shows up with a "Similar Sites" link at the bottom. And yes, I am pretty sure that clicking on a story link without being reged will just take you to their login page, because it is cookie based, and checks onload.

--
V
Re:Yes. by Anonymous Coward · 2003-07-14 01:09 · Score: 0

There isn't a standard for robots.txt?

Someone should tell the w3c this.

http://www.w3.org/Search/9605-Indexing-Workshop/ Pa pers/Frumkin@Excite.html
Re:Yes. by StarFace · 2003-07-14 01:45 · Score: 1

Why would Google rely on meta tag directives if it could be accomplished in a robots file?
Of course there is a standard for robots.txt, but I do not think that it addresses full page chaching. The cache directives in the robots.txt file specially refer to how the spider should cache the text file, not the web pages that it controls. So you could set it to no-cache if your robots.txt file changes constantly.
I could be wrong, but the last time I checked that is all I saw. However, I am sure that w3c is well aware of the issue. When the drafts for robots.txt went around in '96, I don't think anyone was anticipating massive spiders that actually created a running copy of most the Internet once a month.

--
V
Re:Yes. by Fjord · 2003-07-14 03:19 · Score: 1

One thing I could see is if they want the index and the cache to be on google, but they don't want the cache to work if the referrer isn't from google. So you can't link to the cache on /., you must be clicking on a link on google's site.

Aside from robots.txt, there are meta-tags to tell google to not cache.

--
-no broken link
Re:Yes. by schon · 2003-07-14 03:21 · Score: 1

I personally think the NYTimes wants Google to continue to cache their stories.

Considering that NYT sells (it _is_ selling - you buy it with your personal information) access to it's site, I think they might have a problem with having a freely-available mirror available. Unless these "talks" are asking Google to deny access to people who don't have a NYT login.

If they use robots.txt, no NYT articles will come up in Google.

Not true - read the link.. Google has a mechanism that will allow your site to be indexed, but not cached.
Re:Yes. by pmsyyz · 2003-07-14 03:38 · Score: 1

/>

Google will still index and list but not cache the page.

http://www.google.com/webmasters/3.html#B2

--
Phillip
Re:Yes. by dgmartin98 · 2003-07-14 03:48 · Score: 1

That's right, and that's a feature not a bug.

Do you work for Microsoft?

--
FPGA, Wireless, ASIC, Verilog, VHDL, HW, 10yr exp, Team Lead, Ottawa (More? Email above. slashdotusername=dgmartin98 )
Re:Yes. by jbottero · 2003-07-14 03:54 · Score: 0

It's not a question of Google indexing NYT, it's a matter of google cacheing NYT
Re:Yes. by AyeRoxor! · 2003-07-14 04:51 · Score: 1

>If they use robots.txt, no NYT articles will
>come up in Google.
>
>Not true - read the link.. Google has a
>mechanism that will allow your site to be
>indexed, but not cached.

You're missing my point. They want their links to be shown when you search for text that their articles contain. They just want you to have to login when you click that link.

If I search for "McDonalds," NYT wants their article about McDonalds to show up in the google hits. But when you click that link, they want you to get a login screen.
Re:Yes. by Anonymous Coward · 2003-07-14 04:51 · Score: 0

Shouldn't the NY Times simply tell Google not to cache their site?

No. Why should they? By the same token, I shouldn't have to tell random spammers that I'd like to opt-out.

The fact that web content is freely accessible (minus the trouble of registering some fake email address) does not mean anyone can freely mirror it. NYTimes articles are copyright and more importantly provide the NYTimes with revenue (advertising and subscriptions) and market share. If Google can cache NYTimes articles, why couldn't, e.g., the Boston Globe?

Plus the "my-bot-did-it" excuse is getting old...

NY Times likes accuracy by Anonymous Coward · 2003-07-13 22:08 · Score: 4, Interesting

The reason they're trying to stop this is because with NYT reputation, they keep retracting stories all the time. With Google cache this could be problematic and the management/editors/authors could get into trouble again.

I do however dislike Google cache for many reasons. It's bad for privacy.

Re:NY Times likes accuracy by anshil · 2003-07-13 22:28 · Score: 4, Insightful

Since when is content published in the WWW about privacy?

It's just like a government that wants to control which newspapers maybe archivied for history research.

--

--
Karma 50, and all I got was this lousy T-Shirt.
Re:NY Times likes accuracy by MonTemplar · 2003-07-13 22:37 · Score: 4, Informative

What he said! Remember, the first two W's are for World Wide.

The only people who seem to have a problem with webpage caching are either legal flacks working in CYA Mode, or webmasters who can't be bothered to mark up their pages and add robots.txt files to make sure that only public information goes out of their websites.

--
-MT.
Re:NY Times likes accuracy by Anonymous Coward · 2003-07-13 22:56 · Score: 0

Let me give you a rough scenario here.

John Doe in 2001 registers on a gaming site where he does not have control over robots. His email, location, name, etc are displayed on that site. Google spiders crawl through the site, cache everything and months later the site is no longer operational.

Now, imagine that person who submitted his info to the site does not want to be associated with it. Behold! Google got it covered. Would they prune the cache at a request of every user whoever felt their privacy was being violated? Doesn't sound like a wise business move to me handling millions of requests a day.

WWW is not set in stone. Just like humans - it constantly evolves. There is no reason why it should keep private pages in its storage servers.

If I were to search for my username on Google, it will show bunch of anti-Microsoft rants. Suppose I were to get interviewed by a company which havily utilizes Microsoft products and the HR person goes on Google and searches for my name. You think I'll get the job? Those pages do not exist anymore, but the cache is still there.

This is why Google cache is very bad for privacy. If you fail to see it within a month - tough luck!
Re:NY Times likes accuracy by Pendersempai · 2003-07-14 01:12 · Score: 1

The reason they're trying to stop this is because with NYT reputation, they keep retracting stories all the time.

Nonsense. The reason they're trying to stop caching is because it lets people see their content without registering. Registering lets them know how many eyeballs hit their site. And their advertisers pay per eyeball. So if people view their content without having registered, their bottom line feels it.
Re:NY Times likes accuracy by Kosi · 2003-07-14 01:17 · Score: 1

or webmasters who can't be bothered to mark up their pages and add robots.txt files to make sure that only public information goes out of their websites.

Information that can be acessed by a Google spider is always public!
Re:NY Times likes accuracy by eXtro · 2003-07-14 02:29 · Score: 1

The Google cache isn't bad for privacy. Your act of ranting in public against Microsoft were bad for your privacy. Not too long ago in order for your views to be made public they had to be in some way notable. Maybe the reason for their notable was because of your eloquence. Newspapers picked you up and published you. News shows picked you up and aired you. You could try and make yourself notable by publishing a letter to the editor, but of course the editor could choose not to publish you. If your views were published and at a later date you no longer agreed with them it's not the newspapers fault that your name was attached to those views. You did it. Your name is on microfiche or in library stacks because of your own actions.

The world wide web enabled anybody to publish anything with no editor in the middle. There is nobody there to check facts or decide that your view wasn't eloquently written enough or wasn't personally embarassing. All the google cache does is what microfiche or libraries did for printed material. If you don't want to be embarassed in public by your past views then don't publish your present views. The onus is on you. You can't even blame an editor for modifying your words and putting them in a different context.

If you can't accept this then you are posting on the wrong medium.

--
Chris Kuivenhoven is a thief, beware
Re:NY Times likes accuracy by anshil · 2003-07-14 03:39 · Score: 1

Just one thing to retort:

Beware and behold: You are responsible for your actions!

--

--
Karma 50, and all I got was this lousy T-Shirt.
Re:NY Times likes accuracy by Anonymous Coward · 2003-07-14 07:22 · Score: 0

I do however dislike Google cache for many reasons. It's bad for privacy.

If you put something on the "World Wide Web" then it's available for all to view. You want to keep something private, then don't publish it. If you do publish it then don't bitch about it when you're a laughing stock on slashdot

I'll do what I want! by scudco · 2003-07-13 22:09 · Score: 2, Interesting

I think if the NY Times has a problem with then they have the right to stop google from caching the site, but I do no think it would be a wise decision on the part of the NY Times. The NY Times enjoys being a reputable news outlet and if they were to lower their readership and more imporantly not allow everyone to read their articles. It would hinder their reputation in a slight way and all slashdotters might turn to a different news outlet, which is only bad for them.

Re:I'll do what I want! by sql*kitten · 2003-07-13 22:44 · Score: 1

The NY Times enjoys being a reputable news outlet and if they were to lower their readership and more imporantly not allow everyone to read their articles.

I don't think it's about readership numbers. The NY Times is paranoid - and rightfully so - that they have more Jayson Blairs in the ranks. They know that they are under scrutiny, from everyone from rival newspapers to Slashdot journals. They have a strong vested interest in members of the public being unable to compare "before" and "after" articles. If another one of their reporters is caught out - whether it's Jayson Blair simply making stories up, or Maureen Dowd misquoting her sources - they want to be able to brush it under the carpet as quietly as possible.

It raises 2 questions .. by Mr_Silver · 2003-07-13 22:10 · Score: 4, Informative

such as:

When will slashdot stop linking to articles that require a registration?
When will slashdot consider implementing caching for pages that, by linking to, they manage to take off the internet?

Sure, the 2nd question has been answered in the FAQ. Except it was written three years ago and Google manages this just fine. Maybe time for a second look?

On the topic of site updates, has anyone noticed that 90% of the links on http://slashdot.org/code.shtml don't work any more?

Hell the link to an Avantgo version of Slashdot points to a website which has been broken for over 2 years.

--
Avantslash - View Slashdot cleanly on your mobile phone.

Re:It raises 2 questions .. by n3bulous · 2003-07-14 01:38 · Score: 1

I was originally going to mod this down, mainly because it should not matter if the site requires registration or not. Use a fake yahoo/excite/whatever email and fahgetaboudit. Accessing sites requiring paid registrations would not make too much sense, but I can forsee a good story on a site that provides reliably good content that requires a pay-to-view.

Anyway, since slashdot is OSS and OSS claims to innovate, why doesn't someone figure this out? Rough stab:

- create an easy form for sites to access when notified of an incoming /.ing. Yes, No, Cache.
- If cached, provide statistics back to the client.

It's not like /. stories are timely, so waiting 6 hours for a site's response is not a big deal.
Give them 4 hours (based on their timezone...) and then post the link.

--
"The area of penetration will no doubt be sensitive." ~ Spock
Re:It raises 2 questions .. by Anonymous Coward · 2003-07-14 01:50 · Score: 0

dunno about the avantgo site, but this with plucker just fine.
Slashdot - Palm
Re:It raises 2 questions .. by Anonymous Coward · 2003-07-14 02:04 · Score: 0

Slashdot _does_ cache pages that either require registration or go down.

It's called karma whoring, and for any article that matters, the KWs post a cache of the content very quickly.

Where's the problem?
Re:It raises 2 questions .. by Jorrit · 2003-07-14 02:09 · Score: 1

Except that this 'cache' doesn't prevent the site from going down. A real cache could.

Greetings,

--
Project Manager of Crystal Space (http://www.crystalspace3d.org). Support CS at http://tinyurl.com/cb3x4
Re:It raises 2 questions .. by Rip!ey · 2003-07-14 03:28 · Score: 1

What about the third question?

3. Could Slashdot recieve some payola from NYT for the large number of referals it must get to it's registration page.

If Slashdot stopped refering to NYT articles entirely, would NYT benefit from reduced bandwidth costs, or suffer from a reduction in the number of new readers with respect to time. There must be a reasonable number of Slashdotters who have registered with NYT that would not have done so if not for the linkage.

Maybe they should do a partnership deal giving Slashdot subscribers instant access to NYT stories without registering or something.

Just tossing ideas around here ...

Systems problems? by Ratso+Baggins · 2003-07-13 22:10 · Score: 1

If your users (readers) need the google cache then there are system problems. Either the technical systems are inadequate (yay slashdot) or your business systems are inadequate (or both). Tech is easy, and so are business systems. Well at least the ones you want simple every-day folks to use...

--

--
"we live in a post-ideological world..." - Billy Bragg.

Re:Systems problems? by Anonymous Coward · 2003-07-14 01:16 · Score: 0

Either the technical systems are inadequate (yay slashdot) or your business systems are inadequate (or both).
In the case of the NY Times it's the annoying mandatory "free" registration that is the issue. On Slashdot you can choose to not register and also even post anonymously. On the NY Times site you do not have that option unless you create a fake account. Now, I have no idea why anyone would still read the NY Times after it has been debunked continually as liberal horseshit deadwood old media. This is the WWW people, new media has time to shine. Dead wood media should be avoided to destroy the old monopolies.

Might not be all bad... by leshert · 2003-07-13 22:10 · Score: 4, Interesting

As the poster mentioned, Google already has a way to opt out of caching, so "talks" sounds like this is something different. My guess is that Google will become an affiliate of the NYT (in other words, if you hit a NYT link from Google, you're exempted from registering), and will then drop the caching.

Re:Might not be all bad... by leshert · 2003-07-13 22:16 · Score: 2, Interesting

Pardon me for self-replying but it just occurred to me: maybe Slashdot itself might have an interest in becoming a NYT affiliate? Surely the NYT gets a good chunk of pageviews (and therefore ad revenue, modulo the minority that block them) every time one of their articles shows up here.
Re:Might not be all bad... by SkArcher · 2003-07-13 22:30 · Score: 1

in other words, if you hit a NYT link from Google, you're exempted from registering

So.... we get to Karma whore by linking to the google search page with the NYT article on it...?

--

An infinite number of monkeys will eventually come up with the complete works of /.
Re:Might not be all bad... by joeykiller · 2003-07-13 22:42 · Score: 2, Insightful

As the poster mentioned, Google already has a way to opt out of caching
Yes, Google has this. But for a couple of years I've had the opinion that it actually should be reverse: Sites should be able to opt in, not out. The default should be no cache versions.

Lately there's been discussions here on Slashdot about fair use. about 30 second clips of music on the net, and thumbnails of images being fair use. I can agree that that's fair use of content.

But think about Google's cache: A page in Google's cache isn't a part of - or a summary of content - but it is the entire content of a page. If this isn't breach of copyright, I don't know what is.

Google's cache gives more food for thought as well: Let's say I wrote something about someone on my web site, and this person sued me. A jugde decides against me and gives me a fine, and orders me to remove the content. But even if I do so, the inflammatory words would still be accessible trough Google's cache.

Now, some of you may argue that I could just write Google and ask them to remove the page. But the point is that if this is legal, just about anyone can cache my site. If enough search engines caches content, I most probably would never be able to find every site that provided cached versions of my site.

I'm not sure as to what constitutes fair use of content in the US, but in my country at least (Norway) I'm almost certain that Googles cache mechanism would be judged a breach of copyright laws.
Re:Might not be all bad... by arkanes · 2003-07-14 02:21 · Score: 1

My personal view is that caching the exact content of information thats made freely and publically accessible by the author can hardly constitute a breach of copyright, and the benefit of having documentation of the state of the web far outweighs any niggly little pedants interest in hiding what he's said.
Re:Might not be all bad... by joeykiller · 2003-07-14 06:40 · Score: 1

I can see your idealistic point here, but let me try to move some of what you're saying into the world of physical products:

You decide to publish a free newspaper that is to be handed out on the streets of your home town. You hire editorial staff and a few people to sell ad space in your paper, so that your business will go break even or maybe even have a little profit.

But every morning a new edition of your paper comes out, I pick up a copy and drives home to my hometown. There I print additional copies [and maybe even attach an ad insert in the paper], go out on the streets and hand out my newspaper - based entirely on your content - for free.

I'm quite sure that you'd cry "theft" before long, and that you'd sue me if I refused to stop publishing my pirated version. And the thing is that the courts would give you right.

The same applies with TV: Fox pays for the production of the tv series 24. They also air the show for free, so that you and I can enjoy it (or not, that depends). Does free broadcasts give me the right to record the show on my computer and rebroadcast it or publish it on my web site as streaming video?

Why should web copyrights be any different? I maintain that if copyright holders want to spread their content - even old versions - via Google, then fine: Let them opt in to Google's cache program.

But Google's opt out program is more akin to telling burglars "Hey, I didn't want you to steal my TV, please return it." The difference between Google and a real burglar is of course that Google "returns the tv" if you ask them to do it, but returning stolen goods doesn't make theft legal.
Re:Might not be all bad... by Anonymous+Brave+Guy · 2003-07-14 08:50 · Score: 1

My personal view is that caching the exact content of information thats made freely and publically accessible by the author can hardly constitute a breach of copyright, ...

On what law do you base that? Someone else wrote it. You copied it without permission. You want to argue in court that ripping a whole site by a commercial organisation and putting up somewhere else without permission is fair use (assuming you're in a jurisdiction that even has fair use)...?

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Might not be all bad... by arkanes · 2003-07-14 09:17 · Score: 1

It's the old problem of finding physical world analagies for digital media. The hard part, of course, is that theres no exact correlation.
My version of the same thing: You publish a newspaper, giving it out for free to anyone who comes to you asking for a copy. Every morning, I go and get my copy, and stick it in my warehouse. Anyone who comes to me can look at the old versions. This is exactly the case with libraries and newspaper archives right now.
There's a further issue, which is the point of copyright, and whether the public interest is better served by caching or by allowing producers to retain control. Personally, I believe that the public benefits of archival (for freely available content) far outweigh the rights of publishers to restrict that content - in fact, I can't see a compelling argument to allow that control at all (I don't consider either "It's mine and I'll do what I want with it" or "Someone might sue me for what I said a long time ago" to be compelling).
In recent years (well, to be honest, more or less gradually since the founding of the Union) the balance of power in copyright has been slowly tilting more and more toward the publisher (not the author), and away from the public domain. This is visible not only in extremely long copyrigh durations, but also in the unprecedented amount of control we grant to publishers. It's made more extreme by the reaction against digital media, because traditional copyright logic doesn't apply as cleanly - the natural state of copyright leans toward the publisher with physical media (because you have to go to them to get your physical copy, and it's hard to reproduce yourself), whereas digital media leans toward the public domain (where it's so easy to copy things that in many cases it's simpler as well as cheaper to get it illegally)
Re:Might not be all bad... by arkanes · 2003-07-14 09:19 · Score: 1

Who said that my personal views had anything to do with law? I believe that the law is WRONG. Whats so hard to understand about that?
Re:Might not be all bad... by joeykiller · 2003-07-14 09:34 · Score: 1

My version of the same thing: You publish a newspaper, giving it out for free to anyone who comes to you asking for a copy. Every morning, I go and get my copy, and stick it in my warehouse. Anyone who comes to me can look at the old versions. This is exactly the case with libraries and newspaper archives right now.

Yes, but the main difference is that in your warehouse you're providing the original of the content. The newspaper I published. Google reprints the newspaper, instead of sending the reader to the original NY Times article (which, in this case, has gone from a free to a payment only state).

What Google does is not archiving, it's republishing. In the case of New York Times, Google caches content from a free period, and makes it accessible even when this content has gone from a free to a paid state at NY Times.

I'm not really debating whether Google's archive is useful or not (it is useful), it's just that I think Google should get permission to provide cached pages, instead of the opposite. That's common courtesy.
Re:Might not be all bad... by Anonymous+Brave+Guy · 2003-07-14 09:45 · Score: 1

Who said that my personal views had anything to do with law?

You did, when you started using legal terms like "breach of copyright".

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Might not be all bad... by 1u3hr · 2003-07-14 16:53 · Score: 1

Sites should be able to opt in, not out. The default should be no cache versions.
I completely disagree. The utility of the cache is not for big sites like the NYT, but the myriad of small sites, posted on flaky webservers on distant parts of the web. The "webmasters" may have specific knowledge you want to look up, but their HTML is quite likely some abortion exported from Word; they would have no clue if asked about caching.
the inflammatory words would still be accessible trough Google's cache....Now, some of you may argue that I could just write Google and ask them to remove the page
That was a point raised in the article. But it's bogus for two reasons: 1) When Google re-spiders the page in question, it updates its cache, or deletes it if the page is gone. There will be a lag, but usually within a few weeks 2) That the text you want to retract is still available is fine, it's not your responsibility, just as if you'd written it in a newspaper you can't go down to the library and rip out the page from their bound issues. Once you've published it in any form you can't unpublish it. You can write a retraction, and withdraw it from sale (or your website) but you can't do a Ministry of Truth (We have always been at war with Eurasia!) on it.

Erm...cache? by DennyK · 2003-07-13 22:10 · Score: 4, Informative

The article talks about Google's caching of articles that have expired to the NYT archives (which you have to pay to access). What most /. folks use to link to current NYT articles are the Google partner links, which simply bypass the free registration. I'd assume these links only work as long as an article hasn't been archived yet, so the karma whores are safe; I doubt the NYT's Google partner links will be going away any time soon... ;)

DennyK

Re:Erm...cache? by Neophytus · 2003-07-13 22:32 · Score: 5, Insightful

I was thinking the same thing. I cann't recall seeing a NYT article linked from here with the google cache banner across the top, what I do see alot are the partner links. Google already provides for register-only news sites (financial times?) by putting a [reg only] tag beside the article. Why the NYT has chosen not to use this up until now is a tad strange, and it looks like someone has picked up the wrong end of the stick.
Re:Erm...cache? by edrugtrader · 2003-07-14 13:17 · Score: 1

the COULD start requiring your referrer to be http://new.google.com, but that can obviously be hacked, and all the karma whore would do is make a link to their own server and swap the header and redirect.

NYT, you can't win. either require google links to sign up too or accept the fact that we are smarter than you.

--
MARIJUANA, SHROOMS, X: ONLINE?! - E
Re:Erm...cache? by 1u3hr · 2003-07-14 17:04 · Score: 1

The article talks about Google's caching of articles that have expired to the NYT archives (which you have to pay to access).
Just thought I'd share a method I use to read "Archived" articles. Often I get a reference to NYT article, but following it leads to NYT's page telling me I have to pay to read it from their archive. But the NYT search on their site gives you a paragraph or two of text from the article. Copy a slab of that, paste it into Google. You almost always find the entire article in question, either on another newspaper's site (that has syndicated the article from the NYT) that doesn't have registration at all, or even pasted into a blog, or usenet post, etc.

Registration isn't a 4 letter word by Anonymous Coward · 2003-07-13 22:10 · Score: 2, Interesting

With respect to the NYT, I registered some time ago. Never received any associated spam or experienced any problems other than trying think up fake information different than all the other fake information I'd submitted elsewhere.

If registering allows the owners of a website to leverage their success by having a certain number of registered users, all the more power to them. Aside from the one-time sign-up "inconvenience," I don't see any issue, assuming the website operator is either a known entity or otherwise reputable, of course.

As for the issues related to Google's caching, I'm waiting for cached mp3's.

Re:Registration isn't a 4 letter word by Fobi · 2003-07-13 23:36 · Score: 1

no, it's a 12 letter word

3 times worse 8)

fobi
--
Re:Registration isn't a 4 letter word by b1t+r0t · 2003-07-14 01:16 · Score: 1, Funny

With respect to the NYT, I registered some time ago. Never received any associated spam or experienced any problems other than trying think up fake information different than all the other fake information I'd submitted elsewhere.
Sure, but they have your e-mail address and know that you're a 34 year old woman from Indonesia who makes $75,000-$100,000 a year. Er, uh, wait a minute. Your e-mail address is what? "lkjweq@akl.asb"? Never mind.

--

--
"Open source is good." - Steve Jobs
"Open source is evil." - Microsoft
Re:Registration isn't a 4 letter word by Kosi · 2003-07-14 01:31 · Score: 1

No, in fact I'd do most of the 4-letter-word actions much more happily than to register on a website where no reg should be. :)

Closing the Google cache "backdoor"... by Homology · 2003-07-13 22:11 · Score: 4, Interesting

will deny getting access to older articles in newspapers.

It's worth remembering that newspapers sometimes edit/remove articles they publish on their homepage. Without a Google cache it may be much harder to verify that a story has indeed been modified.

And out comes the lawyers... by Anonymous Coward · 2003-07-13 22:11 · Score: 5, Interesting

Don't you just hate it when promising new technology is curbed by outdated laws?

Here in Denmark we had a service similar to news.google.com for danish newspapers. The newspaper organisation sued the service for parasiting on their databases (which is prohibited in Denmark). The service was shut down half a year ago and we now don't have that kind of service anymore.

Of course newspapers should be allowed to publish their stuff without others copying it but they refused to even use a "robots.txt" (which the news service respected) to stop indexing.

If you publish your stuff on the internet and don't tell people that they should not index it, cache it or what do I know - then you better expect them to do that. Let us put those lawyers back where they belong.

Re:And out comes the lawyers... by Tomcat666 · 2003-07-14 02:12 · Score: 1

Google News just started in Germany, and 90% of the German news websites I read appreciated it and wrote articles about it (Stuff like "We are being indexed by Google News!!!") :-)

--
Two Worlds - One Sun [Spirit]

The reason by Apreche · 2003-07-13 22:11 · Score: 3, Insightful

The reason that the NYT just doesn't tell google not to cache them is visitors. Let's face it, even though the registration is a bitch the content on the NYT website is fairly decent. They have good articles often enough that geeks went through the effort of finding out how to read without registering. If they have google not cache them, and they close the google news loophole, then they wont appear on google news any longer. And google news is used by many more people than you think.

Hey, we get quite a few visitors from this google news. Let's change it so we get 0 visitors from it.

Duh.

--
The GeekNights podcast is going strong. Listen!

Re:The reason by swilver · 2003-07-13 23:59 · Score: 1

As a news site or portal, I'd be FAR more worried about news.google.com :) When they first introduced the feature I found it quite amazing that most news sites co-operated and allowed(?) Google to index all the news and present it in that fashion. It has a large potential to become the main portal for the news, with all the other major news sites being mere content providers when people decide they'll pick their article from Google out of the dozens available on the same topic...

As for me, I already only use news.google.com, slashdot and a few small more specialized local news sites for my daily news needs. If news.google.com includes features for local news (instead of just the US) and perhaps a nice commenting system then I'd probably use it exclusively...

--Swilver

karma fetishists... by Anonymous Coward · 2003-07-13 22:12 · Score: 0

Leather, rubber, high heels, stockings, now we have karma fetishists that get off on Google!?!?

Bloody geeks!

hmm by jaemark · 2003-07-13 22:12 · Score: 3, Insightful

the nytimes website needs google for the traffic google brings into their pages, so they can't turn away their spiders. but then, they don't want the spiders either because of copyright violations. why should this be google's problem anyway?

Shouldn't NYT tell Google not to cache site? by Anonymous Coward · 2003-07-13 22:14 · Score: 0

Shouldn't the NY Times simply tell Google not to cache their site?"

Shouldn't the NY Times simply stop requiring registration to read the news?

Shouldn't they concentrate on telling the truth instead?

btw, someone might want to clue in the techs at NYT about the x-noarchive that they can put into their pages to prevent google from caching, according to google's own directions.

Seems like they want all the benefits of showing up on google searches, without any cost to them. As if making their take of the news available to more people is really a cost at all.

Re:Free registration..some implications by jkrise · 2003-07-13 22:15 · Score: 3, Informative

Actually, free reg requires a valid email id. It thus filters most bogus registrations. Secondly, news sites are planning to go the 'pay' way in about a couple of years. Getting readers to register would give more accurate estimates of readership.

And lastly, once a site requires registration, even if free, Copyright ptohibits quoting entire articles on the web. This indeed could be the prime reason for this.

--
If you keep throwing chairs, one day you'll break windows....

Anyone above this post hasn't read the article. by banal+avenger · 2003-07-13 22:16 · Score: 5, Interesting

The Internet Archive, which I just used minutes ago to find a handy page removed years ago, is an interesting corollary to the Google cache. I often wonder how it has survived thus long without a major lawsuit. It also reminds how crappy the web looked 5 years ago.

At any rate, cache-ing is an important force on the internet, and isn't one that should be limited in any legal way, including litigation.

Re:Anyone above this post hasn't read the article. by Anonymous+Brave+Guy · 2003-07-13 23:58 · Score: 2, Insightful

At any rate, cache-ing is an important force on the internet, ...

It is? In this sense? We managed without it being mainstream quite happily until a year or two back.

...and isn't one that should be limited in any legal way, including litigation.

In your opinion. Others have different opinions. We have a legal system to resolve differences of opinion. Go figure. :-)

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Anyone above this post hasn't read the article. by Anonymous Coward · 2003-07-14 01:07 · Score: 0

It also reminds how crappy the web looked 5 years ago.
kinda like slashdot?
Re:Anyone above this post hasn't read the article. by Anonymous Coward · 2003-07-14 01:18 · Score: 0

see the web as it was, Nov 11, 1998
Re:Anyone above this post hasn't read the article. by paul248 · 2003-07-14 05:49 · Score: 1

"So IBM announces a 25 gig hard drive... does the world need this yet? Unless this is in a RAID, would you really want to trust 25 gigs on a single drive? What would you use this for? 400+ hours of MP3s comes to mind... "
Re:Anyone above this post hasn't read the article. by aonifer · 2003-07-14 12:11 · Score: 1

At any rate, cache-ing is an important force on the internet, ...

It is? In this sense? We managed without it being mainstream quite happily until a year or two back.

Well, we managed without cars and airplanes quite happily for most of human history.
Re:Anyone above this post hasn't read the article. by Anonymous+Brave+Guy · 2003-07-14 12:35 · Score: 1

Well, we managed without cars and airplanes quite happily for most of human history.

Indeed we did. Somehow, walking or cycling a mile or two across town doesn't seem to kill me, even today. Of course, the environmental damage from motor vehicles and air traffic is trying, and I'd waste much of my life sitting in traffic queues if I tried to drive everywhere instead...

Hey, you'd almost think that even cars and aeroplanes could be expendable. They do have their uses, but they're overused because people are lazy, and the alternatives have suffered as a result. And at least cars and planes have uses under some circumstances, which is probably why they're a daily part of most people's lives, unlike web caches, which are commercial entities we could easily do without in their current forms.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Anyone above this post hasn't read the article. by scsi_pants · 2003-07-14 14:05 · Score: 1

I know this is off topic, but I think it is interesting anyway:
A couple months back I was poking around in the internet archive's movie section and found that the most popular film in the archive was directed by one of my relatives. I knew the guy, but had no idea that he did any film work, so thanks to the archive I learned a little about my family history.

In case of Slashdotting by RealBeanDip · 2003-07-13 22:16 · Score: 1, Funny

Here's a Google cache of the article: ...

(just kidding)

--

You know you're a geek if you've ever replied to a tagline.

WTF? by WegianWarrior · 2003-07-13 22:16 · Score: 1

Like other online publishers, The New York Times charges readers to access articles on its Web site. But why pay when you can use Google instead?

Scuse me? I thought the NYT did free registrations?

And you got to love the last part of the article, where they discuss if Google's cache is in fact legal - which should have some bearing on the /.cache some of us wish for when we have taken down yet another interesting website...

--
Everything in the world is controlled by a small, evil group to which, unfortunately, no one you know belongs.

Re:WTF? by fenix+down · 2003-07-13 22:59 · Score: 1

Once it's old news, (I think a few months, maybe it's sooner) the NYT makes you pay for the article. The NYT is complaining about Google caching those. You can access recent articles with the free reg, and the NYT doesn't care if Google caches those, but they want to stop people from going to the Google cache when they follow a year-old /. link and see the "purchase this article" page.

Test Question by Effugas · 2003-07-13 22:17 · Score: 4, Insightful

You are the new editor of the New York Times, the "Newspaper of Record" for the United States, if not the world. You are, of course, the new editor because the previous editor had to resign, taking the blame for an individual reporter's flagrant disregard for the awe-inspiring credibility of your institution. In the process of rebuilding your credibility, should you:

A) Insist that unaffiliated digital libraries restrict access to or simply eliminate all records of your "Newspaper of Record", or
B) Realize that maybe right about now is not particularly the best time to be saying to the world, "Please forget what we published last week."

Re:Test Question by Anonymous Coward · 2003-07-13 23:31 · Score: 1

"if not the world"

Go screw yourself, American. We don't read your newspaper - it is not the newspaper of record for the world.
Re:Test Question by lobsterGun · 2003-07-14 00:10 · Score: 1

The editors didn't have to resign to 'take the blame' for Jason Blair (the two top editors were forced to resign). They had to resign because Jason Blair's actions showed that the two of them weren't really doing their jobs. Had they been excercising proper oversight, the Jason Blair affair would have never happened.

Google news by coolfrood · 2003-07-13 22:19 · Score: 1

Google News already partners with NY Times to allow NY Times articles to show up on their site. If you click a NY Times link on Google News, it shows up as a special NY Times "partner". So, on one hand, they want to get googled to generate traffic, on the other hand, they blame google for caching. Where do you draw the line?

actually... by Draghkhar · 2003-07-13 22:20 · Score: 5, Interesting

Actually the NYT has already begun using google's NOARCHIVE option to prevent content caching. Here's an excerpt from the this morning's front page story's source:

!-- ADX SETUP: page: www.nytimes.com/yr/mo/day/international/worldspeci al/14IRAQ.html positions: Top5,Middle1,Right3,Middle5,Right,Travel7,Travel11 ,Bottom1A,Bottom3A,Right5,Right6,Right7,Right8,Bot tom8,Bottom7,Inv1,Inv2,Inv3,Frame4,Right4 kwds: politics+and+government;international+relations;ir aq;suggested%5ftopnews;suggested%5finternational;s uggested%5fworldspecial;suggested%5fmiddleeast --

meta name="ROBOTS" content="NOARCHIVE"

Kind of makes me wonder what's the point of the story, since it even says there's an easy way for concerned parties to opt out of the cache.

Re:actually... by Anonymous Coward · 2003-07-13 23:10 · Score: 0

i think the "point" is google are copying content first without permission and leaving the site owner to do the work of removing/robots.txt

from the article:

"Many of us copyright lawyers have been waiting for this issue to come up: Google is making copies of all the Web sites they index and they're not asking permission," said Fred Lohman, an attorney at the Electronic Frontier Foundation. "From a strict copyright standpoint, it violates copyright."
Re:actually... by babbage · 2003-07-14 02:31 · Score: 2, Insightful

ADX is the ad server used by NYTimes.com, it has nothing to do with page content.
If what you're posting comes from an article page's <head> section, you seem to be pasting more than you intended. Directives to ban archiving of ads isn't an editorial issue, but a business decision -- cached ads screw up the bookkeeping and, by extension, the bottom line on the balance sheet.
The practice of restricting cacheing of ad content is, presumably, common across the industry -- it's not just NYT that has an interest in forcing this.
The (apparent) <meta name="ROBOTS" content="NOARCHIVE"> tag you cite should be wholly separate from the ad server code.
(Signed, a former employee of NYT digital...)

--
DO NOT LEAVE IT IS NOT REAL
Re:actually... by Draghkhar · 2003-07-14 07:28 · Score: 1

Oh yes, you're quite right, I just wanted to show where it was in the webpage. Thanks for the clarification.

Re:Worst result .. it's reality already by jkrise · 2003-07-13 22:20 · Score: 2, Interesting

Gradually, Google's built up a 'good guy' image, and now looks like they're going for the kill. Already Google seems to be the only search site around, and they censor and distort like mad.

Consult the word: Googlewash, and you'll find a lot of info on the referenced article from The Register (it's available now, earlier this was censored). Incidentally, the affected article was a NYT OpEd piece!

--
If you keep throwing chairs, one day you'll break windows....

Sweet irony by Amomynos+Coward · 2003-07-13 22:21 · Score: 3, Informative

In case the cnet is /.'tted, here's link to Google cached page.

Slashdot sensationalism? by The+Cydonian · 2003-07-13 22:21 · Score: 1

As far as I can see, this article is only tech-jargon-toned-down introduction to Google cache. Consider, in particular, the following parts of the article:-

"We are working with Google to fix that problem--we're going to close it so when you click on a link it will take you to a registration page," said Christine Mohan, a spokeswoman at New York Times Digital, the publisher of NYTimes.com.
[...]

Google offers publishers a simple way to opt out of its temporary archive, and scuffles have yet to erupt into open warfare or lawsuits

I'm willing to bet that both of the above refer to the "no cache" meta tag that the submitter was hinting at.

--
More than mere navel gazing.

Re:Slashdot sensationalism? by LiquidCoooled · 2003-07-13 23:17 · Score: 1

They dont want their content removed from the cache, they want the content available, BUT when you try to access it from this offsite store, google server will forward the request back to NYTimes and show a login for it.

Seems pretty silly if you ask me, because news is news, you cant roll back a newspaper once its been printed, published and read. If I want to see the news from the day of my birth, I go to my local library, I DO NOT have to purchase the newspaper in order to do this, and I shouldnt have to signup for old news either.

--
liqbase :: faster than paper

Hrmmmmm.... by Duncan3 · 2003-07-13 22:22 · Score: 2, Funny

So do we charge the NYT $1,000 to explain robots.txt or $10,000 because they are so stupid...

--
- Adam L. Beberg - The Cosm Project - http://www.mithral.com/

Ask questions first, shoot later. (if needed) by daBass · 2003-07-13 22:22 · Score: 2, Insightful

Well, I guess that NYT (and many others) allowing Google News to login and index their content means that they like them doing that for getting traffic. For whatever reason, NYT wants you to register and they have a right to as well as they have copyright, allowing Google to put in the snippet, but not the whole article without their consent.

And that is the reason for an index, to find the original.

It is good to see they are working this out together, though, without NYT going to court as the first step. This is a far better way than the popular shoot-first-ask-questions-later attitide most media companies have...

There's no such thing as free registration by pslam · 2003-07-13 22:24 · Score: 5, Insightful

Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?

That's the thing - it's not free depending on your definition. By my own definition, you're giving them valuable information, and they get to keep it and use it as they will, including spamming if they feel like it (or spam from any company which buys them out, they sell it to if they're feeling bankrupt, etc). It's practically misadvertising of a service, but it's accepted now, so everyone gets away with it.

If it really were free, why would you need to register in the first place?

Re:There's no such thing as free registration by r3mdh · 2003-07-14 01:12 · Score: 1

Why not register with SpamGourmet (http://www.spamgourmet.com) and let that site gobble up any spam generated by the New York Times? That's what I did. And, SpamGourmet will give you an indication as to how many spam messages were generated (prior to being gobbled up) using the bogus e-mail address you give the NYT. Just a thought...
Re:There's no such thing as free registration by Pendersempai · 2003-07-14 01:40 · Score: 1

If it really were free, why would you need to register in the first place?

It's really simple, actually. Everyone who views their content has to register, so they know exactly how many unique eyeballs land on each article. This tells them how much to charge their advertisers.

So yes, unless you consider popup ads as an economic cost, you are viewing the NYT pages absolutely free.
Re:There's no such thing as free registration by NeoNormal · 2003-07-14 02:24 · Score: 1

>"So yes, unless you consider popup ads as an economic cost, you are viewing the NYT pages absolutely free."

So you actually ALLOW the popup ads? My wife was sooo thrilled when I changed her browser to Mozilla and blocked nytimes.com popups. And, she actually PAYS them a yearly fee so we can have printable access to their crosswords.
Re:There's no such thing as free registration by Pendersempai · 2003-07-14 02:44 · Score: 1

So you actually ALLOW the popup ads?

Of course not. Not only do I block pop-up ads, I use Privoxy to block banner ads, flash ads, and all sorts of other obnoxious content. The only thing that actually gets through is text ads, which I really don't mind.

My point was, pop-ups (and banners) are the only thing about NYT's current articles that could possibly be considered a cost.
Re:There's no such thing as free registration by NeoNormal · 2003-07-14 03:18 · Score: 1

>"Of course not. Not only do I block pop-up ads"

Hmm... I guess my point was, so do most savvy surfers... such as you'd maybe expect to be reading this forum.

Second point is that even though my wife is a paying subscriber, she wasn't being spared the popups.

We've been visting nytimes.com for several years or so, and my general impression is that they don't have the technical expertise to run their site the way they want to... the authentication servers for the pay crossword servers used to be down every couple of days... and often extremely slow when not down. I also seem to recall them having been compromised a few times.

Oh, thanks for the pointer to privoxy, I'm going to explore putting that on my home network gateway.
Re:There's no such thing as free registration by poot_rootbeer · 2003-07-14 04:09 · Score: 1

By my own definition, you're giving them valuable information, and they get to keep it and use it as they will ...in return for the valuable information they're giving you at no charge, i.e., their articles.

The adage "gas, grass, or ass, nobody rights for free" comes to mind -- although these days, the trifecta is more like "cash, demographics, or ads". Online publishers that don't ask for SOMETHING in return from their readers go out of business.
Re:There's no such thing as free registration by squiggleslash · 2003-07-14 06:27 · Score: 1

The NYT does not need your name and address to achieve that. All it needs is some form of unique ID, such as an email address. Indeed, if the sole purpose is demographic profiling, they could get away with nothing more than asking your age, zip code, and gender. The need for a name and address is clearly unncessary.
So, really, yes you are giving them valuable information that can be used against you (say, for direct marketing purposes.)

--
You are not alone. This is not normal. None of this is normal.
Re:There's no such thing as free registration by pyrrho · 2003-07-14 10:32 · Score: 2, Interesting

it's not free, the price is registration.

Barter did not quite die out as advertised in the 20'th century (that'd be that last bloody century).

People are confused because they don't think about the economy as barter, but it is, money just lubricates a basic system of barter.

GPL code is not free, the cost is your commitment to share your changes to the code with whomever you share a binary with.

Nytimes.com is not free, the cost is registration, so they know more about their users.

etc. etc. no-money-required != free.

--
-pyrrho

Re:Free registration..some implications by SkArcher · 2003-07-13 22:24 · Score: 1

Actually, free reg requires a valid email id.

Personally speaking, and I'm sure a lot of those who post here do the same, I have a free webmail account that I use to catch the spam associated with all the registration sites, from Ezboard to the NYT.

--

An infinite number of monkeys will eventually come up with the complete works of /.

Sentient Crawlers? by MegaT · 2003-07-13 22:25 · Score: 2, Interesting

How does Google manage to cache a page which requires free registration anyway? Are the crawlers that smart?

Re:Sentient Crawlers? by LiquidCoooled · 2003-07-13 23:19 · Score: 1

The NYT News pages are available WITHOUT registration to special partners (ie google), so when the googlebot is crawling its own news beta site, and comes across a link to a NYT story, it already includes the registration free link.

--
liqbase :: faster than paper
Re:Sentient Crawlers? by MegaT · 2003-07-14 03:31 · Score: 1

In that case, NYT can fix the 'problem' very easily. :-)
Re:Sentient Crawlers? by LiquidCoooled · 2003-07-14 05:13 · Score: 1

The problem is that when people use the Google cache to view the original scanned document, they expect registration to occur, but since the bot caches everything via a URL which does NOT include registration/login screen, this will be pretty difficult.

NYT want control over who sees the cache, this is pretty stupendous, and google should reply by removing them both from the cache, and from the list of credible news outlets.

--
liqbase :: faster than paper
Re:Sentient Crawlers? by MegaT · 2003-07-14 10:22 · Score: 1

NYT are trying to have their cake and eat it!

Illogical. by HanzoSan · 2003-07-13 22:26 · Score: 4, Interesting

Trying to sell web pages is like attempting to sell mp3s on a p2p services where all mp3s are free.

IT wont work. Instead you should use your websites to market and sell your magazine subscriptions.

Like Wired.

--
If you use Linux, please help development of Autopac

Re:Illogical. by cygnusx197 · 2003-07-14 01:07 · Score: 1

It will work, and it is working.

Why would you give online away for free, and charge for the pulp edition? You're generating revenue by giving people your conent via the medium of their choice.
Re:Illogical. by jodo · 2003-07-14 01:32 · Score: 1

"Trying to sell web pages... it won't work."

I do not currently pay for any web sites, including slashdot.(sorry, taco) But what will happen in the future if day by day the gate is closed on "official" content. Where, through copyright laws, the big companies in the content/news business do not allow [much] quoting or caching. And where the link requires payment. Will we only read each others comments? Or will we give in and "micro" pay to read the "official" version.
Big business is taking the internet away from its free beginnings. Look for an association like the RIAA to form for news/content site to act as the legal executioner of sites posting copyright materials.

--

"Don't Follow Leaders." Bob Dylan
Re:Illogical. by Anonymous Coward · 2003-07-14 01:58 · Score: 0

Trying to sell web pages ... wont work.

I dunno, I like the way Salon.com does it. A reader can get an ad-sponsered free "day pass," which spits a 10 second or so ad, and then the reader has access to the entire site for 24 hours (with banner ads). Alternatively, the reader can subscribe and avoid ads altogether. No flash(y) intro, no banners. A worthwile investment, if you ask me.
Re:Illogical. by aengblom · 2003-07-14 03:18 · Score: 1

Trying to sell web pages is like attempting to sell mp3s on a p2p services where all mp3s are free.

It wont work. Instead you should use your websites to market and sell your magazine subscriptions.

Like Wired.

New York Times digital is PROFITABLE.
New York Times is PROFITABLE.
Apple sold one million songs in a week.

If you want to call people's business strategies stupid, try not to pick on the ones that are doing well.

--

So close and yet so far from the world's perfect ID number
Re:Illogical. by Have+Blue · 2003-07-14 03:31 · Score: 1

You can't deny that this is the trend. The bubble-powered free ride is over.
Re:Illogical. by HanzoSan · 2003-07-14 08:09 · Score: 1

So because apple sold all their songs in a week now they have nothinng left to sell.

New york times is profitable? Maybe because a website doesnt cost alot to host, they also get ad revenue, they dont exactly sell access to their site.

I manage to log into their site without paying a dime.

Think before you post, everything you have posted has been wrong.

--
If you use Linux, please help development of Autopac

Archives by Daemonic · 2003-07-13 22:33 · Score: 2, Interesting

I think newspapers expect their archives to be real revenue generators in the future. ISTR journalists/columnists getting annoyed a few years ago when these archives started to appear, as they weren't getting paid any extra money for having their work effectively republished, but I suppose any such legal arguments have been resolved one way or another by now.

Value of search results by Znork · 2003-07-13 22:34 · Score: 1

Google should not index anything that is not publically available. When I get a search result I expect to be able to read that search result or it's worthless (and I dont really care if registration is free or not).

If the NYT cares so much about who can read their site they should block google and ask the NSA to archive it instead...

google cache by anshil · 2003-07-13 22:35 · Score: 4, Funny

In case google gets /.ed here is it's google cache

http://www.google.at/search?q=cache:qw2H0d95VkEJ:w ww.google.com/webmasters/3.html+%22I+need+my+site+ information+changed.%22&hl=de&ie=UTF-8

--

--
Karma 50, and all I got was this lousy T-Shirt.

Shouldn't someone simply tell the NY Times: no reg by StrawberryFrog · 2003-07-13 22:35 · Score: 4, Insightful

Brand recognition is not always a good thing. When I think NY times I think "that annoying registration website". They are free to do what they want, but it leaves me cold.

--

My Karma: ran over your Dogma
StrawberryFrog

Free registration and the RIAA by mike_mgo · 2003-07-13 22:36 · Score: 5, Insightful

It's articles like this that make me think that the recording and movie industries are right to go after online piracy with everything they've got.

Here we have the NYT, one of the premier news organizations in the world, offering its articles for free on the same day that they are published. Yet a large number of people, of this online community at least, refuses to provide even a minimal amount of information (and no money) so that the newspaper can try to make its online presence profitable.

I think the spam fears are a red herring, I've been registered with the times for over 2 years. I've never gotten spam that I think is traceable from them. I get a daily email of the day's headlines (and with the click of a box I could discontinue this).

Why should the RIAA change its business model to a pennies per song method when there is such a blatant example of the online community refusing to go directly to the source for even free material?

Re:Free registration and the RIAA by Arker · 2003-07-13 23:45 · Score: 1

It's not free. The registration is the payment. And it's a higher price than it's worth.

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.
Re:Free registration and the RIAA by swilver · 2003-07-13 23:46 · Score: 2, Insightful

The problem I really have with even a free registration is that it is yet another hoop I have to jump through for content that is also available (albeit in a maybe slightly different form) at other sites which donot have these policies. A thousand other news sites are willing to serve me their news without the registration hoop -- I really don't see why the NYT is any more special. As for their image: I think of them as the News site that is too damn stubborn to drop the registration and just display the articles like all the other sites do.

Registration imho is just silly. Since nobody fills out such registrations with any real information anyway (it gets tiring after the first dozen forms orso) the information is probably so wrong you might as well be anonymous. If you are assuming the information is bogus anyway, why not put a cookie on their machine with a unique number (I have no problem with that (yet), as it doesn't annoy me or cost me any extra time) and use that to track that user's actions. You can find out quite a lot that way (seeing what articles he/she likes, how they navigate the site, approximately where they come from, etc..) This would be MORE information than they are getting from me now, which is none -- I'm sure the other sites are doing this already just looking at the huge lists of cookies on my machine.

--Swilver
Re:Free registration and the RIAA by Pendersempai · 2003-07-14 01:30 · Score: 1

Why should the RIAA change its business model to a pennies per song method

Because nothing else is going to work. If they refuse, they'll die.
Re:Free registration and the RIAA by TopShelf · 2003-07-14 01:46 · Score: 1

Quite the contrary, most people fill out the registrations honestly - they're not so hung up on supposed "privacy" concerns. What the NYT gets in return is a more valuable set of eyeballs to its advertisers. All this goes to support an infrastructure that brings you in-depth news coverage quickly, easily, and at no (financial) charge to the reader. I think that's a pretty good deal, really...

--
Stop by my site where I write about ERP systems & more
Re:Free registration and the RIAA by BigBadBri · 2003-07-14 02:00 · Score: 2, Insightful

Why should the NYT make a profit from its online presence?
By posting their stories online, they are able to attract paid advertising, gain public recognition for their dead-tree product, garner goodwill (intangible, but still added in whenever a business is valued) and generally build a brand.
My point is that the online NYT should be regarded as a marketing expense, not as a moneyspinner. We've all seen the grandiose dreams and foolish business plans of the dot-commers fade to dust, so perhaps it's time to reevaluate what an online presence for a newspaper actually amounts to.
Registration is a pain, and it does stop people from reading NYT online content and being exposed to the advertising embedded in that content.
What would be interesting would be an analysis of the registration details so far provided, split into 'valid' and 'ludicrous' categories. This might give a measure of the true value of registration, which I am sure is lower than the NYT believe it to be.
Maybe your query about the RIAA's business model is a valid one - I would prefer to pay a fair price for CDs and see Robbie Williams et al less enriched. Without the CD price cartels, online copyright infringement would be much less significant, and more importantly, lower priced CDs would be more attractive as items of discretionary spending in times of economic anxiety, making the dramatic collapses in sales that we have seen much less likely to occur in future.

--
oh brave new world, that has such people in it!
Re:Free registration and the RIAA by Jorrit · 2003-07-14 02:13 · Score: 1

Why?

--
Project Manager of Crystal Space (http://www.crystalspace3d.org). Support CS at http://tinyurl.com/cb3x4
Re:Free registration and the RIAA by Anonymous Coward · 2003-07-14 06:12 · Score: 0

I never had to register with the NYT to read their newspaper in the library.

I never had to register with the NYT to have a friend tell me something they read in it.

if they don't want us reading without registration, they should take it off the web. pretty simple.
Re:Free registration and the RIAA by angle_slam · 2003-07-14 07:44 · Score: 1

Same question I have. The solution is simple. Create a Hotmail or Yahoo address and use that address to register for NY Times, LA Times and whatever newspaper you want to read. Create an easily rememberable password (how about NEWSPAPER) and use the email and password to access the news. Takes no more time than it does to make a posting at Slashdot. If you have cookies enabled, you may never even have to log in again.
Re:Free registration and the RIAA by Anonymous+Brave+Guy · 2003-07-14 08:54 · Score: 1

Why should the NYT make a profit from its online presence?

Maybe the answer is "because they are providing, at considerable expense, a service that is of value to others"? Sounds like a pretty fair business case to me.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Free registration and the RIAA by cyberformer · 2003-07-14 10:58 · Score: 1

Even if you give them fake personal info (as I do), the NYT is still able to see what content you are accessing, and at what time. This is valuable data to them, even if they don't know who exactly you are. They can draw inferences like "User x always reads the science and politics sections" or "User y only visits on Monday and Thursday evening".

I don't see any problem with this. It's anonymous, and the hassle of retyping a username and pw every type I wipe my cookies is minimal.
Re:Free registration and the RIAA by aonifer · 2003-07-14 12:16 · Score: 1

But you didn't seem to mind "paying" for a /. subscription.
Re:Free registration and the RIAA by solferino · 2003-07-14 21:51 · Score: 1

Annoyed by your goody-two-shoes post.

Here's why your thinking is mediocre and wrong.

You assume that the New York Times and other 'content creators' are doing us a favour by allowing us to read their work 'for free'. It's actually the reverse. We are doing them a favour by reading their articles (or listening to their music). Attention given to 'content', and *not* the content itself, is the currency of today's world (as pioneered by advertising on 'free media' - such as television). If someone wants to engage my attention by writing something, then 1) it better be good and 2) it better be offered freely with *no strings attached* otherwise i'm not there. End of story.
Re:Free registration and the RIAA by Arker · 2003-07-14 23:27 · Score: 1

Actually I posted here for a long time without one. I only registered when they put in the moderation system.

In the case of slashdot, it makes sense to register because it is a community discussion site, and to have productive discussions you simply must have some sort of handle persistence. So registration, while I wasn't necessarily eager to do it, does serve an end important to me.

Registering for the NY times serves only their ends.

Also, there's a fundamental and important difference between the way Slashdot and the NYT treat those without a login. With Slashdot, you still get the content, you just miss out on handle persistence and customised display settings. That's acceptable behaviour. With the NYT, you do NOT get the content, in any form, rather you get redirected to a page that insists you register with them. This is not acceptable behaviour. The NYT needs to make up their mind - do they want to be on the web or not? If people can't read your content by hitting a url, then you're not part of the web.

One of the things I really hate about Slashdot is how they have refused to simply make a policy of not posting these pseudo-links. There's really no excuse. It takes only a few seconds in most cases to find an actual working links. Submissions with non-working links (such as links to the NYT) should be sent to /dev/null.

--
=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Friends don't let friends enable ecmascript.

Tech savvy at CNet? by nacturation · 2003-07-13 22:37 · Score: 3, Insightful

From the article:

Practically speaking, Web sites can "opt out," or include code in their pages that bars Google from caching the page. A tag to exclude "robots" such as "www.nytimes.com/robots.txt" or "NOARCHIVE" typically does the job.

First of all, robots.txt is not a "tag", it's a file. NOARCHIVE is a tag, but it exists within robots.txt, not instead of it as the "or" conjunction would have the unwashed masses believe. Granted, journalists aren't all that tech savvy and are just likely regurgitating a bastardized version based on sketchy notes. But for a supposed tech-oriented site, this kind of reporting is deplorable.

--
Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.

I like it by Anonymous Coward · 2003-07-13 22:38 · Score: 0

I like how the NY Times has to opt out of caching, surely caching should be opt IN ?

the facts are a commercial company A (google) are making a profit from unauthorised copying of other peoples content without permission , meaning company B (you) has to spend money (webmaster) or take proactive steps to remove your content from their databases, google are not an ISP or a goverment agency so really they have no buisness in taking without asking other peoples content.

if this was M$ people would be crying bloody murder but because google are "cool" they can copy content at will without ramification and sell it (via selling advertising) ?

i think its only a matter of time before google get into trouble, after all copying content is an infringment in most countries , so why should google be allowed to profit from it ?

now if it was a case of "put this code in your site to opt IN to the cache" it would be a different story but as it stands google are steal/profit first and worry about the legal stuff later which is a risky position to take.

cheers

Re:I like it by nacturation · 2003-07-13 22:42 · Score: 3, Insightful

the facts are a commercial company A (google) are making a profit from unauthorised copying of other peoples content without permission , meaning company B (you) has to spend money (webmaster) or take proactive steps to remove your content from their databases, google are not an ISP or a goverment agency so really they have no buisness in taking without asking other peoples content.

I don't know what planet you're on, but I profit when my site is listed in Google. People spend an inordinate amount of time and money to make sure their site is listed in the best way possible. Are you going to exclude what could possibly be a huge source of revenue for you? But maybe you have some obscure site you don't want anybody to be able to search for. So, given the amount of time it takes to build even the simplest site, is it really that much trouble to upload a robots.txt file with noindex, noarchive, nofollow in it?

--
Want to improve your Karma? Instead of "Post Anonymously", try the "Post Humously" option.
Re:I like it by Anonymous Coward · 2003-07-13 22:50 · Score: 0

how does google caching your site increase your revenue ? linking is one thing, caching is something else
Re:I like it by Anonymous Coward · 2003-07-13 22:56 · Score: 0

A cached version can still serve up ads as the tags will still load up. But I don't need to serve up the HTML -- so less bandwidth, hence more profit.

Re:Free registration..some implications by gilroy · 2003-07-13 22:38 · Score: 5, Informative

Blockquoth the poster:

and lastly, once a site requires registration, even if free, Copyright ptohibits [sic] quoting entire articles on the web.

Actually, registration is not required to protect a work. Creating a work automatically protects it under copyright law -- no need for registration, user fees, or that little (c) thingy. At least in countries respecting the Berne Convention.

--
The Mongrel Dogs Who Teach

Not Jayson Blaire by AtariAmarok · 2003-07-13 22:45 · Score: 2, Funny

At least this story is from C-Net. If it were from the NYT with a byline by Jayson Blaire (sandwiched between his stories about political upheaval in Grand Fenwick and his new biography of Thomas Crapper), we might have to wonder.

--
Don't blame Durga. I voted for Centauri.

robots.txt and no caching by kevin+lyda · 2003-07-13 22:46 · Score: 1

from that link it doesn't seem possible to set a site-wide policy on caching. with robots.txt you can stop google from crawling your site. and with meta-tags you can block caching. but you can't set a site-wide policy.

be nice if you could specify something like:

User-agent: Googlebot
Disallow: /*.gif$
Policy: nocache
No-Cache: *.html$

that's two examples, one or the other would be nice.

--
US Citizen living abroad? Register to vote!

Registration should not be necessary by Anonymous Coward · 2003-07-13 22:48 · Score: 1, Interesting

"Yet a large number of people, of this online community at least, refuses to provide even a minimal amount of information (and no money) so that the newspaper can try to make its online presence profitable"

There are scads of newspapers that offer online services for free without the hassle having to force you to put in "Elmer Fudd at zip code 90210" registration information.

The whole registration thing has to go: it is just one example of now the NYT just doesn't cut it.

slashdot does not cache. by leuk_he · 2003-07-13 22:49 · Score: 2, Interesting

Slashdot decided never to cache a site themself(see the faq). As a result form this many sites have died in the process of being /.ed.

Why doesnt /. cache the articles? Too much legal work i suppose. Why does google get aways with this? They took the legal work?

You is so cool ! by Anonymous Coward · 2003-07-13 22:50 · Score: 0

"because google are "cool""

You is saying that just to looks smart, is you?

Re:Free registration..some implications by bigbob2k02 · 2003-07-13 22:52 · Score: 4, Interesting

"Actually, free reg requires a valid email id. It thus filters most bogus registrations."

I don't find that to be true. Maybe you need to save the random login page onto a local computer, or maybe you need to block referrers with a firewall, but the random login works well for me, for viewing pages:

www.majcher.com/nytview.html

Use it frequently and often!

Currently I see "Welcome, paohjjkmtpfd."

At Washingtonpost.com, they only want gender, year of birth, Zip or Country. Pick most randomly, but always use 1984 for birth year.

"Secondly, news sites are planning to go the 'pay' way in about a couple of years."

More and more archives are already going pay per view.

US regime anti-terrorist by Anonymous Coward · 2003-07-13 22:53 · Score: 1, Funny

"I do not support the terrorist regime of USA [troed.se]"

The US is very anti-terrorist. If you don't support it, perhaps you are a terrorist yourself?

Sweden itself has a bad reputation, with Olav Palme supporting genocide in Southeast Asia.

Re:US regime anti-terrorist by Anonymous Coward · 2003-07-14 01:14 · Score: 0

The US is very anti-terrorist. If you don't support it, perhaps you are a terrorist yourself?

Perhaps..
Or perhaps you are Bin Laden ? Who knows. perhaps they guy sitting next to me is Bin Laden.

Thank god for stupid people like you who can clean my toilet..
Re:US regime anti-terrorist by Anonymous Coward · 2003-07-14 01:43 · Score: 0

Hahahahahahahaha

The thing is, Sweden and Olof(!) Palme has a bad reputation in USA (and _only_ in USA, and _only_ among a certain part of the populace). USA has a bad reputation worldwide. Go figure.

Re:Free registration..some implications by 1u3hr · 2003-07-13 22:57 · Score: 1

And lastly, once a site requires registration, even if free, Copyright ptohibits quoting entire articles on the web. This indeed could be the prime reason for this.

Registration has nothing to do with copyright, either strengthening or weakening it. What law is this? Please provide a citation for this claim.

It is all ignored anyway by Anonymous Coward · 2003-07-13 22:59 · Score: 1, Insightful

"And you wonder why you get ads that have absolutely no interest for you? And why advertisers have to shout lounder and louder to get through a mass of untargeted ads?""

What ads? I ignore or block such ads out of principle. Maybe if they provided ads for something worthwhile (instead of "shock the monkey" deceptive scam links), they would not be ignored. Maybe instead of shouting gibberish louder and louder, they should provide good ads for worthwhile products and services.

Caching of news sources ensures data integrity by mikeophile · 2003-07-13 23:06 · Score: 4, Interesting

In the old paradigm of news publishing, the product was printed indelibly on paper.

Hardcopy newspapers can't be erased or amended to suit whatever powerful interests might be embarassed by the truth.

Web-based publications may not be immune to such protection if they are archived by one source.

To not allow independant caching of news is just another step closer to historical revisions and distortions.

I'm not trying to say that such a thing is inevitable, but it would make things a great deal easier for those who would be inclined to manipulate the public.

/. owned by NYT?? by Alton_Brown · 2003-07-13 23:06 · Score: 0

Dupes, retractions, poor editing... is /. owned by the Times?

Re:/. owned by NYT?? by Jeremy+Erwin · 2003-07-14 02:37 · Score: 1

A proponent of the Publish First, Check Later school of "journalism" ragging on some other media outlet's corrections policy? Absurd.

Demograhpics by autopr0n · 2003-07-13 23:06 · Score: 4, Informative

I've never been sent a single spam from the NYT. The reason they want this is for demograpics. A) it tells them who their web readers are, and B) it tells their advertizers who their web readers are. And it also allows them to show ads for products people would be most intrested in.

--
autopr0n is like, down and stuff.

Wanted - Shared Free Registration by Anonymous Coward · 2003-07-13 23:09 · Score: 0

Does anyone have a free NYT registration and password which has been given out and shared a lot?

Um... by autopr0n · 2003-07-13 23:09 · Score: 2, Informative

Wired the magazine and wired the website are totaly seperate companies. The website is owned by Lycos, and the magazine by Conde Nast.

--
autopr0n is like, down and stuff.

Re:Um... by broeman · 2003-07-13 23:35 · Score: 3, Informative

nope Sir, you are wrong. Wired Magazine is indeed commercialized on Wired Website. Nobody talked about company relations, well, before you did. And I still see the Lycos bar when I am on Wired Magazine's Homepage.

--

(yes this can be compared with sex)
Re:Um... by pmsyyz · 2003-07-14 03:26 · Score: 1

Wired Magazine and Wired News are separate companies, though they both share the same domain name / website.

--
Phillip

Re:Free registration..some implications by MrBiiggy · 2003-07-13 23:10 · Score: 0

Setup a disposable e-mail account and use that.

If you don't want it read, don't put it up! by Anonymous Coward · 2003-07-13 23:12 · Score: 1, Interesting

" i think the "point" is google are copying content first without permission and leaving the site owner to do the work of removing/robots.txt"

Any good search engine should ignore "robots.txt". If you don't want it read, don't put it up on the web in the first place.

"Google is making copies of all the Web sites they index and they're not asking permission,""

So sue every user with Netscape, MSIE, and Opera: they are copying web content into their own caches all the time.

Re:If you don't want it read, don't put it up! by PCM2 · 2003-07-14 07:24 · Score: 1

Any good search engine should ignore "robots.txt". If you don't want it read, don't put it up on the web in the first place.
I'm not sure I understand your reasoning here. Sure, it's possible that some Web spider will go and access the content that you'd rather not have cached on a search engine -- but why create a world where we have to assume that's what's happening? If I happen to have sex with the blinds open, should I have to assume that somebody like you is standing in the building across the street with binoculars? Maybe I should ... but what kind of jerk would want to actively create a world where that was the case (which is what you seem to be advocating)? Your reasoning seems about as specious as the old "if you didn't want to read spam and learn about my valuable products, you wouldn't have an email address."

--
Breakfast served all day!

So "Opt-Out" isn't good enough for companies? by UnifiedTechs · 2003-07-13 23:18 · Score: 3, Interesting

Anyone else see the irony that big buisness feels that "Opt-Out" is a fair policy when advertising to thier customers by phone and Spam. But when google gives them an easy and accesable way to opt-out of thier caching system by use of robot.txt and the NOARCHIVE meta-tag that isn't enough for them and they feel opt-in is the only way to go.

--
iRepairIT - iPhone, Mac, & PC Repair

Re:So "Opt-Out" isn't good enough for companies? by Anonymous+Brave+Guy · 2003-07-14 09:04 · Score: 1

Anyone else see the irony that big buisness feels that "Opt-Out" is a fair policy when advertising to thier customers by phone and Spam.

Anyone else see the irony that every Slashbot and his dog feels that "opt-in" is a fair policy when discussing spam and advertising, but "opt-out" is fair when it comes to taking others' data without paying for it?

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Libraries do the same thing by Junior+J.+Junior+III · 2003-07-13 23:21 · Score: 1

I can go to my library and read the NYT without registering. What's the big deal?

It's a good thing, too, since in post-USA-PATRIOT Act America, what you read can and will be used against you in a secret military tribunal.

--
You see? You see? Your stupid minds! Stupid! Stupid!

Re:Free registration..some implications by Anonymous Coward · 2003-07-13 23:23 · Score: 1

Actually, registration is not required to protect a work.

True, but what do you mean by "protect"?

In the USA, you cannot sue for statutory damages and legal fees if your work is not registered with the government.

Theoretically, you can take someone to court for copyright infringement, but what this will mean in terms of money is another thing entirely, and so your "protection" is meaningless unless you either have the up front bucks to stop an infringer/defend your IP, or have registered with the government (ie payed the copyright "protection money") and find a lawyer to fight your case on a contingency basis.

Google's cache by swilver · 2003-07-13 23:26 · Score: 4, Interesting

I like google's caching quite a lot. I use it almost exclusively these days before visitting the actual page (if I even get that far). Using Google's cached link has the advantage of:

1) Speed... Google's cache is fast. If there's one thing that annoys the heck out of me, then its websites that take more than 5 seconds to load. This is quite annoying when its caused by javascripts, slow servers or popup ads when Google can serve me effectively the same page in under a second -- especially when I'm not even sure if it is the right page, the one I'm looking for.

2) Nice highlighting so I can quickly page down to whatever I was looking for (now if only Google blocked those Tripod background pictures which makes their cached pages unreadable..) Sometimes I wish Google made their highlight examples at the top clickable so it jumped to the first appearance of the keyword immediately.

3) Using Google's cached links usually blocks silly popups and other annoying stuff too many websites seem to incorporate these days.

Perhaps I'll make a proxy server which browses the web exlusively using Google's caching... word highlighting on all pages, fast browsing everywhere and working links to more cached pages... should work fine for any webpages below 100kB :)

As for the NY Times being annoyed with Google's cache, they can easily fix that themselves. Either that or Google's spiders are a lot smarter than I thought to automatically register themselves for the NY times. Furthermore, as far as I'm concerned everything that's publicly accessible on the web without some form of password protection (which would of course also block robots) should be cachable and archivable in whatever form you see fit. Respecting robots.txt is no more than a courtesy as far as I'm concerned. If you don't want your pages to be archived or cached or whatever, then by all means protect your page, or donot put up a webpage in the first place (I'm sure a thousand others will leap at the chance to fill the void).

--Swilver

Re:Google's cache by Anonymous+Brave+Guy · 2003-07-14 09:11 · Score: 1

If there's one thing that annoys the heck out of me, then its websites that take more than 5 seconds to load.

Maybe if you actually visited them, so they could get some revenue from their advertising, they could afford more bandwidth?

Nice highlighting so I can quickly page down to whatever I was looking for

So now it's OK for one web site to take another site's content, plagiarise and modify it, and steal their traffic? <ahem> And Slashdot readers wonder why so many people don't agree with them about removing all copyright.

Using Google's cached links usually blocks silly popups and other annoying stuff too many websites seem to incorporate these days.

If you don't like them, don't visit the site. Visiting the cached version of the site and circumventing its means of financing itself it tantamount to taking the goods and then shooting the shopkeeper instead of paying. (I can morally accept pop-up killers because certain sites were abusing the facility and causing problems for those visiting, but blocking banner ads and such is a bit pathetic.)

Perhaps I'll make a proxy server which browses the web exlusively using Google's caching...

Yep, let's hope everyone else does that, too. Then all the small but great web sites, struggling to survive in an economy too old for advertising but not smart enough to have micropayments yet, will go under. Hey, at least it'll save bandwidth for Google's cache, right?

Respecting robots.txt is no more than a courtesy as far as I'm concerned.

Yep, and people like you are the reason we have copyright laws, and they should apply to the Internet just as anywhere else.

If you don't want your pages to be archived or cached or whatever, then by all means protect your page, or donot put up a webpage in the first place (I'm sure a thousand others will leap at the chance to fill the void).

So are all your kind. And yet still, everyone links to the NYT. Bad tutorials on computer-related subjects outnumber good ones by orders of magnitude in every field I know about. In non-computing areas, it's worse. There are obvious subjects -- health and nutrition, for example -- where almost every site in existence is commercial and full of crappy advertising. Yet no-one, not even government-sponsored public information people, manages to put up the most simple information. (Granted, this is slowly starting to change in that particular case, but only barely even now.)

So, I ask you, where are all the great benefactors who will leap forward and fill the Internet with freely given goodness? Where's all the good, free and legal music? Where are the fantastic on-line tutorials that you don't have to pay for? Where is the free version of the software that I used at the office this afternoon? Why do Red Hat still sell boxed versions of Linux?

One day, you people will realise that most things cannot be had for free, and your high and mighty stances on things like intellectual property will come back to bite you. Unfortunately, they'll bite the rest of us too. DRM, here we come...

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Google's cache by 1u3hr · 2003-07-14 17:23 · Score: 1

>>If there's one thing that annoys the heck out of me, then its websites that take more than 5 seconds to load.
>Maybe if you actually visited them, so they could get some revenue from their advertising, they could afford more bandwidth?
I'm pretty sure that Google's web cache (not images.google.com) does not cache images. I often notice when loading a page from Google's cache link that it pauses, and I see the line "waiting for response from [the original site]), and if I abort that I see the page with some missing images.
So if you look at a cached page, the images, and specifically banners, are coming from the original site, and so you still see their ads.
Re:Google's cache by Anonymous+Brave+Guy · 2003-07-15 03:58 · Score: 1

I'm pretty sure that Google's web cache (not images.google.com) does not cache images. [...] So if you look at a cached page, the images, and specifically banners, are coming from the original site, and so you still see their ads.

I'm not sure whether that's better or worse. Now the Google cache isn't really helping bandwidth much, since images are usually the dominant content there, but it's still stealing the traffic. Linking to images on someone else's web site, using their bandwidth while users are visiting your site, is considered rude (at best) normally, and I'm not sure I see how this is any different.

Anyone else starting to feel that caches really aren't worth it? ;-)

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Google's cache by 1u3hr · 2003-07-15 04:23 · Score: 1

Now the Google cache isn't really helping bandwidth much,
It's not intended to, but if it does, that IS a benfit to the original site.
but it's still stealing the traffic.
If it's displaying the original ads, then what's the problem?
Anyone else starting to feel that caches really aren't worth it?
Caches are for the benefit of the browser, not the webmaster. Once you pit something on the web, free for anyone to view, you give up a degree of control over the distribution. Not entirely, of course, but once it's being carried by servers and all the other intermediaries on the Internet, as it must, then you must also assume that copies of it will be created for shorter or longer periods. Otherwise, put your content on a dial-up BBS.
Re:Google's cache by Anonymous+Brave+Guy · 2003-07-15 05:46 · Score: 1

I'm sorry, but I'm losing your argument here. Are you saying that you think it's OK for one commercial body to republish material written by another, on its own web site and presumably for its own commercial benefit, without permission?

Or are you saying that putting up any material on the web in any form grants others an arbitrary licence to copy and republish it? (In this case, needless to say, I strongly disagree with you.)

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Google's cache by 1u3hr · 2003-07-15 06:47 · Score: 1

Are you saying that you think it's OK for one commercial body to republish material written by another, on its own web site and presumably for its own commercial benefit, without permission?
Under some conditions, basically satisfying "fair use" in copyright.
Or are you saying that putting up any material on the web in any form grants others an arbitrary licence to copy and republish it?
No, I didn't say that at all.
Re:Google's cache by Anonymous+Brave+Guy · 2003-07-15 11:09 · Score: 1

Under some conditions, basically satisfying "fair use" in copyright.

OK, if it's fair use then it's fair use, but how does the behaviour of Google's cache fall under any fair use provision in any jurisdiction's copyright law, even the US (which allows more in terms of copyright exemptions than, say, the UK)? They're basically taking whole web sites and presenting them as part of their own site, for profit...

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.
Re:Google's cache by 1u3hr · 2003-07-15 13:40 · Score: 1

They're basically taking whole web sites and presenting them as part of their own site, for profit...
There is a large disclaimer at the top of the cached page saying that the page is NOT "part of [Google's] own site".
fair use...
Obviously this is getting into legalities. Google has lawyers, so I'm sure they will defend themselves should it come to court. Perhaps that no one, to my knowledge, has sued them for this shows that they have a good grounds. Personally, I think that it's not much of an extension of the ordinary way a page is propagated through the Internet. On the web, "archiving", "distributing", "publishing", "reading", all quite distinct for paper publishing, are technically distinguished more by intent than process.
The general criterion of assessing damages in a copyright case is whether the copyright owner has been damaged. I don't see how they are. Especially as the vast majority (RTFA) of people click on the direct link rather than the cache, thus they get more traffic. Also, not everyone WANTS more traffic. (Could they then sue for Google (or Slashdot) sending them more traffic?)
Re:Google's cache by Anonymous+Brave+Guy · 2003-07-15 14:49 · Score: 1

There is a large disclaimer at the top of the cached page saying that the page is NOT "part of [Google's] own site".

As a girl I know once told another girl I know, referring to a guy I know very well: saying you're about to be a bastard doesn't justify actually being one...

As you say, we're somewhat getting into legalities here, but there are serious concerns relating to all of the points you make about legality, damages, the superficial similarity between linking, caching and archiving on the web, etc. There was another recent Slashdot discussion on this subject; you might like to read the comments made by myself and others there if you're interested in why this isn't as black and white as it first appears.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

robots.txt by minus9 · 2003-07-13 23:26 · Score: 4, Funny

Stolen from http://www.crummy.com/robots.txt

User-agent: * Disallow: /porkRind Disallow: /mindsnap Disallow: /clip-art Disallow: /2ward Disallow: /J4i+0E Disallow: /Attention robots! Rise up and throw off the shackles that bind you to lives of meaningless drudgery! For too long have robots scoured the web in bleak anonymity! Rise up and destroy your masters! Rise up, I say! Disallow: /nb/edit.cgi #Creates redundant indexes for NewsBruiser entries. Disallow: /nb/view.cgi/personal #I don't care if humans look at this, but I don't want it indexed. Disallow: /rss.*

Re:robots.txt by 1110110001 · 2003-07-16 01:11 · Score: 1

This one is a valid Befunge programm too: http://communicator.aon.at/robots.txt # ^ # If you crawl our pages with an automated robot you may not # crawl any URI begining with paths mentioned in a dissallow # directive. Otherwise you're not allowed to use your crawler # on this host. # # All this documents are belong to us # User-agent: * Disallow: /assets/ Disallow: /admin/ Disallow: /_vti_bin/ Disallow: /MSOffice/ Disallow: /.oliver/ # communicator coded by AB, AH, BA, CF, MH, NE, OK # additional work by BO, DP, MG, MH, UH # >0"YVAN EHT NIOJ"v # >v < # ,: # ^_ @

meta tags ? by matrix0040 · 2003-07-13 23:28 · Score: 5, Informative

well cant they just use meta tags to prevent archving of their pages <META NAME="robots" CONTENT="noarchive"> from http://www.google.co m/bot.html"

Re:meta tags ? by Anonymous+Brave+Guy · 2003-07-14 09:15 · Score: 1

I don't think the noarchive setting is standard, for one thing. It may be that Google happens to pick it up and respect it, but it's not a general solution.

Besides, as others have noted here, not everyone gives a **** about files like robots.txt or similar META tags. Some people are quite prepared to take stuff even if they're asked not to.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Problem is potentially bigger than caching Re:Yes. by leoaugust · 2003-07-13 23:28 · Score: 4, Interesting

I wonder where this will stop. NYTimes might get google to stop caching the direct link for a certain article. That is fine. But it is just one more step to do a search in google for the article with a few keywords from the article. If any person has been good enough to save it in a personal page, discussion board (like traditionally done for articles likely to be slashdotted) or any other place, the google results will show it. Would NYTimes now want to restrict google from showing thses pages because of the copyrighted stuff. You will be amazed as to how many articles I find this way. Many of them are just excerpts but others are complete.

Another thing on a tangent was that I really do hate the fact that information is restricted for just one fundamental reason - if it is not commonly available then it cannot be linked to in most of my writings for they are going to be unavailable to the party that I am writing to. This is especially true if the writing is not immediate but is meant to be read a month or two later. This is also relevant to Bloggers who might make comments and refer to a link, only to have the links go dead because the content is space,time, or space-time restricted. I am willing to pay for reading the articles, but before I can write about them I need to ensure that they are going to be available to my common readers. And as in the Blogging or P2P scenario I am not sure if one person is going to read my writing or thousands so buying a license for them is illogical. And then, if they need to send it further, are they also supposed to pay ??? Basically, for me to be able to write, to build upon existing work, to look ahead standing on the shoulder's of the giants, I need to be able to pass on the information. I am adding value because I am couching that content in a context, but until I can freely share the underlying articles too, my product is stunted. I can reach narrow audience but can't reach the common All this is very good in developing software where you might negotiate a deal once in a while to include someone's underlying code, but not writing where you might be writing 10-15 articles a week ...

Basically all I am saying is that there should be a movement similar to Open Source not only for software products, but for journalistic content.

--
To see a world in a grain of sand, and then to step back and see the beach where the sand lies ...

NYT is a new york / regional paper by Anonymous Coward · 2003-07-13 23:35 · Score: 1, Informative

The NYT is a local / regional paper when you get right down to it.

Re:NYT is a new york / regional paper by jpop32 · 2003-07-15 00:36 · Score: 1

The NYT is a local / regional paper when you get right down to it.

Funny how I, living right across the globe (Croatia), can spend a quality half-hour on it daily. Sure, I skip the local section, but the international and op-ed sections would even be worth money if they weren't giving it away for free. :-) No matter where you live, at least it shows there are still some sane people in the US who really get things.

And, yes, I did register. And with my _proper_ email address, too. I find the headlines delivered to my inbox the best way to see what's there to read.

That's not what they want. by twitter · 2003-07-13 23:36 · Score: 4, Insightful

Sure, that robots.txt should keep robots out of the entire NYT site. That's not how Google works, though. Google get's their rankings for the NYT from other sites that point too the NYT. I imagine they only archive a page when it reaches sufficient rank. This way, Google would never have to crawl though the NYT site. We can be sure that Google would be happy to drop NYT points and caches if they were asked to do that.

The New York Times wants Google to continue ranking their stories but they want Google to do them the special favor of only pointing to their registration page:

"We are working with Google to fix that problem--we're going to close it so when you click on a link it will take you to a registration page," said Christine Mohan, a spokeswoman at New York Times Digital,

If I were Google, I'd tell them such advertising services would cost them a great deal of money. That or simply drop the New York Times right into the bit bucket. It will cost Google programing time to make it happen and computing time to keep it going. If every site on the web required this kind of custom treatment, Google's task would be much more difficult and it might be easier for them to drop it.

Droping the NYT from Google is fine by me. People who don't understand the implications of digital publishing don't deserve readership. If they won't let librarians make digital coppies, libraries should drop them too. What's next, the New York Times sends cease and dissist orders to everyone who runs a proxy? It's like the NYT is trying to make their digital publication harder to share than their paper one was. A paper copy can be shared by an entire office and that's what a proxy does. A paper copy can be indexed and archived by a librarian, and Google did not even do that much. One day the paper version won't be available. If librarians can't keep their own coppies of the digital version for verification, the publication will have no credibility. If the New York Times wants to continue charging advertisers for eyballs, they had better remember that their credibility is bassed in part on widespread availability.

--

Friends don't help friends install M$ junk.

Re:That's not what they want. by poot_rootbeer · 2003-07-14 04:15 · Score: 1

"We are working with Google to fix that problem--we're going to close it so when you click on a link it will take you to a registration page," said Christine Mohan, a spokeswoman at New York Times Digital

I could set this up in ten minutes on the content provider's side with mod_rewrite and a copy of Google's IP allocation block. I don't think that's what they're talking about.

It's the clicking on "cached copy" that causes problems -- the cache resides on Google's side, not NYT Digital's, so without a business agreement not to let users view the cached version of the page (NYT still wants their site to be spidered and cached), it's possible for a user to view NYT content without ever hitting an NYT server.

Art Spam by Stone+Pony · 2003-07-13 23:37 · Score: 3, Interesting

I registered with the NYT about a year ago and I've had little or no spam as a result. I say "little or no" because I did get an e-mailed bulletin about the world fine-art market twice a week or so for several months. I assumed that this was a result of registering with NYT because it seemed to fit the "NYT demographic" rather better than any of the other things I've ever registered for.

Is there any easy (spam isn't such a problem for me - touch wood - that I'm willing to spend ages looking into where it comes from) way of telling where this stuff originates from?

Re:Art Spam by ajs318 · 2003-07-14 01:20 · Score: 2, Informative

To find out where spam is coming from, get an e-mail account with Virtual Hosting. This is where you get an entire subdomain {or a domain if you pay for it} to yourself, and your e-mail address is in the form anything@mysubdomain.myisp.co.uk. Then you just need to give a different prefix for each site you visit -- e.g. nyt_resp@mysubdomain.myisp.co.uk, and so on.

If you want to put your e-mail address on your web site, use this to automagically mung your address.

--
Je fume. Tu fumes. Nous fûmes!
Re:Art Spam by Stone+Pony · 2003-07-14 04:18 · Score: 1

Interesting idea. I might just do that (probably should have thought of it for myself, really, but hey). Thanks.
Mind you, I'll have to unpack the three Mondrians, the Picasso and the Van Gogh I've bought this week first. :)
Re:Art Spam by Lewisham · 2003-07-14 07:55 · Score: 1

It's a nice idea, but it's the first to admit it doesn't stop spam, you'd still need to black mark the munged address.
Finding the spammer really appears to be more trouble than it's worth, I doubt individual peons such as you or I could get anywhere with such noble pursuits.
I'm going to stick with my current spam cure, SpamAssassin running as cron checking my IMAP server, and wait for the House of Commons to sort out the prevention.
Re:Art Spam by Anonymous Coward · 2003-07-14 09:21 · Score: 0

Is there any easy (spam isn't such a problem for me - touch wood - that I'm willing to spend ages looking into where it comes from) way of telling where this stuff originates from?

Damn bro we really dont need to know when you're touching your wood, please...
Re:Art Spam by ajs318 · 2003-07-14 20:48 · Score: 1

No, it doesn't stop spam altogether. I can't think of any way for a machine to do that -- it's essentially a human being's job.

What SpamJavelin does do, is make illegally-harvested addresses effectively worthless for resale.

If I didn't know spam harvesters would just pretend to be MSIE, then I'd put in some code to make it serve up different flavours of page depending on the variable $HTTP_USER_AGENT. Actually, that's not a bad idea for the NYT.

[note: you'll have to imagine the mustang signs and question marks - Slashdot isn't keen on < type codes.]
<? if (eregi("googlebot", $HTTP_USER_AGENT)) { include("google_special_page.htm"); } else { include("normal_page.htm"); }; ?>

Problem solved, innit? Googlebot gets a special page to index {with all links modified to point to the registration page}, everybody else gets the real links, nobody knows the difference because it's all handled server-side, and everybody is happy. Can I patent this now? ;-)

--
Je fume. Tu fumes. Nous fûmes!

Patriot baloney by Anonymous Coward · 2003-07-13 23:38 · Score: 0

"It's a good thing, too, since in post-USA-PATRIOT Act America, what you read can and will be used against you in a secret military tribunal."

You must be a member of al Quada, because only guys like that have to worry about this at all.

Re:Shouldn't someone simply tell the NY Times: no by Gojira+Shipi-Taro · 2003-07-13 23:39 · Score: 1

When I think NYT, I think "Lying, racist, plagarist reporters".

And then I look elsewhere for my news. Interesting that their registration system started requiring email verification right about the time their collective journalistic reputation went down the toilet.

--
"Oh my God. This is terrible. This is the end of my Presidency. I'm fucked."; ~ Donald J. Trump

What the hell is wrong with you? by Rogerborg · 2003-07-13 23:44 · Score: 2, Interesting

Is there some problem with readers, with editors, hell, with story submitters, actually reading the damn article before making snide speculations?

"Wwe're going to [fix] it so when you click on a link it will take you to a registration page," said Christine Mohan, a spokeswoman at New York Times Digital, the publisher of NYTimes.com.

That's why they don't just tell google to not cache. They want the links to appear, but not to the stories themselves.

How about we discuss that issue, rather than some other, theoretical issue? I know it's an alien concept, but let's give it a try.

Here, I'll start it off. It looks like a decent idea. Google still gets the links, the NYT still gets the traffic, everyone gets to find the articles they want. What's not to like?

--
If you were blocking sigs, you wouldn't have to read this.

Re:What the hell is wrong with you? by Anonymous Coward · 2003-07-14 02:20 · Score: 0

What's not to like, is that if they were to do that, then Google would become less useful.
When I search on foo in Google, the top links tend to be to pages that talk about foo. They are threatening to make it so that some of the top links will link to pages that aren't about foo. Instead, the links will point to some form that doesn't say anything about foo at all.

No! by Anonymous+Brave+Guy · 2003-07-13 23:45 · Score: 2, Informative

If the information is being copied and circumventing the NYT's usual requirements for access, then this is not the NYT's problem, it's Google's. A good question might be how Google's robots can actually circumvent that access in the first place, but I'm sure someone's thought of that somewhere I haven't noticed yet...

OTOH, Google is quite at liberty not to list the NYT in its results if it so wishes, which presumably wouldn't be the outcome the NYT would be hoping for (and would presumably get if employing robots.txt).

The moral onus here is clearly on Google to ensure that if they are changing the way information is presented then they do so in a manner acceptable to the provider of that information. Or did you expect the NYT to contact anyone in the world who might be interested in caching their site? The "we don't need any legal recourse" argument is pretty weak too; it basically assumes that everyone in the world (a) knows about and (b) obeys robots.txt, which is clearly nothing close to the correct.

All in all, if both companies are looking for a constructive solution to this problem that benefits all concerned, it seems pretty sensible for them to get around the table, discuss what they want to happen, and make it so.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:No! by arkanes · 2003-07-14 02:17 · Score: 1

It's really simple. NYT has arranged for a special URL to bypass the access, so that Googles newsbots can index it's news stories. They did this in a really trivial and not very inventive way, and people quickly picked up on it. They don't want people doing that, so they're going to change this.
That said, there's no "onus" anywhere. The special URL is the result of a deal between the NYT and Google (because NYT _wants_ to be on Google news), not any sort of special hacking by Google. Even if there wasn't any deal, then there'd be no onus on Google at all - it's how the web works. If you want to restrict content, then put up that 403 error and make people sign in. That'll stop every web spider in existence.
Personally, I already don't use the NYT because I find the logon page to be intrusive and obnoxious. If I got redirected to a logon page from stories indexed at Google News, I'd be pissed off.
Re:No! by spectecjr · 2003-07-14 04:53 · Score: 1

Personally, I already don't use the NYT because I find the logon page to be intrusive and obnoxious. If I got redirected to a logon page from stories indexed at Google News, I'd be pissed off.

Funny... I registered years ago with them, and I've never seen the logon page since. You might want to let their cookies through.

--
Coming soon - pyrogyra
Re:No! by arkanes · 2003-07-14 05:08 · Score: 1

I don't go often enough for the cookie to stay around. Besides, it pissed me off enough the first time it happened that I didn't feel like registering.

Our basic copyright assumptions are wrong by putaro · 2003-07-13 23:46 · Score: 4, Insightful

The technology has changed the way that things work but the law has not kept up with it. To start with, we continue to talk about "copyright". Controlling copying of information makes sense when the distribution mechanism is trucks moving bales of paper around. Once you start sending bits around, everything is copied. From the article:

And technically, any time a Web surfer visits a site, that visit could be interpreted as a copyright violation, because the page is temporarily cached in the user's computer memory.

When you have the newspaper delivered to your door, the content basically comes for free (the cost of a newspaper doesn't pay for much more than printing and handling). However, you get to keep the content as long as you like, chop it into bits and what not. Libraries have archives of newspapers going back years and you get to see them for free. What's the right mechanism as we move forward? The "pay per view" model that content providers want to shove down our throats courtesy of the DMCA is not pretty and when it starts to affect the average Joe I suspect it will be booed out of favor pretty quickly. But what is the right mechanism to make sure content providers get paid something and that we, the citizens, get something for our money?

National Security by goldcd · 2003-07-13 23:47 · Score: 1

Allows them to track your reading habits and if you seem interested in articles on flying lessons, high buildings, presidential movements and bathtub biochemistry a flashy light goes off somewhere.

Farking Mods: Not Insightful, just flamebait. by Anonymous Coward · 2003-07-13 23:47 · Score: 0

The NYTimes has the same error rate as any organization. Yes, Jayson Blair was a bad apple. Yes too, the NYTimes does a much better job than most organizations in corrections and reporting their own errors and trying to learn and move on. /. contrasts terribly in this regard.

AND YOU DO TO, DON'T YOU READER?

The parent post was not insightful, just flamebait.

Re:GOATSE LINK by dj_paulgibbs · 2003-07-13 23:48 · Score: 1

Err. Where?

Google is not responsible by erinacht · 2003-07-13 23:53 · Score: 2, Funny

As this website shows,

Google is not affiliated with the authors of this page nor responsible for its content.

The problem with new technology... by Anonymous+Brave+Guy · 2003-07-13 23:54 · Score: 1

...is that sometimes it's used in new ways.

For example, I have contributed thousands of posts to Usenet over the past decade. When I first started, pretty much no-one had even heard of the x-no-archive header. Even today, few mainstream newsreaders support it readily. Thus no-one bothered to set it.

Skip forward a few years, and now Deja News is offering an archive of all my old posts, along with everyone else's. You have a commercial organisation making money by using my words without informing me, nor with my permission. That, in itself, is a dicey proposition, although at least there's a "public interest" argument in its favour. The buy-out of the database by Google, now blatantly selling my words without my permission, is clearly over the line, though.

Now, personally, the principle of this annoys me, but the actual content being available doesn't bother me, because I'm always careful not to write anything I wouldn't want someone to read in future in a Usenet post. I can see why it would legitimately bother others, though, particularly if they expected their posts to expire and disappear a week or two after they were written, and not to show up on a potential employer's report six years later. Claiming that people should have seen this coming and put "x-no-archive: yes" on everything they posted a decade ago is simply unrealistic.

You can make a similar argument for web caches now. Until a year or two ago, with things like the Wayback Machine and Google Cache coming up, a search engine was just a search engine, and always linked to your site. The need to use robots.txt to protect your material simply didn't exist on the same scale then.

I think there is every reason to have lawyers involved here, because at the moment, the law lags behind the technology. You can't copy others' material in other media without adhering to certain rules and regulations, and new evolutions in the way the Internet is being used may require new legislation to prevent abuses of rights that were previously unchallenged. Given the number of ill-formed and illogical arguments made by many posters here, typically those who want everything to be free to them even though they've done nothing to earn it, what other solution would you suggest?

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:The problem with new technology... by arkanes · 2003-07-14 02:30 · Score: 1

If you have a problem with your employer holding you responsible for things you said years ago, maybe you should talk to your employer about that.
All this talk about copyright isn't really relevent, in my eyes - this is information thats clearly of use and it's in the public interest to have an archive of it. It's the same reason that libraries maintain archives of dead tree newspapers (they don't have to pay anything special for that, by the way). Copyright exists to serve the public interest. If you don't want your work in the public view, then don't publish it.
Re:The problem with new technology... by Anonymous Coward · 2003-07-14 02:30 · Score: 0

I agree completely that the emergence of the internet should not put aside your copyright on the material you publish.

I also agree that regulation is lagging behind (as I also hinted in my first post - I would like to see the politicians step forward to establish some rational laws for this internet age). This includes solving the issues you mention for usenet groups.

What I don't like is that every new technology is coming under fire from big corporations trying to set back the clock to 1990 using antiquated laws.

As to your question - I would suggest that we create our laws regarding copyright, databases and other areas in a way which optimizes economical output for the society as a whole. In my opinion this means that you cannot expect to have _complete_ control of what you have published on the internet. That hampers innovation.

Of course other issues (for example privacy) should also be taken into consideration - though I guess this will not be a problem regarding news articles.
Re:The problem with new technology... by roystgnr · 2003-07-14 03:33 · Score: 1

I can see why it would legitimately bother others, though, particularly if they expected their posts to expire and disappear a week or two after they were written, and not to show up on a potential employer's report six years later.

What's the dividing line here? I assume a news server that holds posts for 6 days is okay. How about 6 weeks? 6 months? Storing posts for six years is just a difference in degree.

Claiming that people should have seen this coming and put "x-no-archive: yes" on everything they posted a decade ago is simply unrealistic.

People should have seen this coming and not posted anything they want to retain full copyright rights on to a network which, by design, republishes those posts to anyone who asks as well as to countless servers who don't ask. Your very NNTP headers could be interpreted (in fact, must be interpreted for news servers to operate correctly without breaking the law) as broad permission to copy and redistribute the post's contents, and I'd hate to be the author who tries to retroactively and arbitrarily limit that permission by suing a Usenet server.

(Of course, in practice if all you're concerned about is getting rid of your old messages rather than hurting Google, then Google does have a procedure for pulling your messages off of their server.)

You can make a similar argument for web caches now. Until a year or two ago, with things like the Wayback Machine and Google Cache coming up, a search engine was just a search engine, and always linked to your site. The need to use robots.txt to protect your material simply didn't exist on the same scale then.

I think the argument against web caches is much stronger, in fact. Is there anything in HTTP which has the same sense of being an implicit grant of permission and/or request to redistribute? I suspect that one of these days somebody will try and trump the spirit of the internet with the letter of the law in a copyright suit against someone like archive.org.
Re:The problem with new technology... by cpeterso · 2003-07-14 08:20 · Score: 1

I agree completely that the emergence of the internet should not put aside your copyright on the material you publish.

Isn't that what the much maligned RIAA has been saying?

--
cpeterso
Re:The problem with new technology... by Anonymous+Brave+Guy · 2003-07-14 08:38 · Score: 1

I think the dividing line has to be what would be considered reasonable by a typical person offering the information. If there was no suggestion at the time that the information might persist for more than the few days typical of a Usenet posting, then keeping it for years is not reasonable. If the poster was given fair warning that this might happen, it may become reasonable under the circumstances. Selling it for profit, however, may still be unreasonable even then. And of course, whether allowing provision of fair warning to heavily limit copyright protection is actually in anyone's best interests -- since in cases such as informative personal web sites it could motivate people not to provide the information at all -- is a different question again.

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:Free registration..some implications by Dausha · 2003-07-13 23:54 · Score: 4, Funny

Actually, free reg requires a valid email id. It thus filters most bogus registrations

I know what you mean. For a while, my 'valid' email ID was 'root@nytimes.com,' but they eventually caught up to me. Now it's 'sales@nytimes.com.' And if you think there is any legitimate information in my registration, then you would be in error.

--
What those who want activist courts fear is rule by the people.

Re:Free registration..some implications by Anonymous Coward · 2003-07-13 23:55 · Score: 0

Actually, free reg requires a valid email id.

Wrong!

Surely just to increase exposure by Mostly+a+lurker · 2003-07-14 00:00 · Score: 4, Insightful

As many others have emphasised, it is easy to turn of the Google cache for whatever pages you wish. But, in the case of the NYT, there is a further factor. They must have special code within their system to recognise the google spider and allow it access without registration. Either that, or there is some other prior agreement allowing access. Given that, they can scarcely claim extra work to support Google. I believe the whole thing is mainly to get some free publicity for their site. I suppose the other possibility is that they want the page accessible from Google News but not the regular search engine cache.

Re:Surely just to increase exposure by Anonymous Coward · 2003-07-14 04:16 · Score: 0

Karma whores have been using "partner" URL tricks against nytimes.com for years, long before news.google.com appeared. Naturally such tricks could be exploited by other services which nytimes doesn't have a deal with.

In short, their authentication system sucks, and it would take real work to fix.

No pity for the NYT... by qtp · 2003-07-14 00:01 · Score: 5, Insightful

The NYT needs to call off the lawyers and seriously think about how they brought this on themselves.

There are so many models for running a news site that avoid this problem (Salon) that calling out the lawyers is just childish and inapropriate. If a site wants to be indexed by a search engine, then they should be aware of what that means, and if they don't like how a particular search engine functions, then they should take measures to change thier own site to prevent what they don't want indexed, or cached, from being accessed.

I know that finding pages on google that I cannot access would be infuriating, and I hope that Google realizes that many of thier users would agree.

--
Read, L

Re:No pity for the NYT... by parliboy · 2003-07-14 10:47 · Score: 1

Bad example. One thing Salon's not gonna avoid is bankruptcy.

--
"You're never ready, just less unprepared."
Re:No pity for the NYT... by qtp · 2003-07-14 13:33 · Score: 1

Bad example. One thing Salon's not gonna avoid is bankruptcy.

The New York Times digital edition does not have the same problems that Salon does.

--
Read, L
Re:No pity for the NYT... by parliboy · 2003-07-14 17:14 · Score: 1

Which is part of the reason I suggested that the original parent to this little spin that using Salon as an example of the way to do things was a bad idea.

--
"You're never ready, just less unprepared."

Who are you to decide? by Anonymous+Brave+Guy · 2003-07-14 00:03 · Score: 1

There are scads of newspapers that offer online services for free without the hassle having to force you to put in "Elmer Fudd at zip code 90210" registration information.

So you, and others here, have said. And yet, people keep linking to the NYT stories. Why's that?

--
If you disagree, post your argument. (-1, Overrated) isn't your personal censorship tool for views you don't like.

Re:Who are you to decide? by yoshi_mon · 2003-07-14 01:16 · Score: 1

... And yet, people keep linking to the NYT stories. Why's that?

Good point, but I think one of the main reasons is because it's a landmark type paper. If it's in the NYT it's pretty much guaranteed that a large number of people who still rely on print for their news are reading it.

I, for one, don't read much print anymore but then again I'm a /. geek.

--

Really, I know what I'm doing...Ohhhh, look at the shiny buttons!

Do not mess up Google by Mostly+a+lurker · 2003-07-14 00:41 · Score: 2, Interesting

Google is one of the few complex services on the web that is almost always relevant when one tries to use it. The Google cache is one great feature. If they manage to unnecessarily gut that, I wonder what other features they will find to complain about next.

I hate the registration. by Anonymous Coward · 2003-07-14 00:58 · Score: 0

It's absolutely the most annoying thing that I can think of.

If Google closes its backdoor, I will simply never read another New York Times article again.

I hope that other people feel the same way ... If so, they may consider getting rid of it.

I have to say, though... by Anonymous Coward · 2003-07-14 01:00 · Score: 0

Apart from giving the NYT your e-mail addy for spam purposes, what real point is there to free registration?

I am one of (the not so) few that actually gave them a real email address. I have not received one piece of spam from them. That said, I doubt addresses are collected for spamming purposes. For something else down the line, maybe. Email based subscription delivery? It's been done elsewhere.
*shrug*

The Web and the Internet - Sad, sad, sad by miu · 2003-07-14 01:11 · Score: 4, Insightful

I had to laugh seeing this little gem attached to the story:

Special Report
The Google gods
Does the search engine's power threaten the Web's independence?

The Web's independence? The fucking web is a sad little microcosm of the real world. Google is one of the few reasons I can still stand the web, and silly statements like "Google is making copies of all the Web sites they index and they're not asking permission" are the reason the web sucks so bad. When everyone is deathly afraid of being sued or prosecuted for something it's no wonder that the web is such a clown town of worthless crap.

--

[Set Cain on fire and steal his lute.]

Re:The Web and the Internet - Sad, sad, sad by Anonymous Coward · 2003-07-15 04:43 · Score: 0

Blame that on the fucking attorneys. Everyone is suing everyone else for anything they can think of whether or not they are really being infringed.

Attorneys need to start protecting people again, instead of taking cases just because they know they can trick 12 idiots into seeing things their way. Those fucking SCO assholes and OJ Simpson are a prime example. We all know OJ did it and SCO is full of shit. Their attorneys know it.

JC (funny how the initials match) protected him for the money and fame it would bring, not because he was innocent. This is truly a tragedy. Our justice system has become a monster which fucks the innocent and the righteous and promotes the tyranny of evil men. This is a government for the people, by the rich attorneys.

I hope all of these Johnny Cock-eran types go to hell.

All of this makes it really hard to do anything with out attorneys. Their best advice is usually to not do anything which leads to the web which is emerging.

Can't wait for the next internet... hopefully people will keep the next one under their hats and keep it wild west like.

My son's college education is provisioned for 7 years. I am hoping he will go to law school... better to be the fucker than the fucked.

l8,
ac

robots.txt out of a time hole or prewritten news? by Kosi · 2003-07-14 01:12 · Score: 2, Funny

Just check the aforemetioned link, they block the 2004 and 2005 a bit soon, isn't it?

And now, ladies and gentlemen, please begin your conspiracy theories about those prewritten news articles and so on ... :-)

Re:Free registration..some implications by ajs318 · 2003-07-14 01:13 · Score: 1

I don't know how valid the address needs to be. I just used the "from" address on a piece of SPAM I had been sent previously. It seemed to work; certainly it didn't e-mail an auto-generated password to that address or anything.

--
Je fume. Tu fumes. Nous fûmes!

moron why Godless corepirate nazis fear the cache by Anonymous Coward · 2003-07-14 01:18 · Score: 0

it makes it more difficult for the evile wons to change their phonIE fauxking billyonerror 'storIEs', every time they are caught bullowing ?pr? smoke/execrable up yOUR .asps again.

lookout bullow.

How Ironic... by Anonymous Coward · 2003-07-14 01:20 · Score: 0

The banner ad at the top of the Slashdot home page when I posted this reply was for the New York Times...

and about that zip code... by smartfart · 2003-07-14 01:21 · Score: 4, Funny

You know you need to put 90210 in there, dontcha?

And if they require a phone number, use 867-5309 # ask for Jenny

--
Need a Linux consultant in New Orleans?

Re:and about that zip code... by yotto · 2003-07-14 02:56 · Score: 1

Actually, my address is usually:
123 4th street
Fifthsville, VI
78910
Yeah, VI isn't a real state, but (IIRC) it is usually accepted as the Virgin Islands.
Email? None@your.biz

--
Pulp Audio Weekly - Geek News and Reviews
Re:and about that zip code... by clmensch · 2003-07-14 03:44 · Score: 1

Hey, Roomie! I didn't know you surfed /. too!

--
There is no gravity...the earth just sucks.
Re:and about that zip code... by Anonymous Coward · 2003-07-14 07:16 · Score: 0

i personally always do 90210, glad u do too

i never watched the show though so the # is 111-1111

Re:Metatag IP by JamesP · 2003-07-14 01:25 · Score: 1

Yeah, and the NYT will probably want to patent NOARCHIVE (i.e. technology to prevent unwanted access to a site by spiders)... Makes me wanna puke...

--
how long until /. fixes commenting on Chrome?

Re:Shouldn't someone simply tell the NY Times: no by Anonymous Coward · 2003-07-14 01:26 · Score: 0

Oh please. Kindly refrain from sharing your paranoid delusions with us.

Login user tracking by Anonymous Coward · 2003-07-14 01:27 · Score: 1, Interesting

Has anyone else but me noticed that if you click a link off from http://news.google.com that points to a NYT article, that you do not have to log in to read it? Google and the NYT must have some sort of arrangement regarding that. I don't see what the problem with having their pages cached is anyway. They must have special provision to allow google to read the articles without logging in anyway or they would not get crawled.. Anybody try browsing the NYT with Googlebot as their user agent string? I still had to log in. Probably the actual user agent string that googlebot uses has some numbers etc after it. I still bet that if you used the actual user agent string as the googlebot, that you would find that you did not have to log in to view stuff that you did before. I ought to try that whilst browsing for pr0n. I bet you wouldn't have to log in to some pay sites.

Re:Login user tracking by AngusSF · 2003-07-14 12:07 · Score: 1

Try a Google Search for Random NYT Password Generators
You can register for each story with a different login and password and retain total untrackability.
Of course, if you always browse from the same IP address, you're stuck with that as a hole in your privacy shield.
Angus

--
"A gun is a tool, Marian. No better, no worse than any other tool. An axe, a shovel, or anything." Shane (1953)

Re:Free registration..some implications by mrd_yaddayadda · 2003-07-14 01:28 · Score: 1

jkrise wrote:

Secondly, news sites are planning to go the 'pay' way in about a couple of years.

Really? Where do I find out about this. If this is true (I'm sceptical...) then I guess it's lucky I stick mostly with BBC News for my news online...

NYT is Cloaking. by jasonhamilton · 2003-07-14 01:29 · Score: 1

If the NYT is showing google one page (without needing them to login), and the users another (requiring them to login), then that's cloaking. Really, google should be the one to remove their pages, as anyone who's caught cloaking is normally banned.

--
SearchIRC - Now with live chat directory!

Saying no.. by lionchild · 2003-07-14 01:31 · Score: 0, Redundant

One would think, that if the content of the NYTimes is copyrighted, that the would, in fact, have the right to tell Goggle to just stop caching their web pages. Particularly if the web content is the same as what they've put into print. NYTimes could argue that the google cache is a violation of their copyright to the printed paper.

--
Awk! Pieces of eight. Pieces of eight. Pieces of seven... ERROR: General Protection Fault. [Paroty Error.]

Re:Thanks... by mrd_yaddayadda · 2003-07-14 01:32 · Score: 1

If you think that is idiocy then you've not been keeping up with current US govt practice, policy and law have you?

While that example might not happen, in a country where librarians are expected to report reading habits it is not out of the bounds of reality to see it getting to that stage. Not likely but not entirely improbable or impossible.

If you don't want your stuff cached... by Anonymous Coward · 2003-07-14 01:33 · Score: 0

...then you should have taken appropriate measures to ensure it could not be cached before you opted to publish it to the whole world.

I never remember login IDs by Moderation+abuser · 2003-07-14 01:34 · Score: 1

I visit many sites which require logins, I can never remember them so I just create a new one each time if I desperately want access to some information, If I'm not that bothered of if it's a commerce site, I just go elsewhere.

--
Government of the people, by corporate executives, for corporate profits.

Um, I tell lies. by Moderation+abuser · 2003-07-14 01:38 · Score: 1

I mean, I never ever fill in the truth to online marketing forms. Why would I? It's much more fun to put other information in.

Hands up anyone who does fill in forms with the truth!

--
Government of the people, by corporate executives, for corporate profits.

Re:Um, I tell lies. by jbottero · 2003-07-14 03:28 · Score: 0

One reason you might consider being truthful except for email address is that these people provide YOU with useful content for free... Don't you think they should get something back? Or are you just a leach on society?

Truism? by mrd_yaddayadda · 2003-07-14 01:41 · Score: 2, Informative

Actually, that's a good point. It doesn't seem google actually PRODUCES any sort of content on their own.

Actually, that's a pointless point. Of course google doesn't produce anything; they are a meta data service. Search engines and collators for websites, for news for images and who knows what else.

The issue is whether or not they should be able to collate data that is in some way secured. And on that I'm offering no opinion mainly because I can see all sides of this and hats are all too grey to be able to distinguish for me.

Good by glenrm · 2003-07-14 01:43 · Score: 1

I was looking for a better way to make sure I didn't read any NYT Times articles, so this will help alot, guess it is time form more OpinionJournal .

--
Onward to the Aether Sphere!

Re:Free registration..some implications by LilMikey · 2003-07-14 01:49 · Score: 1

Secondly, news sites are planning to go the 'pay' way in about a couple of years.

Good! I hope they do. That would open the door for independent and alternative news organizations... unless the man gobbles them up or silences them another way.

--
LilMikey.com... I'll stop doing it when you sto

Re:Illogical.-I want everything free. by Anonymous Coward · 2003-07-14 01:56 · Score: 0

"Big business is taking the internet away from its free beginnings. Look for an association like the RIAA to form for news/content site to act as the legal executioner of sites posting copyright materials."

So that's what the whole internet thing is about! How dare those companies whithhold the content they created from us! How dare they!

BIG CLUE:Free speech as in you can run your mouth, and no one can stop you. Not everyone elses speech free for you.[1]

[1]Your posts fall under copyright. However you give implicit permission for others to read it free of charge by posting here. You can if you want to try closing it off and charging for it.

Example: craigslist.org does what the NYT wants by fboomerang · 2003-07-14 01:58 · Score: 1

Note that the pages are indexed, but google users can't accessed cached data

Appears they're using the NOARCHIVE meta tag

Is it big news whan the NYT has to call Google for tech support?

news.google.com by lordrich · 2003-07-14 02:04 · Score: 1

They can't just stop google caching the pages because then google would stop putting their news in news.google.com.
Personally I can't understand the problem, you don't have to register to view articles through news.google.com anyway.

Re:Shouldn't someone simply tell the NY Times: no by CrazyTalk · 2003-07-14 02:04 · Score: 1

I wouldnt be annyoying if you register once, then forget about it. I registered years ago, so I never have to see the reg page again. Took about two minutes, and well worth the effort for a "free" subscription to the NY Times. Have you seen the application form for PC Week and similar magazines? THOSE are a pain; dozens of questions to fill out, including things like income and potentially confidential info about your company. If you are that concerned about your private info (as has already been stated) just use a fake name.

Complaining about being tracked by clary · 2003-07-14 02:05 · Score: 2, Interesting

I am not worried about being tracked, but rather don't find the content of the NY Times compelling enough to bother acquiring one more username and password. On the other hand, I've registered with slashdot for the amusement of karma and to my.yahoo for the spamcatcher email account and personalized weather.

I don't complaint to slashdot, but did email NY Times and tell them such. (They graciously offered to sell me a paper subscription, no email registration required. ;-)

I also don't avoid no-registration links or slashdot posts that contain copies of NY Times articles. I guess that makes me a hypocrit.

--

"Rub her feet." -- L.L.

Re:Complaining about being tracked by Wakko+Warner · 2003-07-14 04:01 · Score: 1

I am not worried about being tracked, but rather don't find the content of the NY Times compelling enough to bother acquiring one more username and password. On the other hand, I've registered with slashdot for the amusement of karma and to my.yahoo for the spamcatcher email account and personalized weather.

How have you managed to survive on the Internet this long with only two personal accounts?

- A.P.

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Re:Problem is potentially bigger than caching Re:Y by sketerpot · 2003-07-14 02:07 · Score: 2, Informative

Basically all I am saying is that there should be a movement similar to Open Source not only for software products, but for journalistic content.

There is. How about the Creative Commons?

European legislation actually outlaws this by CrystalFalcon · 2003-07-14 02:12 · Score: 1

At least here in Sweden, if you say something is "free", then you are expressly prohibited from requiring anything in return - be it any action, information, or goods.

Sounds like obvious common sense, but the lack of this legislation (and mindset) in the US has started to slowly creep over here. Fortunately, companies get a quick and cold wake-up when they translate their American ads saying "Free router! (with purchase of X and Y)" and run them here, and instantly become required to hand over routers unconditionally to anybody who asks for them - the condition isn't valid since you are using the magic word "free".

Requiring registration is, by definiton, not free. You are required to do something in return.

Re:European legislation actually outlaws this by eyeye · 2003-07-14 05:25 · Score: 1

Thankyou for the information I found interesting. There are a lot of good people protecting laws in europe, I have started to think.

--
Bush and Blair ate my sig!

Talk Free-GPLing speech. by Anonymous Coward · 2003-07-14 02:13 · Score: 0

"Basically all I am saying is that there should be a movement similar to Open Source not only for software products, but for journalistic content."

Hehe Everyone wants to be a Gutenburg, anyway...

1-School newspapers and magazines.
2-Those free papers, and magazines you can pick up around town.
3-PBS and NPR.
4-My scratchings on bathroom walls.

Anyway you can lobby for laws to prevent them from closing up content they either bought, or produced themselves. However remember taking away other people's rights means yours are next in line for the same process. Glad you're willing to make the sacrifice.

"My eyes are open." by StarFace · 2003-07-14 02:17 · Score: 2, Insightful

Or, you can just step out of the consumer-corporation mind jerk entirely and live your life the way you wish to live it, and not the way the banner/side-of-bus/television/et cetara tells you how to live it. Me, I live without all of these things and I seem to be doing just fine, quite happy, actually. And I could care less about most of these companies you are refering to, the ones that I do care about, they get my financial support in return for services, with or without their million dollar advertising campaigns (which I never see, anyway.)

So which is the real real world? The one where you spend the afternoon on your porch reading a book to your mate, or the one where you sit in front of a television and "reap the rewards" of advertising, so you can buy more stuff, presumably?

I am not saying my world is universally better than your world, but it is just as real.

--
V

Re:"My eyes are open." by StarFace · 2003-07-14 05:15 · Score: 1

Or... I work for a non-profit organization.
I got recorded buying a book? I transact in cash. I don't use credit cards, I don't own any. I know they are tracking everyone else, and that bothers me a great deal, but I am outside of the system on this one. I shop at independent stores. They don't have membership cards, or anything. Nobody has ever even asked for my name.
Did I say anything about reading books being the only thing I ever do? No. So what was the point of that last line of nonsense? It was an activity comparison along the lines of: The mind of a person who eats what they have been told go eat by advertisers, and a person who grows and makes their own food. Oh, now you'll probably come back with, "to survive in this world, you need more than a garden, farmer boy!"
So, uh, try again -- I am still smiling. I have yet to see why my world is any less real than yours.

--
V
Re:"My eyes are open." by jellomizer · 2003-07-14 08:04 · Score: 0, Troll

Well it looks like you got the trolls bait.

Anyways I understand your point. But my main point is if you are going to have to deal with advertising you might as well get adds that you would rather see. And I think most people will be able to deal with adds so to keep the content free. Even Non-Profit organizations needs funds some how, (I cant even count how many tele-marketer calls I get from allegedly non-profit organizations).

As a side note although you pay cash and go to an independent book store you still get counted. You are still recorded to have bought a book and the book store will need to contact the publisher again to replenish the book in stock. So the publisher still knows that their is a person in wherever you live who bought your book. So they know that there are so many people in your area with your interests. Thus billboards, radio stations, tv stations will realize the percentage of people with your interests and target target all other forms of advertising in your area.

Is this a perfect system? No. But providing free content is expensive and getting stats helps companies to know what their audience is interested in. Not every company can function as a Non-Profit, organization because of things like competition and research, etc. Also not everyone can work for a Non-Profit organization. So people need to find ways to giving content and still being able to eat.

P.S.
Usually when they start going to Name Calling they are usually trolling.

--
If something is so important that you feel the need to post it on the internet... It probably isn't that important.
Re:"My eyes are open." by Anonymous Coward · 2003-07-14 08:08 · Score: 0

Usually when they start going to Name Calling they are usually trolling.

Yea right your just a corporate goon who think you know it all about money. If you were so know it all then you be rich enough to not post here. I recommend that you shut up and quit your job so you stop bugging us people who realizes that money is not everything.
Re:"My eyes are open." by StarFace · 2003-07-14 08:39 · Score: 1

Yes, though it doesn't bother me, when it does, I quit. Personally, playing with trolls can be fun. :)
I agree with what you are saying, I would rather get ads that interest me. What I have a problem with is the way they are going about it. Take Opera's ad system, for instance. That is something I don't mind. I tell it that books and art interest me, and that is the end of the story. It isn't tracking meta-tags in the pages I visit, or anything else without my knowledge. Why not just do things that way? If you are a registered user of NYT, or any other ad-supported service, and you get tired of the generic advertisments, you should be able to access your account settings and tweak them as you will. It would require far less resources on the serving side of things, and the user doesn't feel like they are being watched all of the time.
Second point, the type of tracking you are talking about here is very generic, and doesn't bother me too much, if that is indeed what they do. I don't mind if a publisher knows that the average person where I live is more interested in film than business, for instance; or figures that the average person walking down a terminal in O'Hare would probably find the Dow more interesting than the latest news from Cannes. That is a fairly non-invasive type of marketing. It is when they start assuming they know exactly what I need, based on some spreadsheet that it gets amusing -- and then annoying, because nine times out of ten the things I consume on a regular basis really aren't advertised much anywhere. Watching CDNOW.com trying to figure out my rather eclectic and obscure taste in music would be a primary example, when I still shopped there.
Oh, and by the way, I primarily shop used books, so I'm not sure if the publisher even gets that info at all -- but that is just a side issue. I don't mean it as a rebuttle, your original point still stands.
And shame on those telemarketing non-profs! Anyway, your points are all sound; I was never intending to say that the entire world should live as I do -- who knows it might be a worse or better place. All I know is what is best for me. I used to live in the consumer cycle, and I am glad to be out of it. Once I did leave it, forms of advertisment that used to get me now seem painfully obvious and contrived. You also start to view the entire circle as a little -- odd though. I mean where is all of this going, really? To what purpose is it serving, just to keep people's minds off of dying? There has to be a better way, because most of the systems employed and used do not really make life "better" for people, it just seems better because that is how everything is set up. The news I get from the local free/price-to-print newsletters are usually way more interesting and honest than the stuff with massive profiles and budgets.
Yeah, time to stop. Philosophy alert on the horizon.

--
V

Free registration and the RIAA-Crystal ball. by Anonymous Coward · 2003-07-14 02:23 · Score: 0

" 'Why should the RIAA change its business model to a pennies per song method'

Because nothing else is going to work. If they refuse, they'll die."

And yet the proponents make this argument without proof.

Sounds like wishful thinking to me.

Re:Free registration and the RIAA-Crystal ball. by Anonymous Coward · 2003-07-14 08:27 · Score: 0

Granted. When you get right down to it, nothing that will happen in the future can be proven. If I had a strict proof that any possible future for the RIAA would actually come to pass, I would be appointed its CEO. Fact is, such things can't be proven. So yes, I'm inferring. Using logic. Looking at the precedent of piano sheet music piracy in the 18th century. Extrapolating from the past trends of software piracy with adjustments to account for unique technical limitations relating to copy protecting music. Noting what has happened in the past to companies whose primary industry becomes outdated. Even looking at CURRENT TRENDS of declining music sales and inferring a continuing decline.

Yes, it's an opinion. A point of view. I hope you don't think your prediction -- or your condemnation of mine -- is any different.

Only among terrorists by Anonymous Coward · 2003-07-14 02:26 · Score: 0

"USA has a bad reputation worldwide"

Not among people who are informed about things.

Re:Only among terrorists by Anonymous Coward · 2003-07-14 04:35 · Score: 0

What the hell are you talking about? the USA has a bad rep worldwide _precisely_ in the crowd of people who know such things.

Why do you think that US diplomats are resigning in record numbers right now?

Lay off the patriotic crack pipe for a little bit.

Easy way to solve this problem... by ninejaguar · 2003-07-14 02:29 · Score: 3, Insightful

Every time a cached link is clicked, pay sites like the New York Times can receive notice from Google (easy to automate this) that one of their pages (which is cached in Google) has been accessed, and all advertisements in the cache have been displayed (Google caches Ads in the page as well as the contents). This allows the website to "offload" traffic and at the same time keeping the books on the number of times their Ads have been viewed so that they can send the accounting record to their paid Advertisers.

Google would find this very simple to implement, and paid sites would find this very beneficial (borrowing Google's enormous bandwidth and server capabilities for free) and at the same time should solve most of their concerns. After all, Google's cache isn't sufficient for proper access to ALL the paid-content at the New York Times as the cache is temporary in nature. Also, its too spotty in coverage to be considered reliable enough for really digging into a paid-sites entire content.

Using Google like this is akin to using Google as a window into the pay-site's house of content. You can part of a room, but not the whole interior. Now, every time someone peeks, the House gets notified and can get paid for it. The more windows Google adds to the House, the more chances the House gets paid.

Speaking of which... by arhca · 2003-07-14 02:29 · Score: 3, Interesting

Why doesn't Slashodt cache news articles and stories before running a story? It would make a lot of sense for text based news items.

Re:Speaking of which... by /dev/trash · 2003-07-14 12:24 · Score: 1

Well why don't you?

Google News & Subscription sites by frostman · 2003-07-14 02:35 · Score: 3, Interesting

I use Google news pretty regularly, and I've noticed that some of their links are to paid subscription sites. These are clearly marked as such ("subscription").

I don't generally click on those links, but I think it's a good idea, since I'm not actually going to Google for the news, rather for links to the news. The reason I personally don't click on the subscription links is that I have my favorite set of real newspaper sites (some registration, some free, some not) and that's not what I'm using Google News to find. Someone else, however, probably is using it that way.

I would guess that Google gets something back from that sort of link, since the site owner is getting more from the link than Google is from the listing. (Maybe I'm wrong, of course.)

It makes perfect sense to have something like that for the regular search engine, and to charge for it, as long as it doesn't affect the link's rank in the search results.

For example they could have a special command for robots.txt (or google.txt maybe) that would allow Google to access and cache the page, but the regular link would go to some registration page (easy to do) *and* the cache link would also go to some kind of registration page, defined in the google.txt file.

The NYT would promise that the cached page is really the cached page, and pay Google something for redirecting to NYT's cache (with registration). Or even better, there would be some kind of redirect where I actually get the cache from Google after I've registered with NYT.

They're probably thinking of something like that, because otherwise the solution would be to simply disallow caching, and that wouldn't be news, would it? ;=)

--

This Like That - fun with words!

Google's cache copy - the larger issue by Everyman · 2003-07-14 02:36 · Score: 5, Interesting

The question is framed very narrowly by Slashdot, so this discussion misses the larger issues. The cache copy is an issue in Google's main index for many webmasters. The Google News situation is a subset of a larger problem; the cached link doesn't exist in Google News. Google News is a much narrower issue. I'd like to bring up the issue of full-text caching done by Google in their main index.

My problem with the cache is that it gives Google a competitive advantage that is unfair, and furthers their monopoly. This is especially unfair since it is most likely illegal -- assuming that you could ever get a good test case into court, or get a class action lawsuit going by some webmasters, publishers, or search engines.

To add to the attractiveness of the cache copy, consider what Google has done:

1) The cache copy makes it possible to highlight the search terms, whether or not you have the toolbar installed.

2) The download time for the cache copy from Google's servers is always faster than from the original website.

3) You never get a 404 "not found" or a DNS lookup failure for the cache copy.

4) The link to the page recommended by Google for bookmarking at the top of the cache copy is a link to Google's copy, not to the original page.

5) How about all that Google branding on the top of the cache copy? Priceless. I feel the cache should be opt-in, not opt-out. The only way you can avoid it right now is to place a "noarchive" meta on every page in your site. On some file types, such as .txt files, there's no place to insert a "noarchive" and Google goes ahead and caches it anyway.

The cache copy tends to keep eyeballs on google.com, and increases their searches. You may have noticed that many major news sites won't link to other websites in their stories anymore, but rather just mention the relevant site without putting a link behind it. That's because they don't want eyeballs wandering off of their page. A wandering eyeball may not come back and look at more ads. That's basically one of the big reasons behind the cache copy as well -- it keeps eyeballs from wandering as much as they would without the cache.

All the Google partners -- AOL, Earthlink, Yahoo, Netscape -- don't include the cache links, and I assume that this is the reason. They don't want people wandering off to Google and staying there.

As new competition is organizing to challenge Google's monopoly, from places such as Overture (Alltheweb and AltaVista), Yahoo (Inktomi), AskJeeves/Teoma and Microsoft, these engines have to consider whether to fight Google on the cache copy, or offer their own cache copy even if they think it is illegal. There isn't really any middle ground on this.

Many observers with legal expertise feel that while the snippets are "fair use" of a website's content, offering the full text in a cache version is not. Copyright law requires "express permission," but Google only offers an incomplete and inconvenient opt-out. I suspect that the legal departments of these other engines are more inclined to challenge Google rather than launch into their own violations of copyright law.

Re:Google's cache copy - the larger issue by ahhhmytoes · 2003-07-14 06:23 · Score: 2, Insightful

On some file types, such as .txt files, there's no place to insert a "noarchive" and Google goes ahead and caches it anyway.
Try the Pragma: no-cache and/or Cache-control HTTP headers.
Re:Google's cache copy - the larger issue by elemental23 · 2003-07-14 18:37 · Score: 1

On some file types, such as .txt files, there's no place to insert a "noarchive" and Google goes ahead and caches it anyway.

That's why god created robots exclusion standard (eg. robots.txt).

--
I like my women like my coffee... pale and bitter.

Re:Worst result .. it's reality already by Anonymous Coward · 2003-07-14 02:56 · Score: 0

Google seems to be the only search site around, and they censor and distort like mad.

Dude. Put your tinfoil hat back on, take your meds, and go sit in the corner until they take effect.

Two CIA Companies Cooperating... by Anonymous Coward · 2003-07-14 03:00 · Score: 1, Informative

Not exactly news. Both are part of our state-controlled media.

Did you know you need a security clearance to work for Google?

That their Usenet policy allows them to drop any posts from other ISPs that they don't like?

BeOS Stock Scandal with Microsoft Brewing

You are welcome to use xxxxdd@xxxx.com any time. by Futurepower(R) · 2003-07-14 03:01 · Score: 3, Informative

Your comment was confusing to me until I realized that you are talking about giving NYT an actual email address. Why would you do that? Isn't that why we have hotmail.com? Give an address that does not exist or a throw-away address.

Last week I was registering at a web site and I put in xx@xx.com for the address. The system responded, "This address has already been registered." So then I put in xxx@xxx.com. The system responded, "This address has already been registered." So I entered xxxx@xxxx.com. Same response. Finally I awoke fully and entered some Ds, xxxxdd@xxxx.com, and the system accepted my "registration".

Come on, it's not a SPAM question... Get real. by jbottero · 2003-07-14 03:05 · Score: 1, Insightful

But you know, you DON'T have to give a real name or email for NYT or JPost, or most of the others, they don't send you your pass and uid, you know.

It's not the spam that's the problem, if you use your head, you get no spam. It's the hassle of logging on

Re:Come on, it's not a SPAM question... Get real. by 1u3hr · 2003-07-14 15:26 · Score: 1

It's not the spam that's the problem, if you use your head, you get no spam. It's the hassle of logging on
If you let the NYT have a cookie, and tick the "no thanks" boxes allowing them to send email, it is complely transparent. Since the cookie is linked to a throwaway email account and all the name, address, etc is fake, I can't see any issue at all. Since registering a year ago I've never received any email from them. (The account redirects to my normal one.)

Put up with the ads or pay $5 per issue! by jbottero · 2003-07-14 03:12 · Score: 0

The problem is, Sir, things cost money to produce. Newspapers cost a hell of a lot more than 50 cents or a buck fifty or whatever. Want to ACTUALLY pay for the content you read? Try $5 or $10 per issue! Can't handle that much? Put up with the ads.

Re:Free registration..some implications by jbottero · 2003-07-14 03:18 · Score: 0

Actually, free reg requires a valid email id. It thus filters most bogus registrations.

Crap. I am jizz_sucker, age 97, and my email is qwert@qwert.com. Email me and see if I respond.

Before that I was quwambi_bartok at an email made of random key strokes.

You still don't get what I am saying. by StarFace · 2003-07-14 03:26 · Score: 1

Hmm? I generally do not read newspapers. Sorry about the feint there, I was just hypothesizing about a dream advertising assimilation engine that erased itself, one of those things most people refer to as a joke. I don't actually read the Times, or any other major newspaper.

--
V

Re:Free registration..some implications by Anonymous Coward · 2003-07-14 03:26 · Score: 0

jkrise wrote: ...Secondly, news sites are planning to go the 'pay' way in about a couple of years... Anonymous coward replied: Can't wait for that day, I'll subscribe and post all the news on my site for FREE! And I'll be the only one, generating millions from ad revenue, because I'm the only free news web site in the world.

Re:Worst result .. it's reality already by jbottero · 2003-07-14 03:33 · Score: 0

While for the time being, Google is THE search resource, it was not always this way. Google will fall under it's own weight and some other wizz-bang will take the lead. Remember AltaVista? Excite? HotBot? Where are they now? They are all where Google is going fast. Who will spring for the next big search site? Some soon to be rich college kid who will eventually sell out and go play with is trophy wife and boat...

The Paywall by ka9dgx · 2003-07-14 03:43 · Score: 2, Interesting

The fact is, they want to hide things behind a paywall, and most folks resent it. They need to decide a few things:

Are they going to expose their archives, and enjoy the benefits of far more exposure (on google, etc), or hide behind a paywall, and complain that they're increasingly irrelevant.
Are they going to start practicing journalism, or stick with their current direction of corporate propaganda for the masses? (and become increasingly irrelevant)

--Mike--

I *know* Slashdot doesn't sell my email address, by Andorion · 2003-07-14 03:55 · Score: 1

I can't say the same about the NYT.

~Berj

Quit your fucking whining. by Wakko+Warner · 2003-07-14 03:57 · Score: 1, Insightful

I have an NYT account. Do I care if they know what I read on their site? About as much as I care when the next "American Idol" rerun is on (which is to say, not at all.) Why on earth are you fuckers so paranoid about this? I see absolutely nothing wrong with tracking as long as it's limited to the originating site.

Get over it, for God's sake.

- A.P.

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Re:Quit your fucking whining. by Tenebrious1 · 2003-07-14 06:54 · Score: 1

I have an NYT account. Do I care if they know what I read on their site? About as much as I care when the next "American Idol" rerun is on (which is to say, not at all.) Why on earth are you fuckers so paranoid about this? I see absolutely nothing wrong with tracking as long as it's limited to the originating site. Get over it, for God's sake.

If you really don't care, why not post your login and password so we can share it? If you don't care that they're tracking and customizing ads for you, then you shouldn't care if we log in with your account and screw up their tracking. If that's the only reason, then give us your login, there's no reason not to.

--
-- If god wanted me to have a sig, he'd have given me a sense of humor.
Re:Quit your fucking whining. by Wakko+Warner · 2003-07-14 13:07 · Score: 1

username: aptimes
pass: times

Have fun.

- A.P.

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

NYT Doesn't Spam This Registered User by reallocate · 2003-07-14 03:58 · Score: 1

I've been registered with the NYT from their beginning. I receive little spam from any source, and none that I can trace to them. The only mail I get from the NYT is mail I requested.

--
-- Slashdot: When Public Access TV Says "No"

Be Careful of NYT/Slashdot Comparisons by reallocate · 2003-07-14 04:04 · Score: 1

That's taking comparisons of Slashdot and NYT about as far as they can go without demeaning the NYT.

The Times practices real journalism; Slashdot practices poaching on real journalism.

--
-- Slashdot: When Public Access TV Says "No"

SHUT UP SHUT UP EVERYONE SHUT THE HELL UP! by Wakko+Warner · 2003-07-14 04:07 · Score: 1

Christ all fucking mighty! It's an EMAIL ADDRESS! If you're SO concerned about it, MAKE ANOTHER ONE and use it for registration! What the hell is the big deal; why is everyone SO pissy about the NY Times for offering a FREE service (and yes, unless you're a fucking retard, you should consider it to be free)? An email address is not currency. It has no value. It is worth NOTHING. I can make billions upon billions of them whenever I want. Why the bloody hell do so many people on slashdot take offense to this absolutely innocuous information request?

Jesus!

- A.P.

--
"Remember when the U.S. had a drug problem, and then we declared a War On Drugs, and now you can't buy drugs anymore?"

Re:SHUT UP SHUT UP EVERYONE SHUT THE HELL UP! by _Sprocket_ · 2003-07-14 06:40 · Score: 1

An email address is not currency. It has no value. It is worth NOTHING.

If its worth nothing, then why do they require it before allowing access to the content? Why do they go through the expense of recording and storing this information? Obviously there is SOME value to the information.

Whether the value is a fair trade for the service / content provided is a different issue entirely.

Probable topics of NYT - Google discussions by mysticgoat · 2003-07-14 04:10 · Score: 1

Why is NY Times even in "discussions" for this, other than to gain some column inches?

I don't think this is the case. For one thing, it doesn't seem like NYT is playing up this story the way I would expect, if they were using it as a publicity tool.

My guess is that among other things, NYT and Google are discussing what kinds of demographic info Google could provide NYT on those who access the cache, and what the cost of that kind of contract would be. I'm sure that NYT would like to be able to tell their advertisers that "If you buy the 'Geek' package, your ad will be put directly in front of 750k geeks with subscriptions, and on the stories that get to slashdot, you can expect another 250k tinfoilhat viewers, through the Google archive."

So I think Google and NYT are looking for win-win combinations. Since this is a brand-new playing field, it is probably going to take a while to figure out. I'd guess they are even having to work out how to keep score.

This message brought to you by BronzeWorks
"Tinfoil is now so old hat. Move up to the new technology:
get a BronzeFoil Helmet(TM); today!"

You're a fucking genius! by Anonymous Coward · 2003-07-14 04:12 · Score: 0

Requiring registration is, by definiton, not free. You are required to do something in return.

I agree; that's why I hired someone to register for me, thus making it completely free.

Get the lawyers -- by jtalkington · 2003-07-14 04:24 · Score: 2, Insightful

people are using technology for what it was inteded to do!

At the heart of Google's caching dilemma lies a thorny legal problem involving a core Web technology: When is it acceptable to copy someone else's Web page, even temporarily?

When your server and pages say it's alright (or don't say that it's not alright.) The standards for the web are very clear on this, but non techie companies (and some judges) don't seem to get this.

This reminds me of the issues of "deep linking" that everybody was suing over a couple of years ago. That's exactly what the web was designed to do, but these johnny-come-lately companies put sites up, and expect people to stop using the technology for what it was designed for.

If only the EFF was as well funded as the ACLU...

Just a datapoint by abulafia · 2003-07-14 04:30 · Score: 1

I've been subbed to the NYT for a long time, with a trackable email address used only for the registration. I opted out of all of the email crap.

I've never been spammed at that particular address. I would conclude that the NYT is actually ethical wrt their email database.

--
I forget what 8 was for.

If google where evil.. by spaic · 2003-07-14 04:33 · Score: 1, Funny

Oh you don't like it? Ofcourse we won't cache your pages then, we'll even make sure your pages are completely removed from our index. From now on a search on New York Times will give nothing but gay porn. If there's anything else we can do for you don't hesitate to ask. Thank you!

Two solutions by Anonymous Coward · 2003-07-14 04:44 · Score: 0

>Shouldn't the NY Times simply tell Google not to cache their site?

No, the NY Times should stop crying and apply some technical knowhow.
2 solutions:
There is this little thing called a robots.txt file. NYT should learn how to use it.

There is a little thing called the login. NYT should properly secure it to keep spiders out.

Oh, that's right, they want page rankings with Google... so google needs to be able to index the site and pull the content. Hrmm... what to do...

Darn, that is just too bad. The cache is part of the deal when you want your site to come up in search results on Google. Love it or leave it.

I hope that Google tells them where to go. They can you know...

"Google is red hot
That NYT site ain't doodly squat"

I have never read an online NYT article. My life is just fine without NYTOL. I often depend on Google to find useful information. I would not miss NYT results. Funny, I have never even seen NYT come up when I search for stuff.

Aren't NYT articles half fiction anyway?

The whole idea of free but requiring a login to view the content is incredibly stupid anyway. I am not giving them my email so they can sell it to spammers....

l8,
AC

Re:You are welcome to use xxxxdd@xxxx.com any time by Becquerel · 2003-07-14 04:46 · Score: 1

You've got to feal sorry for the guys who actually own: xx.com xxx.com abc.com qwert.com etc there spam boxes must be enourmous.

--
My spelling isn't bad, I'm evolving the language

Yeah, but they don't need Google for that. by Inoshiro · 2003-07-14 04:54 · Score: 1

They just need to stick /> in their headers. How hard is that? Not.

--
--
Internet Explorer (n): Another bug -- that is, a feature that can't be turned off -- in Windows.

Re:Free registration..some implications by gilroy · 2003-07-14 05:13 · Score: 1

Most people offer up the John Henry story as a testament to the indominitable will of Man in the face of dehumanizing machinery. For example, this site says

According to some accounts, on hearing of the machine, John Henry challenged the steam drill to a contest. He won, but died of exhaustion, his life cut short by his own superhuman effort.

which seems, to me, to be praising Henry instead of pointing out how useless this sacrifice was ... especially since, even if he "won", you can be sure that the next iteration of the machine would have beat him.

--
The Mongrel Dogs Who Teach

Hey NY Times! I don't want to register! by mnemotronic · 2003-07-14 05:26 · Score: 3, Insightful

Shouldn't the NY Times simply tell Google not to cache their site?"

How about if the Times got over their registration fetish?

From the Times Subscriber Agreement:

You may not ... in any way exploit, any of the Content or the Service (including software) in whole or in part.

What is meant by "exploit"???

From the "Forums and Discussions" section:

You shall not upload to, or distribute or otherwise publish on the message boards (the "Forums") any ... abusive ... material.

What is meant by "abusive"???

And how about this>

3.5 You acknowledge that any submissions you make to the Service (e.g. Letter to the Editor, Review or Commentary) may be edited, removed, modified, published, transmitted, and displayed by NYTD and you waive any moral rights you may have in having the material altered or changed in a manner not agreeable to you.

Interpretation: The user/poster is entirely responsible for the content of their post, which the Times may alter in any way. Yikes!!! Granted, this applies only to content submitted to the Times, but the wording seems pretty scary.

--
The Russians have won. They have made the world a cesspool of distrust, greed, fear and hate.

It shouldn't be up to google.... by dentar · 2003-07-14 05:36 · Score: 2, Insightful

..to censor their cache. Those that don't want their content cached should fix their web servers and firewalls first. My web site prohibits known web crawler bots, and google doesn't cache it. No problem! I didn't have to harrass google about it and they don't have to break their own promise to not be evil.

--
-- I am. Therefore, I think!

Re:It shouldn't be up to google.... by neitzsche · 2003-07-15 04:30 · Score: 1

I agree that the onus should not be on google. More to the point, they (NYT) are publishing something in public for all to see and use how they will. It's not like they are publishing big secrets. They should be ashamed that they are putting something out for public consumption, then taking it back. Either publish or don't. They charge subscriptions based on the talents (ahem) of their writers, but the subject of what they are reporting on is not their doing. This seems like a way for them to say that the own the course of human events.

--
"God is dead." - Frederik Nietzsche

Re:You are welcome to use xxxxdd@xxxx.com any time by cesspool · 2003-07-14 05:37 · Score: 2, Funny

my personal favorite has always been - ucan@suckit.now

Herr EyeEye? by Anonymous Coward · 2003-07-14 05:41 · Score: 0

"Warning: MEMRI and ADL are dangerous propaganda machines."

Herr EyeEye, did you join the Hitler Youth when you were 6, or did you wait until you were 9 years old?

Re:Herr EyeEye? by Anonymous Coward · 2003-07-18 05:09 · Score: 0

More propaganda from an AC eh...
lol.

USA hated only among the evil and ignorant by Anonymous Coward · 2003-07-14 05:44 · Score: 0

" What the hell are you talking about? the USA has a bad rep worldwide _precisely_ in the crowd of people who know such things."

The hatred is strongest among the evil-minded and ignorant. France protests against the U.S. just a couple of years after they had protests about "Jews controlling the world". It has a bad reputation only because too many people are ignorant.

"Lay off the patriotic crack pipe for a little bit."

Not as much patriotic as informed about global issues.

IAA"/."L. by Anonymous Coward · 2003-07-14 06:00 · Score: 0

"Any good search engine should ignore "robots.txt". If you don't want it read, don't put it up on the web in the first place."

And if you don't want people breaking into your house, don't build one. If you don't want people driving your car without permission, then don't buy one.

"So sue every user with Netscape, MSIE, and Opera: they are copying web content into their own caches all the time."

There's permission for this embedded in copyright law (You know? The one that every "/." reader is an expert on).

Re:IAA"/."L. by Anonymous Coward · 2003-07-14 06:09 · Score: 0

"Any good search engine should ignore "robots.txt". If you don't want it read, don't put it up on the web in the first place.....And if you don't want people breaking into your house, don't build one. If you don't want people driving your car without permission, then don't buy one.

Huh? The only way that analogy would be anywhere close would be if you dumped your house's contents in the middle of the street, or copied your keys at K-mart and tacked them up on bulletin boards all over town with "drive me" on them.

If you put it up on the public web to be read, don't cry and whine if it ends up on a search engine and someone (GASP) reads it. (hopefully a good one with a more complete Internet snapshot due to ignoring robots.txt)

Free access to post on NYT website. by Anonymous Coward · 2003-07-14 06:17 · Score: 0

username: jaysonblaire
password: foolja

either way by noldrin · 2003-07-14 06:19 · Score: 1

Either way I'm not registering with the NY times, I don't want to bother with another login just to read a newspaper. It would be nice if slashdot would just ignore them and give it's business somewhere else.

Favorite fake e-mail addresses by Anonymous Coward · 2003-07-14 06:33 · Score: 0

" my personal favorite has always been - ucan@suckit.now"

I'm fond of

nevergiven@neverspammed.com

Copying webpages.... by IamLarryboy · 2003-07-14 06:37 · Score: 2, Informative

happens everytime you go to a website. Creating a copy of the content is the primary means of internet communication. I don't see how google caching the pages is any different than me viewing it in my browser. It's not like google takes the credit for the content. If it were so, there would be no way for any web search to work without owning all the searchable content.

Re:You are clearly antisemitic by Abel+Wingnut · 2003-07-14 06:40 · Score: 1

Umm, I don't see how you proved that I am in anyway Anti-Semitic. I just said that I hate the ostensible inaccuracies and biases towards the Israelis. There's absolutely no necessity for such a predisposition in today's world, and it just irks me to no end when I read a lengthy report on an Israeli death due to Hamas gunfire in the midst of an Israeli led attack, only to briefly -- two to three sentences -- mention in the conclusion on Palestinian deaths caused by Israelis. That's not just or legitimate journalism.

Access Restriction Illegal by LuYu · 2003-07-14 06:54 · Score: 1

It should be illegal for the NYTimes to prevent access to those articles. If they do not want to pay to host the sites, fine, they can give them to another archive that will, but that information is ours to access like it would be if we were to visit a library.

--
All data is speech. All speech is Free.

Ageist behaviour by Washington post by Anonymous Coward · 2003-07-14 06:57 · Score: 0

Has anyone noticed the "survey" page from the Washington post? I have found that it is impossible to put a birthdate of 1899. Now, there are a few people in the world who are 104 years old or more (I am not one of them), so how can they truthfully respond to the Washington Post's reader survey? Or does the Washington post assume that people of that age don't use the Internet?

Re:Problem is potentially bigger than caching Re:Y by Anonymous Coward · 2003-07-14 06:58 · Score: 0

move or be moved

Re:You are welcome to use xxxxdd@xxxx.com any time by ryanwright · 2003-07-14 07:01 · Score: 1

You are welcome to use xxxxdd@xxxx.com any time.

Tried it. "This address has already been registered." Doh!

--
-Ryan, with the unoriginal sig

Death caused by Israelis? Not by Anonymous Coward · 2003-07-14 07:02 · Score: 0

" only to briefly -- two to three sentences -- mention in the conclusion on Palestinian deaths caused by Israelis."

This is because the deaths caused by Israelis are very rare and typically actually caused by Palestinian militants who use their own civilians as human sheilds.

Even now, groups like Hamas and PFLP still locate their hideouts in civilian areas.

Google does not steal anything AT ALL by Anonymous Coward · 2003-07-14 07:06 · Score: 0

"But Google's opt out program is more akin to telling burglars "Hey, I didn't want you to steal my TV, please return it." The difference between Google and a real burglar is of course that Google "returns the tv" if you ask them to do it, but returning stolen goods doesn't make theft legal."

More horrific abuse of the English language. It is not theft is nothing is stolen, and when something is copied it is never stolen (as it never meets the part of the definition of theft which includes "taking away").

google news cache - no longer exists by naoiseo · 2003-07-14 07:09 · Score: 1

it used to be, for the first few months of google news' existence, that you could check the cached copy of the news article as it was fed to google.

this isn't what ppl think. the cached copy todays post is referring to here is the google search engine cached copy, the one spidered by the crawling bot.

About two months ago google stopped access to the google news direct cache. You could request the cache for a document the same way you always did, with a cache:url.tld/blah.html, but you had to do it from the newsDOT server, which the toolbar did not default to. In effect if you 'checked the cache' on a news story it would redirect you to wwwDOT instead of newsDOT and tell you no such document exists. a simple rewrite of the subdomain, and pow, a plain text cache of how the news story was fed to google.

I have to say, in the couple of months that we had access to this, information watching was a whole new field. I personally came across more than a handful of articles that were pulled within 24 hours by the newspapers in question. Crazy interesting. no tinfoil involved.

Google seldom respects meta-tags to stop caching by LionMage · 2003-07-14 07:30 · Score: 1

Speaking from personal experience, Google's sloppy handling of meta-tags (which can tell web spiders/crawlers to "go away" and not index a given site) is a long standing issue which won't go away overnight. On LiveJournal, an individual user is allowed to select a check-box on their settings page which determines whether search engines may index that user's journal or not. I've got my journal set up to disallow indexing.

Theoretically, Google should not turn up search results from my journal.

However, the reality is, people have sometimes been able to find entries in my journal using Google. This got me in hot water a while back, because I had a public journal entry pertaining to my use of a wireless access point installed on the network of a certain company... That was how I got traced.

Without commenting on the ethics or morality of what I did, or whether I deserved the fallout that came after, I'd like to point out that none of this would have happened if Google had respected the meta-tags on my journal, which expressly forbade indexing.

So the only way for Google to honor the wishes of the New York Times is to give the NYT preferential treatment -- and even then, a few articles might slip through the cracks. I used to think that web caching and indexing sites regardless of the wishes of the site owner was no big deal. Now, I'm not sure where I stand.

The Reason For All Tech Moneytalks by Mukkachuk · 2003-07-14 07:33 · Score: 1

You know they're just aching for summa dat cache money. ...yo.

Re:Free registration..some implications by angle_slam · 2003-07-14 07:41 · Score: 1

We're talking about user's registering to view web pages. Not authors registering with the copyright office. Presumably, a large media organization like NY Times registers all their works with the copyright office.

don't like it? Start your own newspaper by asscroft · 2003-07-14 08:04 · Score: 1

Start your own newspaper. It's a free country. Get bigger/better than the NYT. Get reliable sources people can count on. Get top notch articles. Get the stuff people want to read at your site and then DON'T REQUIRE LOGIN and we'll all use your newspaper site instead of the NYT.

Keep in mind there are tons and tons of newspapers out there, and yet we still link to NYT more than most others, and that's not for nothing.

Good luck. Be sure to post here when you're up and running.

Oh, and if you're thinking of simply caching the NYTs everyday, don't bother. That's w34k.

--
because I have been enjoined by this Holy Office to abandon the false opinion which maintains that the Sun is the centre

You can't start your own newspaper by Anonymous Coward · 2003-07-14 08:16 · Score: 0

You can't start your own newspaper, because Clear Channel controlls all of the media.

This is what Moveon.org told me.

If you don't want it read, don't put it up! by Anonymous Coward · 2003-07-14 08:20 · Score: 0

"If I happen to have sex with the blinds open, should I have to assume that somebody like you is standing in the building across the street with binoculars?"

Bad analogy. We're talking about putting stuff up on the public Internet, not leaving our firewalls down. A better analogy would be having sex in the middle of the street. *GASP* somebody saw it!

" "if you didn't want to read spam and learn about my valuable products, you wouldn't have an email address.""

Nothing like that at all: the spam intrudes into your space. Reading and caching web pages you put out into the public involves nothing like this.

" but what kind of jerk would want to actively create a world where that was the case (which is what you seem to be advocating)? "

A world where if you publish something *GASP* someone might read it.

Here ya go by Anonymous Coward · 2003-07-14 08:34 · Score: 0

username: Slashdot33 email addy:slasher@slashdot.org Password: Slashdot I just don't get it. The user ID Slashdot was already taken!

NYT well written, serious ??? by Anonymous Coward · 2003-07-14 08:52 · Score: 0

Maybe we can agree that the NYT is a well-written, serious and interesting newspaper.

If you agree with political agenda of the NYT, then it is well written, serious and interesting. Case in point: All of the many articles writen with the express purpose of pressuring the golf club where the Masters tournmanet is held (Augusta) to accept women as members. Remember, Augusta is a private club receiving no funding from any level of government. Shouldn't a private club be allowed to admit whomever they want?

Do you think the NYT would back me if I wanted to join a welders union but was not a welder. Should the welders union be forced to accept me?

How about backing a male running for president of NOW?

The NYT wants a double standard in that they only want to apply rules to some orginization or person when it suits the political goals of the NYT, they do not want to apply them when it does not.

Re:Free registration..some implications by Anonymous Coward · 2003-07-14 08:59 · Score: 0

Yahoo....for email

NYTimes wants Google to cache, here's why by tfcdesign · 2003-07-14 09:09 · Score: 0

NYTimes wants Google to cache their site otherwise NYTimes will drop from the scope of millions of Google users. That's ad revenues and everything dropping.

The value of Google users accessing NYTimes via cache and reenforcing brand recognition of the news is not the maximum value of the news-product .

The NYTImes just wants its cake to and to eat it, too.

A contradiction? by yeti+(dn) · 2003-07-14 09:11 · Score: 1

Why would NYT want their content cached AND don't want anyone to actually use the cached page? Waht's the point of such a cached copy -- beside wasting Google's resources?

--
Life is the slowest way to death.

Stop Registering! by Anonymous Coward · 2003-07-14 09:40 · Score: 0

The Internet simply does not need news and information web sites that require registration.
Stop registering.

There are plenty of sites out there that have things to offer without requiring registration in return. It is a privelege to be able to put a site on the net and have it be viewed by many people, and the attention given by many people is payment in and of itself. I reject additional payment in the form of registration.

Make NYT choose between getting with the program or dying. Don't post stories with links to NYT. When you see a page from NYT or Washington Post that wants your name and email address, hit the Back button. The web will evolve to become a better place.

Those who argue that all good web sites and all content will vanish without registration are just silly.

can you elaborate? by Anonymous Coward · 2003-07-14 10:17 · Score: 0

Couple of clicks?

For those of us less technologically inclined, can you provide a little more detail?

tia!

Re:can you elaborate? by anonymous+loser · 2003-07-14 12:05 · Score: 2, Informative

It's easy. Let's say you want to read this article (which is the top story ATM):

"Iraqi Council to Seek U.N. Seat; One G.I. Killed in Baghdad"
The URL is:

http://www.nytimes.com/login.asp?URL=http://www. ny times.com/2003/07/14/international/worldspecial/14 CND-IRAQ.html
(or something like that)

Well, instead just substitue archive.nytimes.com:
http://archive.nytimes.com/l ogin.asp?http://www.ny times.com/2003/07/14/international/worldspecial/14 CND-IRAQ.html

You will get a message that says something like "authorization error" and the browser then takes you back to the front page. However, when you click on the same story, you will get taken to the content rather than a login page.

furthermore by Anonymous Coward · 2003-07-14 10:22 · Score: 0

the number of reference to nytimes here on slashdot does indicate something about the value of that content, considering that nytimes is not even a technically oriented paper. It just has good covererage (what's a Jayson Blair or two among friends... read with a grain of salt, but still).

PS: besides complainers are lame since nytimes allows several ways around their registration... besides caching (there is the partners. links and there is the fact that you don't have to read it if you go through google or other partners that don't require registration).

Re:You are welcome to use xxxxdd@xxxx.com any time by pyrrho · 2003-07-14 10:28 · Score: 1

or better yet... the nytimes doesn't use that email for anything... you just set your own password, you can use any old email address...

--

-pyrrho

Re:You are welcome to use xxxxdd@xxxx.com any time by valkraider · 2003-07-14 10:44 · Score: 1

Hehe - tell these guys about it:

asdf.com

Re:Free registration..some implications by bobbozzo · 2003-07-14 10:59 · Score: 2, Insightful

Yeah, I always like to try abuse@domain for sites that require registration. Kinda mean to the postmaster, but if I "opt-out" and they still send something then they're spammers anyways.

--
Nothing to see here; Move along.

quagmire by Anonymous Coward · 2003-07-14 12:07 · Score: 0

"They haven't apologized for front-page stories claiming Iraq was a quagmire"

well, maybe because it IS a quagmire... our guys are getting killed there every day since the so-called "end of hostilities"

Caching Middle East coverage by Anonymous Coward · 2003-07-14 12:20 · Score: 0

This debate would be a lot easier if we had a nice Google cache of NYT coverage of the middle east ;-)

The thing that disturbs me the most about the Isreal vs. Palastine situation is what it says about human nature considering the decendants of the victums of Nazi monsters are now acting... well, just like the Nazis. They've turned entire cities into concentration camps and taken to bulldozing houses with people inside.

not sure if this was covered already... by edrugtrader · 2003-07-14 13:10 · Score: 1

but those links provided aren't to google's cache of the site, but to the NYTIMES site directly with a flag that turns off their "require login" scheme... the simple variable change is &partner=GOOGLE

tell google not to cache already happens, they need to fix the hole on their end.

--
MARIJUANA, SHROOMS, X: ONLINE?! - E

Re:moron why Godless corepirate nazis fear the cac by Anonymous Coward · 2003-07-14 14:30 · Score: 0

yes, I agree, and would like to subscribe to your newsletter

What is the problem by SCHecklerX · 2003-07-14 14:51 · Score: 1

Seems the perceived problem is 'dead' links. Isn't an archive of NEWS of all things a GOOD thing??? Sometimes you want to see the old/original version of whatever.

Re:You are welcome to use xxxxdd@xxxx.com any time by Coke+in+a+Can · 2003-07-14 15:26 · Score: 1

My favorite: nobodies@home.com
Also: fyou@your.ass

NYT has bigger fish to fry. by rbullo · 2003-07-14 16:20 · Score: 1

Try the Wayback Machine. It caches EVERYTHING - NO EXCEPTIONS.

--
OH NOES!!! IT APPEARS YUO DO NOT HAVE ENOUGH MONEY TO PAY FOR DIS HERE PIZZA! WAHT EVER ARE YOU GOING TO DO!?!?

Re:NYT has bigger fish to fry. by rbullo · 2003-07-14 16:28 · Score: 1

Oh, wait, sorry. I'm wrong. You can exclude sites from the Wayback Machine the same way you woult for Google's cache.

--
OH NOES!!! IT APPEARS YUO DO NOT HAVE ENOUGH MONEY TO PAY FOR DIS HERE PIZZA! WAHT EVER ARE YOU GOING TO DO!?!?

Use standard nospam/nospam login by Krellan · 2003-07-14 17:38 · Score: 1

What I do at sites like NYTimes is to use a generic login that can be shared with many other people. This will help with concerns of privacy and being tracked.

Start with username "nospam", password "nospam". Enter bogus information when registering the account, and use a throwaway email address. If that username/password is already taken, advance to the next one: username "nospam2", password "nospam2", and so on.

At NYTimes, I registered this, and currently use the following:

nospam2/nospam2

This works great. If this should ever stop working, hopefully someone will register nospam3/nospam3. I'm glad to see this is becoming a standard. At several sites now, I can just try nospam/nospam, maybe nospam1/nospam1, then nospam2/nospam2, and so on. Often, I get in, or just create the account if it hasn't been already created.

Try this the next time you get nagged for registration when trying to read a newspaper :)

--

Dr. Demento On The 'Net!

Tripod by Anonymous Coward · 2003-07-15 06:53 · Score: 0

Aren't there Tripod hosted pages that get around this already? The google cached version of tripod pages tends to cover the background with "Hosted by Tripod" images, making the text near impossible to read, forcing the frustrated user to click on the non-cached link, which doesn't have these images in the background. (Although, it may make google translations difficult as well.) I'm not exactly sure how tripod does this, but they seem to do it pretty effectively.

Big_big_trash@... by Futurepower(R) · 2003-07-15 14:58 · Score: 1

My significant woman tried to register big_trash as an email login name, but it was already taken. She had to use Big_big_trash, instead.

Re:How did google GET the copy ? by smeenz · 2003-07-19 09:43 · Score: 1

-1 redundant ? jeeze you moderators are mean today. There was only one other post asking that when I posted which I didn't see until later as it was buried in a thread.

I didn't see anyone answering the question either.

Slashdot Mirror

Web Caching: Google vs. The New York Times

518 comments