Searching for The New York Times
r.jimenezz writes "Adam L. Penenberg, an assistant professor at New York University, has written an interesting piece over at Wired about the contrast between the New York Times' relevance in the real world and the dismal rankings it gets in modern search engines' results. Penenberg discusses some very interesting ideas about opening up the Times digital archive and the impact this would have on its cyber presence."
Relevance is a highly subjective term. If you're a typical outspoken, liberal New Yorker, then its your Bible. If you live in a cabin in Montana, you probably don't give a shit. Calling something 'relevant' indicates much about the person doing the calling, as much or more than it tells anything about the item being discussed.
Personally, I think its a rag. It's old, its big, its supposedly a "standard", but no more relevant than my local paper. And probably LESS relevant than the sum total of whats available online - BBC, London Times, Die Zeit, Drudge, CNN.com, english.aljazeera.net, etc. etc.
I want to delete my account but Slashdot doesn't allow it.
The article assumes that the fault lies with the NYT and whether their archives are open. Perhaps the real fault lies with Google. Shouldn't there be something in Google that identifies certain sites and more reliable than others rather than basing rank solely on links? How many people link to online news articles? You're more likely to link to your friends beer-and-computer-mods page than a NYT article about Ashcroft's boot fetish.
*** *** You're just jealous 'cause the voices talk to me... ***
I have no problem with registering. If all I have to do is register an email address (heck, even a free hotmail address that i reserve only for spam) and my name, and maybe even my address, and I can get top quality news reporting without having to pay for the newspaper, then by all means I'm for it.
The reason why the NY Times is one of the best papers in the world is because they can afford to pay their employees what they deserve. If my registration helps up the amount of money they can get from their advertisers, then I'm all for it. People deserve to be paid for their hard work.
That said, I do believe they need to have better results on google, and don't agree with paying $3 for their archives that I can get at my local library for free.
Think of the children, people.
I think this touches upon a much larger problem.
Traditionally, libraries were the ultimate source of information. They were organised and well indexed - to help one find what they are looking for.
The internet has become an "instant library" to a lot of us. In ways, the internet is better than a library. Searching is trivial and the amount of information staggering. However, a lot of information is getting lost. I'm aware that there are Archiving sites, but often, these sites cannot index or record the information that sites present from their own MySQL/Oracle databases.
Search engines are really only good for searching a static site, and don't particularly scale well to sites that have content that change frequently.
It all boils down to this: HTML+Search Engine is not a good combination for giving people access to information over a long period of time. Web sites come and go (depending on the interest of their maintainters) and when they go, they're gone for good.
We need to start distributing the content on a global scale - the same way books distribute content among many people.
[ Monday is a terrible way to spend one seventh of your life. ]
In an interesting coincidence, just an hour or so ago, I was looking for an article I read online in the NYT. Specifically, I was looking for an interesting image which was in the article. (Not for any specific use, I just wanted to show a friend.)
Besides the fact that the article is in the archive now (yet less than a month old!) and costs money, the page also informs you that:
Please Note: Archive articles do not include photos, charts or graphics. Our photos are available for purchase, please click here for more information.
Clicking the link reveals that you can order a photographic print for $95, and that's if they have it.
I don't even want a photographic print! A 200x200 pixel bitmap would be fine! (and hardly damaging to their photo sales)
As the article points out, why would anyone casually link to a NYT story? There is simply no point in linking to something most can't access without paying.
They certainly deserve that Google ranking.
I'm not so sure the NY Times is outlandish in their pricing for archived articles. Articles from the past are a niche offering, and thus come with niche prices. If you really need an article from 1964, most likely a few bucks won't be too much trouble. The idea that you'll pay a price directly reflective of the cost of goods is ludacris. If it weren't, we'd be paying 4 cents for a coke, 2 dollars for a movie, and 5 bucks a month for internet service. Take a trip down to the library and spend a few hours finding the article on microfiche, if you can, or pay a few dollars and get it immediately at home.
As a rule I do not read any newspaper online that I have to register for. In fact, I refuse to purchase the Star Tribune or Pioneer Press here in Minnesota because of their policy requiring user registration. Fake accounts be dammed, you want me to read your paper and have to look through your ads you will let me do so without a cookie linked to information, fake or otherwise.
So they are supposed to provide world-class journalism and post it on a world-class website and you can't be bothered to host a cookie and look at some ads (which can be easily blocked anyway) in return?
What a massive sense of entitlement you have. Either that or a severe cookie-phobia...
Stop by my site where I write about ERP systems & more
From a newspapers perspective open archives aren't always a possiblity. I work for a newspaper in a Moderately sized (~100,000 people) midwestern city. We currently have about 135 years of paper archives dating back the the late 1800's. While we do have a decent internet presence, we don't have the resources to provide this conent online for free.
A recent estimate by me showed that we would need about $20,000 to get that project started in a very barebones manner. That isn't a small amount of money to throw at a project that you want to give away for free. On the other hand their is antoher newspaper in town that charges $90/year for access to their sports archives and at last estimate they had close to 1000 subscribers. For a medium sized paper that amount of money is hard to pass up.
Now for a company like the New York Times that is a different story. They certainly have the resources to get their content online. They though, have other reasons to keep their content available on a pay basis. They maintian strict controls over all their copyrighted material. Its hard to blame them for this though, since that content is their lifeblood.
In my opinion I do feel they keep their content under too tight of a lock. Its like having a great idea but never letting anyone hear about it because you are afraid they might steal it. Papers must decide between keeping their copyrighted material secure and providing it to readers in a new medium. But it is that delecate balance that traditional print publications now face while moving into the digital era.
The Times attracts 9 million unique visitors a month, while only about 1 million read the daily paper.
I find the extensive dead-tree version convenient and end up reading more from it than the on-line version that's free.
But, not having a lot of time during the week, I end up buying the print version maybe every 3 days, and quickly scanning the on-line headlines on the off-print days.
The Times really ought to open up its archive and let everyone, including Lexis-Nexis, have free access.
Many years ago at a university library they had an entire special catalog devoted to indexing old NY Times articles that one could read from microfiche. Without the individual paying, either.
There is still a fundamental chasm between archived high-quality material (especially true for scientific journals) and what is freely available and searchable on the web.
Think about how useful it would be for the general public to have access to old, high-quality archives like the NY Times and other scientific periodicals; the pursuit of science and other research would be considerably advanced over where it is today. Then there is the reality: copyright protections and the hope by the copyright owners for a few dollars more by charging for access (that only the very wealthy or institutions can afford) still persists.
It's almost enough that I think the government ought to exercise eminent domain (link to counterpoint about possible abuse of eminent domain - just as they do for land when a freeway needs to go through Aunt Tilly's backyard) and provide some reasonable compensation to the current copyright owners and to appropriate sufficiently old works and make them available publicly.
"Provided by the management for your protection."
To have a paper like the New York Times, who can command advertising rates as high as any paper in the world, bitching and moaning about their web presence and hoarding their articles like some stupid info-miser shows nothing more than a complete lack of understanding somewhere in the company. There is no excuse for it.
Uh, I don't know if you realized this, but newspapers ALSO make a lot -- a LOT -- of money on their archives. In fact, in some areas the only reason the local paper survives is an archival entity, selling their content digitally and on microfilm/fiche to universities and to services like Lexis-Nexus.
There is a big fear in the newspaper industry that opening their archives online will destroy this revenue stream without introducing a comparable new revenue. It is a very realistic fear...I used to work for an online newspaper company, and it was quite common to have customers putting up less than half of their print content after seeing massive drop offs in print sales. Many clients would ask us to clear their archives, so you could only search a month back.
I mean, the Times is a respected paper. Their articles are linked to all over the net despite the required registration, and they can expect every self respecting university to buy the year's microfilm roll. Offering the content for free could ONLY hurt them, so they'd be stupid to do so.
Hey freaks: now you're ju