Google Abandons Plan To Archive World's Newspapers

← Back to Stories (view on slashdot.org)

Google Abandons Plan To Archive World's Newspapers

Posted by Soulskill on Friday May 20, 2011 @04:51AM from the can't-google-a-word-you-don't-know dept.

An anonymous reader writes "Throughout the past few years, Google's newspaper-scanning project has digitized more than 60 million pages from newspapers spanning 250 years, including such gems as the moon landing. But according to the Boston Phoenix, this ambitious effort is slated to soon be axed in favor of Google One Pass, a platform for publishers to monetize content from their own sites."

19 of 60 comments (clear)

Min score:

Reason:

Sort:

What?!? by d3vpsaux · 2011-05-20 04:55 · Score: 3, Funny

Google wants to help others make money (and make a little themselves) with one of their projects?
Forget Google search, I'm going with BING instead. Microsoft would never do this!
1. Re:What?!? by gnick · 2011-05-20 05:00 · Score: 2
  
  Let's examine 2 business models:
  1) Do a bunch of really neat stuff that everyone will like at great expense and give it away for free.
  2) Find a way to make a reasonable profit by providing useful services.
  I know which model I'd pick and don't even consider it 'being evil'.
  
  --
  He's getting rather old, but he's a good mouse.
2. Re:What?!? by hedwards · 2011-05-20 05:13 · Score: 4, Informative
  
  I've got a feeling that this has more to do with them not being able to secure rights to the newspapers. At least for the more recent ones they would require authorization to do so from the copyright owner. I'm not sure why the older ones aren't being scanned though. Perhaps that's a matter of Google needing access and most of those papers being held by the newspapers that published them originally.
Short-sighted... by saleenS281 · 2011-05-20 04:55 · Score: 2

So what happens when some of those publications inevitably go out of business? We lose all of their works forever? I would hope that google could come up with some sort of middle ground. Why not continue the archival process, but allow companies still in business to choose what content is free, and what requires a fee or ads? There has to be a way that the companies can profit while still protecting us from losing the information permanently...
1. Re:Short-sighted... by KenSeymour · 2011-05-20 05:16 · Score: 2
  
  Newspaper articles about the 1969 Moon Landing is (was?) on microfilm in different libraries around the world -- unless every single one of them tossed it all out.
  Then there is the Library of Congress.
  Ironically, if newspapers from the past are not conveniently available online, it increases the value of microfilm or digital media that each local or university library keeps. That makes them less likely to be tossed out.
  In the near future, can people wait for authors and researchers to visit libraries, use a machine to review the material, combine their own analysis info
  a book or article in a monthly magazine? Will people read either if they aren't available on their version of eBook reader or tablet?
  
  --
  "We can't solve problems by using the same kind of thinking we used when we created them." -- Albert Einstein
2. Re:Short-sighted... by Al+Kossow · 2011-05-20 05:23 · Score: 2
  
  In the near future, can people wait for authors and researchers to visit libraries, use a machine to review the material, combine their own analysis info
  a book or article in a monthly magazine?
  The short answer is "NO"
  It is much more efficient for a researcher to search OCRed indexed content.
  You can literally save years of time researching a topic if documentation and artifacts are available somewhere on-line.
  It also helps with peer review. You can now reference hundreds of documents that reviewers may not have physical
  access to.
3. Re:Short-sighted... by Anonymous Coward · 2011-05-20 05:27 · Score: 2, Interesting
  
  In socialist Germany we have a government-run library that archives all German-language print-publications. Submission is mandatory (with a few excepts for low print runs and the likes). No need to rely on the good-will of short-lived corporations.
  http://en.wikipedia.org/wiki/German_National_Library
4. Re:Short-sighted... by DerekLyons · 2011-05-20 06:56 · Score: 5, Interesting
  
  Newspaper articles about the 1969 Moon Landing is (was?) on microfilm in different libraries around the world -- unless every single one of them tossed it all out.
  Funny story about that... A friend of mine was studying old newspapers in the microfilm collection at his university, and found that several key issues of one paper were missing. He didn't think this would be a problem, as it was a semi-important newspaper and was thus archived at multiple libraries across the state. So off he went to the other libraries and found the same dates missing in every library. He checked a couple of out-of-state libraries... and no matter where he checked they were all missing the same issues.
  
  Turns out one company had microfilmed one libraries collection - and the others had all bought the microfilm and trashed their paper copies and nobody had ever actually verified that the microfilm represented a complete run.
  
  So, just because the "backups" are distributed is no guarantee they are complete.
5. Re:Short-sighted... by adolf · 2011-05-20 09:20 · Score: 2
  
  Turns out one company had microfilmed one libraries collection - and the others had all bought the microfilm and trashed their paper copies and nobody had ever actually verified that the microfilm represented a complete run.
  Well now, that's a perfectly reasonable explanation.
  But I, for one, blame the Illuminati.
  
  --
  Kid-proof tablet..
It won't exist in perpetuity at Google either by Anonymous Coward · 2011-05-20 05:00 · Score: 4, Insightful

Google won't exist forever, what happens to the data then?
They should be taking lessons from the Stanford LOCKSS project.
1. Re:It won't exist in perpetuity at Google either by swanzilla · 2011-05-20 05:38 · Score: 2
  
  Ctrl+P
  
  --
  0 = 1 + e^(Alt something)
That's great by H3lldr0p · 2011-05-20 05:06 · Score: 2

If it works.
The real problem being is that those in charge of making decisions at the newspapers don't have the desire to go along with this sort of change. And let's not be coy about it. They need to change and we need them to as well. All of us need good, solid reporting of all sorts.
So far, these sorts of changes have been happening right along without them. This is yet another stop on the grand train to the digital future which most of them are willing to ignore in hopes of something else happening.
Let's see if this time they'll get their collective heads out of the sand.
Libraries by pavon · 2011-05-20 05:10 · Score: 4, Informative

In addition to being archived by the newspaper company, most local newspapers are already archived by local libraries as microfiche/microfilm. This is often required by law, as public notices are required to be placed in newspaper, and a record of this must be kept. Important national newspapers are archived by the Library of Congress, as well as multiple other libraries, where they are also digitized.
This is where Google got their source data to scan/upload in the first place.
1. Re:Libraries by KenSeymour · 2011-05-20 06:47 · Score: 2
  
  Newsprint turns yellow fairly quickly. A physical newspaper is not designed to last. So making an image on Microfilm and/or digital archive is what's important.
  Before I looked it up, I thought Ben Franklin had started public libraries in the US. That is partly true. He started one public and one private (subscription) library.
  Both still exist. Andrew Carnegie started many other public libraries.
  So US public libraries have received both public and private funding. Google could choose to continue the tradition in this way. If they make it a foundation, it
  could live on after Google is gone. If they keep it in "the cloud", who knows?
  
  --
  "We can't solve problems by using the same kind of thinking we used when we created them." -- Albert Einstein
2. Re:Libraries by ArundelCastle · 2011-05-20 07:18 · Score: 2
  
  Libraries around the world also pay large amounts of money to vendors that have already digitized AND full-text indexed decades if not centuries of newspapers.
  ProQuest, Gale, EBSCO, Infotrac, Newsbank...
  I expect those vendors made a bit of a stink that Google was trying to put them out of business.
3. Re:Libraries by vtrhps · 2011-05-20 08:10 · Score: 2
  
  Nicholson Baker's erroneous argument that libraries are gleefully destroying newspapers and other older documents has been refuted again and again.
Best reporting is olders stuff by realsilly · 2011-05-20 05:11 · Score: 2

I've always found that reading through older articles, that one retains more of the facts as they unfolded versus today's news reporting which seems to be mostly heavily biased towards sensationalism and less facts.
But then again maybe that's how reporting always took place, and I only remember the good ole days because the actual fact is my memory is selective in how I remember those days.

--
Life takes interesting turns, but the most interest is when you're off the beaten path.
Makin' Money For Sure! by Frosty+Piss · 2011-05-20 05:15 · Score: 2

People seem to make the mistake in thinking Google is about something other than leveraging search to make money. But Google has been about the BUSINESS of search at least since its IPO. Holding some illusion that Google is altruistic is just fantasy.

--
If you want news from today, you have to come back tomorrow.
Re:OCR. Nope. by ArundelCastle · 2011-05-20 07:22 · Score: 2

See my comment to the parent. Half a dozen global vendors have already digitized and full-text indexed decades if not centuries of newspapers (depending on the importance and history of the paper). No matter how good OCR gets, it won't surpass the work already done by real humans doing data entry, transcription, and correcting OCR by hand.
The value added by Google is potentially offering this incredibly expensive effort of preserving information for free.