Google To Digitize Millions of Old Newspaper Pages
hhavensteincw writes "On Monday Google detailed new plans to digitize millions of newspaper pages with articles, photographs, and headlines intact so they can be accessed and searched online. 'Around the globe, we estimate that there are billions of news pages containing every story ever written,' Google said in a blog post. 'It's our goal to help readers find all of them, from the smallest local weekly paper up to the largest national daily.' For example, Google noted the availability of an original article from the Pittsburgh Post-Gazette from 1969 about the landing on the moon." When you search the news archive for, e.g., "Chicago fire" or "Rosenberg trial," a significant fraction of the result pages cost money to view.
Now, all those guys/girls who streaked during Woodstock are going to repent (more).
But seriously...
1. Guy/girl does something goofy in 70s as a teenager.
2. Gets covered by local news (at that time).
3. Google digitises that news.
4. Now CEO (then guy/girl) is suddenly let go.
Who hasn't done something goofy and, in retrospect, wished they hadn't (not necessarily something criminal)? Google might make their "second chance" disappear.
ps. Carly F. might have seen this coming ;-)
Guy/girl does something goofy in 70s as a teenager. Gets covered by local news (at that time).
I've seen that already. I looked up an executive, and Google returned a hit from a student newspaper from the 1960s that they'd digitized from microfilm. The story mentioned the guy being a member of the Socialist Workers Alliance.
Oh no! Exec dabbled with left-wing ideology in youth! By the way, I was a member of the Socialist Worker Student Society when I was a student because I was trying to impress a girl. Why would anybody care?
The people that freak me out are Young Conservatives. Those guys are creepy.
and we are all going to regret it. Remember the public library system? Or the archival organizations? A bunch of highly trained people with literally centuries of experience in classifying and cataloging information, preserving the originals and investing heavily in digitization to help with that task and to make them more accessible? Most of their services are free or at a minimal cost, especially for students and researchers. And completely ad-free (at least here in Europe). Sure, their marketing sucks, and they do not have the latest Web x.0 gimmicks. They tend to be a bit stuffier, old-fashioned and not as flashy as our bubble heroes of the "do no evil" (but don't do anything good either) kind, but then they on average tend to think in decades and not in quarterly results. Data (even massive amounts of it) is not information, and Google is not a research tool. Google will always tweak search results towards higher advertising revenues. It is at best a brute-force instrument with a very low signal-to-noise ratio. It is a pest because it leads people to believe that keyword search is a solid method for research, and it adds to the funding problems for libraries, because who needs a library when you can "google" everything? Google sucks up all it can get and leaves behind a desert without structure, significance or context. Support and use your local (national) library while you still have it.
Gather enough newspapers from all around the country and pretty much anything you find will be almost as reliable as finding something written by a random blogger on the web.
I find this comparison a little shaky. Major newspapers have long used professional (paid) journalists who are overseen by professional (paid) editors - both with reputations to protect. I don't see this type of control from a random blogger.