Google To Digitize Millions of Old Newspaper Pages
hhavensteincw writes "On Monday Google detailed new plans to digitize millions of newspaper pages with articles, photographs, and headlines intact so they can be accessed and searched online. 'Around the globe, we estimate that there are billions of news pages containing every story ever written,' Google said in a blog post. 'It's our goal to help readers find all of them, from the smallest local weekly paper up to the largest national daily.' For example, Google noted the availability of an original article from the Pittsburgh Post-Gazette from 1969 about the landing on the moon." When you search the news archive for, e.g., "Chicago fire" or "Rosenberg trial," a significant fraction of the result pages cost money to view.
http://news.google.com/archivesearch?q=%22armadillo+aerospace%22&scoring=t
Fuck I wish Carmack would stop using his Time Machine to get 1957 publicity.
How we know is more important than what we know.
From the billions of dollars of public good that is Google Maps to their true lack of evil, from their sucessful attempts to make the world a better place to the way they treat their employees, Google is truly great.
ALL HAIL GOOGLE. ALL HAIL GOOGLE. ALL HAIL GOOGLE.
If video games influenced behavior the Pac Man generation would be eating pills and running away from their problems.
Now, all those guys/girls who streaked during Woodstock are going to repent (more).
But seriously...
1. Guy/girl does something goofy in 70s as a teenager.
2. Gets covered by local news (at that time).
3. Google digitises that news.
4. Now CEO (then guy/girl) is suddenly let go.
Who hasn't done something goofy and thought in retrospect wished they hadn't done it (not necessarily something criminal). Google might make their "second chance" disappear.
ps. Carly F. might have seen this coming ;-)
I welcome this news. For too long, research on the Internet has been a frustrating task. For any events after about 1997, there's oodles of information. However there's a giant hole in the amount of information available for events before then. Google Books went some way towards addressing this, but it was still an intense task because a lot of the time, you still have to find and buy the books (or find them in a Library).
I really hope they plan to go as far as putting local, regional newspapers online as well.
At last, something that looks really GOOD, from Google! With free access, this will really change the world, even more.
History revisionists will find it even more difficult to dupe.
Maybe there are serious drawbacks, but, for the time I cannot see anything but the positive aspects.
I hope they aren't restricting it to just newspapers. I've saved tons of interesting web articles from official news websites that have mysteriously disappeared over the years. They're not even in the Google cache. Hopefully, most of them will be in the Google News archive.
Guy/girl does something goofy in 70s as a teenager. Gets covered by local news (at that time).
I've seen that already. I looked up an executive, and Google returned a hit from a student newspaper from the 1960s that they'd digitized from microfilm. The story mentioned the guy being a member of the Socialist Workers Alliance.
I hope to god that they edit out the advertising otherwise all us consumers will be frantic with longing for products that are no longer available, what with advertising not being a huge sham and all!
They whose government reduces their essential liberties for temporary security, receive neither liberty nor security.
Most amazing thing to me is on the next page is a story of the fucking asshole kennedy and his murder of Mary Jo Kopechne at Chappaquiddick...nice one Google.
My thirty-year, $50-billion plan to consolidate the microfiche market may well be in the shitter.
Why doesn't Google just purchase some of the better newspaper archive databases, such as NewsBank, and simply release all the stories for free? It'd likely be a lot cheaper than duplicating effort, and would help information be released more quickly.
Incidentally, if you're close to a university or a good library, many of these places already hold subscriptions to such services and offer the use of them for free. I'd love to see Google expand upon this already-good base rather than duplicating effort.
I wonder how the news cartels will react to their copyrighted works being copied and put online... they've tried to sue google just for displaying content available on their sites and referenced from their sites with links...
-- Sex is the antonym of pringles. Once you pop it's time to stop.
You can already access the archives of The Times online :
http://archive.timesonline.co.uk/
It's quite interesting to read about Marie-Antoinette's execution or Jack the Ripper's crimes, I especially like the writing style :)
I've latterly been thinking about the googlization of everything digital. I've latterly also been thinking about the spread of botnets (Storm, Kraken and the like). This has led me to conclude there is a Google Black Ops department intent on replacing Google's vast server farms with users' own PCs - i.e., Google aims covertly to use our computers as its hardware!
From Google's perspective it makes perfect sense to use idle cycles on Aunt Harriet's aging Dell to serve googlicious applications to an eager populace. Why shouldn't she host your gmail account?
The whole concept can even be justified from an environmental point of view: scaling is naturally proportional to demand and load-spreading is extremely efficient. In the long term, Google won't need any of its own hardware other than expensive corporate buildings equipped with limitless executive toys and a few dumb terminals. Hell, we're beginning to see that already. Everyone benefits.
As for the the spam emanating from botnets, this is a mere smoke-screen (or should I say cloud-screen?) designed to keep us off the scent.
I, for one, salute our new Gotnet overlord.
"The Google makes work for idle scanners."
Genesis 1:32 And God typed
Google news, brought to you by the department of truth! :)
Let's hope they'll not be too selective in which articles they publish.
I don't care if they take over the world, just so long as i don't have to scroll through years of microfilmed newspapers ever again - it makes me feel seasick!
Now I can find out everyone I knew who's died with Google archiving the obituaries.
I'm not sure why this was modded "Funny". If Google really is doing regional and local papers, given enough time and effort on Google's part, I may well be able to find stories and obits detailing the lives of relatives and grandparents with whom I never had the opportunity to talk.
Now, if Facebook gets in on this action, things could get a little bit creepy. I don't look forward to being cyber-stalked by the dead.
At last, I can finally go back and tell my 3rd grade teacher THIS is why I didn't need to learn how to use a flippin microfiche!!
So... just like the London Gazette has already been digitized. The difference is, the Gazette began publishing in 1665. Sod the moon landings! You can read the front-line reports about the American Revolution.
http://paperspast.natlib.govt.nz
(already being done in New Zealand for some years thanks to the work of the National Library of New Zealand) papers available back to 1839. With text search too! Cool!
... would allow google to do the same thing. There's been so many times what was interesting came up in a book google searched only to have pages blanked out. Sometimes I wonder if they should just put advertising on the book itself and pay the owners/authors directly (for the hits/adclicks/being read, etc).
So when is google going to start scanning The National Enquirer or other tabloid newspapers, so the slashdotters can look up Natalie Portman's news with grits handy!
The purpose of writing is to inflate weak ideas, obscure poor reasoning, and inhibit clarity....Calvin
Or it might finally make people realize that we are all human, and a stupid act at 18 doesn't equate to judgment post 30. Naaahhh...
You must be new here. Welcome to Earth. We're a little strange here, but you will find that some of us can be relaxed and groovy. Enjoy your stay.
P.S. Please take me with you when you leave the planet
"Empathise with stupidity, and you're halfway to thinking like an idiot." - Iain M. Banks
and we are all going to regret it. Remember the public library system? Or the archival organizations? A bunch of highly trained people with literally centuries of experience in classifying and cataloging information, preserving the originals and investing heavily in digitization to help with that task and to make them more accessible? Most of their services are free or at a minimal cost, especially for students and researchers. And completely ad-free (at least here in Europe). Sure, their marketing sucks, they do not have the latest Web x.0 gimmicks. The tend to be a bit stuffier, old fashioned and not as flashy as our bubble heroes of the "do no evil" (but don't do anyting good either) kind, but then they on average tend to think in decades and not in quarterly results. Data (even massive amounts of it) is not information and Google is not a research tool. Google will always tweak search results towards higher advertising revenues. It is at best a brute force instrument with a vey low signal to noise ratio. It is a pest because it leads people to believe that keyword search is a solid method for research and it adds to the funding problems for libraries because who needs a library, when you can "google" everything. Google sucks up all it can get and leaves behind a desert without structure, significance or context, Support and use your local (national) library, while you still have it.
No, I'm New Here
Do you know anybody who works in the news media? I do, several guys both in TV and paper news who have been placed all over the spectrum from editing room floors to the administrative level and even teaching positions at media and public relations colleges. They ALL report (privately) that the whole game is a giant crock of malarkey. The most interesting aspect is when the news teams don't even realize they're doing it, but simply re-broadcast biases and falsehoods because they are part of a form of non-deliberate groupthink. But it's worst when suggested stories are simply struck from the record because they don't match up with whatever political beliefs the owner happens to hold.
One of the big problems is the AP Newswire, to which so many large journals subscribe and pull feeds from word for word. --One thin little bottleneck through which major breaking news passes, meaning entire nations uniformly learn about events which are filtered by only a very small number of people.
The intriguing thing about bloggers is that they don't do this; they represent a broad and varied non-uniform message. This does not mean all bloggers are accurate or that there isn't the internet 'echo chamber' effect going on, but it does mean that there is actually a higher probability of actual news coming through the system. Have you ever clicked into democracynow.com? Some of the more prolific blogger sites have their own journalists covering stories and you generally get broader coverage, and people being interviewed in a non-soundbite kind of way.
-FL
"Don't be evil" is just an advertising slogan, like "At Pontiac we build excietement" (bad brakes, crappy handling), "Chevy - Like A Rock" (damned thing won't start), "At Ford, Quality is job 1" (Got their work cut out for them).
Pontiac's handling has gotten a lot better. The GTO was a bit squishy but the new G8 is said to be a worthy challenger to the M5. If that's not good brakes and good handling, then I do not know what is.
Similarly, Ford is now routinely winning various quality rankings in it car offerings... but Ford's problem is that it has too much debt and can't build enough of the cars it is selling all too well while at the same time has a lot of people building big trucks that no one wants.
This is my sig.
The problem is that you are going to end up with a single source for information and that in and of itself is a bad thing.
Undetectable Steganography? Yep, there's an app fo