Slashdot Mirror


How Badly is Google Books Search Broken, and Why? (blogspot.com)

An anonymous reader shares a blog post: It appears that when you use a year constraint on book search, the search index has dramatically constricted to the point of being, essentially, broken. Here's an example. While writing something, I became interested in the etymology of the phrase 'set in stone.' Online essays seem to generally give the phrase an absurd antiquity -- they talk about Hammurabi and Moses, as if it had been translated from language to language for decades. I thought that it must be more recent -- possibly dating from printers working with lithography in the 19th century.

So I put it into Google Ngrams. As it often is, the results were quite surprising; about 8,700 total uses in about 8,000 different books before 2002, the majority of which are after 1985. Hammurabi is out, but lithography doesn't look like a likely origin for widespread popularity either. That's much more modern that I would have thought -- this was not a pat phrase until the 1990s. That's interesting, so I turned to Google Books to find the results. Of those 8,000 books published before 2002, how many show up in the Google Books search result with a date filter before 2002? Just five. Two books that have "set in stone" in their titles (and thus wouldn't need a working full-text index), one book from 2001, and two volumes of the Congressional record. 99.95% of the books that should be returned in this search -- many of which, in my experience, were generally returned four years ago or so -- have vanished.
Further reading: How Google Book Search Got Lost; Whatever Happened To Google Books?; and Google's New Book Search Deals in Ideas, Not Keywords.

6 of 106 comments (clear)

  1. I am sorry for your pain using Google. by jellomizer · · Score: 4, Interesting

    However should you have gone to a library, and perhaps worked with a Librarian to help guide you in your research?
    Google is a good search tool, but it isn't a research tool.

    --
    If something is so important that you feel the need to post it on the internet... It probably isn't that important.
    1. Re:I am sorry for your pain using Google. by jellomizer · · Score: 3, Interesting

      Well Google's Marketing would say most anything to keep the company in good graces.

      However Google and its like services, Are part of the solution but not the full solution.

      Searching is an important part of research, Google is a good tool for researchers, but it only help them search. A modern Librarian, can help you use Google to get more context out of your searches, direct you to Non-Google tools, and often the library will have access to data that is often behind a paywall.

      --
      If something is so important that you feel the need to post it on the internet... It probably isn't that important.
  2. Google Books Has Been Deteriorating For Years by careysub · · Score: 4, Interesting

    Like most of its projects, Google has lost interest in Google Books and has not bothered to maintain it, much less continue developing it. This has been going for more than a decade now. NGram search for example stopped adding new texts to the index in 2008.

    Google fought and won a court case to put 25 million more orphan books which it had already scanned, out of print and largely unavailable, into Google Books. But decided it wouldn't bother. Because out of print books cannot be monetized, it would seem and thus are of no interest to Alphabet, which has over $100 billion in cash on hand. Spending a few million to support Books would shave a small fraction of a percentage off the growth of its investment wealth which is unacceptable to the company that has officially retired the "Don't be evil" slogan.

    At least they haven't pulled the plug on it entirely. I guess there is still some monetization to be had from in-print books.

    --
    Starships were meant to fly, Hands up and touch the sky - Nicky Minaj
    1. Re:Google Books Has Been Deteriorating For Years by H3lldr0p · · Score: 3, Interesting

      At least they haven't pulled the plug on it entirely.

      AFAIK Alphabet has put the "good" version in universities where the library admin does all the heavy lifting of scanning in books and such. That was part of the suit settlement. The public doesn't get to access researchers have.

      If I were the Fine Author, I'd head over to one of the unis that signed up with Google and use it there before declaring any sort of hard result.

  3. Re:the future of research is scary by 93+Escort+Wagon · · Score: 5, Interesting

    Wasn’t that one of the early signs of civilization’s decay in Asimov’s Foundation universe? Scholars no longer did original research on their own; they’d just study what previous researchers had already written on a subject, and re-summarize it?

    Sometimes it’s scary how prescient that dude was.

    --
    #DeleteChrome
  4. Re:the future of research is scary by TuringTest · · Score: 3, Interesting

    He was not prescient. Like Orwell with 1984, he was largely documenting contemporary trends of his time, just with enough insight to extrapolate their consequences.

    The literary device of Science Fiction was used to strip it the narration of the emotional attachment to real world politics, which is how SF usually works.

    --
    Singularity: a belief in the "God" idea with the "demiurge" relation inverted.