Google's Book Scanning Technology Revealed

← Back to Stories (view on slashdot.org)

Google's Book Scanning Technology Revealed

Posted by kdawson on Thursday January 7, 2010 @07:32AM from the real-page-turner dept.

blee37 writes "Last March we discussed Google's patent for a rapid book scanning system. This article describes and provides pictures of how the system works in practice. Google is secretive, but the system's inner workings were apparently divulged by University of Tokyo researchers who wrote a research article on essentially identical technology. There are also videos of robotic page flippers and information about how Google wants to use music to help humans flip pages."

14 of 100 comments (clear)

Min score:

Reason:

Sort:

Now my PC by bugs2squash · 2010-01-07 07:33 · Score: 5, Funny

Can RTFA for me

--
Nullius in verba
MRI technology? by maillemaker · 2010-01-07 07:37 · Score: 5, Interesting

I often wondered if it would be possible for a book to be scanned while closed, using some kind of MRI technology that digitally sliced the book page by page, picking up on the density difference between the ink and the paper slice by slice.

--
A work that expires before its copyright never enters the public domain and thus enjoys eternal copyright protection.
1. Re:MRI technology? by SnarfQuest · 2010-01-07 08:04 · Score: 5, Insightful
  
  When the book is closed, the ink from facing pages will be mashed together, shouol you will need to be able to tell which page the ink is attached to. Since the ink mostly sits on top of the paper (if it soaks through you wouldn't be able to read the other side veery well) it is a very thin layer. Your scanning technology would need to be able to sense very small volumes of ink. I don't think we are anywhere close to the necessary precision yet.
  
  --
  Who would win this election: Andrew Weiner vs Andrew Weiner's weiner.
Librarian Chantey by _Sprocket_ · 2010-01-07 07:38 · Score: 4, Informative

Sea Shanties were sung in association with ship-board tasks (often repetitious in nature). Is Google paving the way for the Librarian chantey?
1. Re:Librarian Chantey by FooAtWFU · 2010-01-07 07:57 · Score: 5, Funny
  
  Cape Cod ladies don't use no books -
  Haul away, haul away!
  Well they read their stories on robotic Nook®s
  and we're bound away for Australia!
  
  --
  The World Wide Web is dying. Soon, we shall have only the Internet.
summary of summary. by Anonymous Coward · 2010-01-07 07:42 · Score: 5, Funny

human type book into PC, machine print book on paper, machine binds book ---time goes by--- machine unbind book, robot and human flip pages of book, machine photograph book, machine put book on PC.
Build your own.... by Lumpy · 2010-01-07 07:44 · Score: 4, Interesting

Simply set up a rig with 2 digital cameras and a plexiglass V to photograph 2 pages at a time. It's quite fast and cheap.
http://www.diybookscanner.org/
Works great. I built one to turn a couple of rare automotive books into PDF so I dont damage a $180.00 book in the garage.

--
Do not look at laser with remaining good eye.
1. Re:Build your own.... by Lumpy · 2010-01-07 08:30 · Score: 4, Interesting
  
  What idiot would use a $1800 laptop in the garage to view a PDF?
  Let me guess, you change your oil wearing a cashmere sweater and silk shirts as well.
  Nope. I risk my $40.00 fujitsu tablet PC that views pdf's just fine but has not enough Horsepower to do much else. works awesome as a garage PC to read PDF's and read the engine codes with my RS232-ODBII scanner/logger.
  
  --
  Do not look at laser with remaining good eye.
Missing the point? by johnw · 2010-01-07 07:53 · Score: 4, Insightful

Google is secretive, but the system's inner workings were apparently divulged by University of Tokyo researchers
Surely the whole point of the patent system is to grant exclusive use for a period in return for publishing full details of how whatever it is works? How can you have a patent without divulging the crucial information?
1. Re:Missing the point? by pclminion · 2010-01-07 08:45 · Score: 3, Insightful
  
  I work for a company with a lot of patents. Our products are protected partially by patents and partially by trade secret information. In other words, to recreate our product you would need to license the patents AND figure out how we did the other stuff, that is NOT patented, but is secret. There's no reason you can't mix patented and trade secret technology in one solution.
Musical page flipping. by Yaztromo · 2010-01-07 08:09 · Score: 4, Funny

...and information about how Google wants to use music to help humans flip pages.
...you will know it is time to turn the page when Tinkerbell rings her little bells like this...
Yaz.
We used to call them "Service Bureaus" by kriston · 2010-01-07 08:30 · Score: 4, Interesting

Back when we called them "Service Bureaus" book scanning was fast, easy, and cheap, as long as you didn't want the book back.
You deliver your book, magazine, phone book, map, large format document, or whatever to a Service Bureau.
They will then use a paper saw and cut the binding off and the other three sides to make perfectly smooth edges.
Then they put the whole mess into a hopper. The hopper feeds the pages to a scanner.
When it's done, flip the pile over and put it back into the hopper to get the odd-numbered pages into the scanner.
What you get back is your original book (as a pile of pages with no binding) and a CD-ROM of its contents in both original TIFF and OCRd text files. Now you can get them as PDF/A and DejaVu formats.
I suppose Google's point is that they don't want to ruin the books, or maybe they are so proud of their 3D-scanner enough to use it at all costs. But think of this: there are usually several thousands, perhaps millions, of copies of the books I've seen in Google's library, so destroying one copy of the book seems fair enough.

--
Kriston
1. Re:We used to call them "Service Bureaus" by Monkeedude1212 · 2010-01-07 09:04 · Score: 3, Informative
  
  There are ALOT of books out there which would NOT be suitable for this method. A friend of mine in University for Museum Studies often has to read these books which are incredibly old. I believe the University has a couple that date somewhere around the 1830's which is older than the books you find in the historical village we have in town.
  Yes, the university lets you read books that are old enough to belong in a museum. She showed me one of them one time. It was like a manuscript, Thick leather binding, nothing written on the front, heavy faded pages. I almost couldn't believe it.
  Sadly, that was the most exciting part of it. The writing was dryer than a desert, and it was on some subject that I had zero interest in. They are supposedly starting to go ALL digital, so I have no idea what they're going to do with those old books and mansucripts they've got sitting around.
  I hope they don't destroy them.
Re:Video of book scanner... by Jeremy+Erwin · 2010-01-07 08:34 · Score: 3, Informative

Elegant, hypnotic, and not what google uses. Google scans the books, lying flat. It projects a grid-like pattern over the pages in IR, photographs up the distorted image using 3D cameras, and recreates a 3D model of the book, and uses that model to undistort the pages. It uses human slaves to turn the pages, since robots aren't as gentle.
The link isn't slashdotted anymore