Slashdot Mirror


How Google's High Speed Book Scanner De-Warps Pages

Hugh Pickens writes "Patent 7,508,978, awarded to Google, shows how the company has already managed to scan more than 7 million books. Google's system uses two cameras and infrared light to automatically correct for the curvature of pages in a book. By constructing a 3D model of each page and then 'de-warping' it afterward, Google can present flat-looking pages online without having to slice books up or mash them onto a flatbed scanner. Stephen Shankland writes that the 'sophistication of the technology illustrates that would-be competitors who want to feature their own digitized libraries won't have a trivial time catching up to Google.' First, a book is placed on a flat surface, while above it, an infrared projector displays a special mazelike pattern onto the pages. Next, two infrared cameras photograph the infrared pattern from different perspectives. 'The images can be stereoscopically combined, using known stereoscopic techniques, to obtain a three-dimensional mapping of the pattern,' according to the patent. 'The pattern falls on the surface of (the) book, causing the three-dimensional mapping of the pattern to correspond to the three-dimensional surface of the page of the book.'"

21 of 209 comments (clear)

  1. Patent!!??!! by aashenfe · · Score: 5, Funny

    When is the patent office going to quit giving patents for obvious techniques? :)

    1. Re:Patent!!??!! by Dewin · · Score: 5, Informative

      I believe the pattern barcode scanners use is simply trying to look for the barcode in several different directions, but I could be wrong.

      I also believe there's either rudimentary correction for common types of distortion (i.e. on cylindrical objects) or just wide enough tolerances to allow it to work anyways.

      --
      Of course nobody reads the FAQ! If people read the FAQ, the Questions wouldn't be so Frequently Asked.
    2. Re:Patent!!??!! by Timmmm · · Score: 4, Informative

      You jest, but this technique *has* been around for years. I remember when digital cameras first became available there was a product that could perform a 3D scan by projecting a pattern onto the object and using an offset picture. I think the pattern came on a slide - that's how long ago it was! Here's a whole wikipedia page about the scanning technique: http://en.wikipedia.org/wiki/Structured_Light_3D_Scanner

      This picture is especially good: http://en.wikipedia.org/wiki/File:6-seat.jpg

      Anyway after reading the patent abstract, it isn't about the 3D scanning at all, it appears to be about an algorithm to find the fold once you've already got the point cloud. I would have thought that was fairly trivial. A possible approach would be to take the radon transform of the height map and find the smallest value that's roughly in the middle.

  2. So... by fuzzyfuzzyfungus · · Score: 5, Interesting

    How long before some particularly vengeful luddite publisher starts printing on treated paper stock that has an IR visible pattern, calculated to confuse these scanners, printed on it?

    They've been making "anti-copy paper" designed to defeat optical scanning for years now, surely something similar in the IR band could be effected...

    1. Re:So... by twistedsymphony · · Score: 4, Insightful

      they could probably do it in the visible spectrum as well, it would just take twice as long because they can't map and scan at the same time.

      Failing that there are alternative methods that might work as well.

  3. Re:IMPORTANT QUESTIONS by Anonymous Coward · · Score: 4, Funny

    The same way as your face.

  4. The New Bell Labs? by ObsessiveMathsFreak · · Score: 4, Interesting

    I've read many comments over the years about the old Bell Labs and how a huge amount of pioneering research came out of them over the course of their existance, i.e. before they got axed.

    It would seem that Google Labs is performing somewhat the same function, albeit more oriented towards software rather than physical research.

    --
    May the Maths Be with you!
  5. Mostest importanly... by Anonymous Coward · · Score: 4, Interesting

    ...who's flipping the pages?

  6. Re:Playing Catch-up by jsnipy · · Score: 5, Insightful

    but to be honest this is at least worthy patent

    --
    -- if you mod me down, I will become more powerful than you can possibly imagine
  7. What are the chances... by Shaterri · · Score: 4, Interesting

    ...that Google licenses this to scanner manufacturers and we see this at a consumer level at some point in the future? I know I'd pay good money for a book scanner that doesn't need to have a 'book edge' (which you already have to pay through the nose for)...

  8. Re:Obvious question... by Captain+Spam · · Score: 4, Funny

    That's cool and all that, but who (or what) flips the pages?

    Interns.

    --
    Demanding constant attention will only lead to attention.
  9. Re:Playing Catch-up by fuzzyfuzzyfungus · · Score: 4, Informative

    Obviously it was worthy enough to be issued; but I don't know how worthy it is in the broader sense.

    Notably, for instance, there has been a fair bit of interest, for some years, in using digital cameras in concert with projectors, either for automatic keystone/distortion correction, for projectors that aren't perfectly aligned with the projection surface, or for automatic coordination of multiple projectors illuminating the same surface, without laborious manual tiling adjustment. This is, in essence, an equivalent problem(inferring a surface's geometry based on pictures of a known image projected upon it).

    The IEEE has held "Projector-Camera systems" workshops since 2003, and somebody was obviously working on it before that. I'm not saying that Google's patent falls into asshole troll territory or anything; but the notion of doing surface geometry inference based on known image projection isn't nearly as novel as it might seem.

  10. Why is this a big deal? by MBoffin · · Score: 4, Insightful

    I don't see why this is such a showstopper for other book scanning projects. Right off the top of my head I can think of three methods of dewarping book scans that have nothing do to with Google's methods. While Google's method is definitely quite interesting and seems like a great solution, it is by no means whatsoever the only way of accomplishing this.

  11. You laugh, but look at this by langelgjm · · Score: 5, Interesting

    That's modded funny, but take a look at this.

    Maybe they use automated page turning machines for normal books, and turn pages by hand for older/more fragile works?

    --
    "Anyone who [rips a CD] is probably engaging in copyright infringement." - David O. Carson
    1. Re:You laugh, but look at this by StikyPad · · Score: 4, Funny

      Now THAT'S a page turner.

      Ba dum dum. Thanks, I'll be here all week! Try the veal, and don't forget to tip your waitress!

  12. Re:Playing Catch-up by ushering05401 · · Score: 5, Interesting

    Word.

    I was involved in evaluating rare books back around the turn of the century.

    I can personally attest that representatives of online book search companies were attempting to buy up one of a kind pieces for destructive scanning.

    There was one dealer in possession of a somewhat flawed, but well examined Shakespeare folio that had to put the kabosh on a reputation making deal because he found out the buyer was going to slice the piece out of its binding for scanning.

    I turned down a much smaller offer on a much less significant, but still very cool, two hundred year old angler's guide (with hand colored plates and original binding) for the same reason.

    Quality scans without destruction can only help raise the profile of rare books and the value they offer society - not simply for their content, but as tangible examples of the evolution of the art of communication.

  13. Re:Butt what about... by DRACO- · · Score: 4, Funny

    Is this what the graphics department is talking about bump mapping?

    Karma burn.

    --
    Consider yourself blessed if you are sneezed on by a dragon and only get wet, it could have been a fireball.
  14. Re:Why? by ChaosDiscord · · Score: 4, Informative

    Google is mostly scanning books borrowed from university libraries. Librarians get cranky if you borrow a book and return a stack of loose sheets of paper.

  15. Re:Playing Catch-up by BikeHelmet · · Score: 4, Funny

    This is actually what I envisioned for a book scanner, years ago.

    But unlike Google, I...

    1) Never built it.
    2) Am not facing lawsuits from overzealous sue-happy publishers.

    Seems like a good defensive patent to have.

  16. Re:Playing Catch-up by Anonymous Coward · · Score: 5, Interesting

    This trick has been used for 20 years in astronomy. You shine a really powerful laser of known metrics into the sky and measure the atmospheric distortion suffered by the beam.

    Then you take those numbers and calculate what it would take to even out the beam, and you feed THAT set of numbers to a telescope with adaptive optics which will then correct for the atmospheric distortion. Bingo, suddenly your telescope is able to take sharp images without having the air screw it up.

    The technique is very effective and results in ground-based telescopes that rival anything the Hubble can do. Plus they are easier to fix.

    I want to say this is called Guidestar but I am not sure.

    Anyway the similarity to Google's process is simply that you shine a light or image of known value on something unknown and look at how the image now deviates from what you expect. A little math and suddenly you know exactly the shape of the unknown object. Brilliant.

  17. Re:Playing Catch-up by tomz16 · · Score: 4, Informative

    It's simply called adaptive optics (AO). In AO, a guidestar is a natural isolated point-like star that is close to your science object (what you are trying to look at). If a laser is used to excite the sodium layer to create an artificial reference, it's called a "laser guidestar".

    Anyway, this "trick" is completely different from adaptive optics in both the mathematics and implementation.