Slashdot Mirror


Book-Digitizing Robots

Makarand writes "Robotic digitization systems are the new help available to complete voluminous scanning tasks. Robots that can turn the pages of books and newspaper volumes and attain scanning speeds of more than 1000 pages/hour are now available. They even use puffs of compressed air to separate sticky pages!"

2 of 233 comments (clear)

  1. Scanned pages by Ed+Avis · · Score: 4, Interesting

    This story is a good opportunity to plug some free software you could use to help digitize books.

    Stuart Inglis's tic98 is a lossless compressor designed for black-and-white scanned documents. It achieves better compression ratios than anything else, or at least it did a couple of years ago. If you have scanned documents to make available online, it's fairly simple to write a CGI script to convert tic98 on the fly to PDF.

    Hopefully someone else will reply to this comment with a recommendation of good free OCR software.

    --
    -- Ed Avis ed@membled.com
  2. Re:Project Gutenberg by tempestdata · · Score: 4, Interesting

    Well I have some good news for you. While, I was working (and I still am actually) on this project I asked the Digital Library Projects Manager, who is basically in charge of this project about releasing the books they scan to the public. His reply was that they were probably going to release a pretty significant portion of the books they scan to the public. The rest would only be available within Stanford University Libraries.

    So, you may at one point see those books freely available for download, provided they can get those copyright issues ironed out.

    --
    - Tempestdata