Slashdot Mirror


Scan a Book In Five Minutes With a $199 Scanner? (teleread.com)

New submitter David Rothman writes: Scan a 300-page book in just five minutes or so? For a mere $199 and shipping — the current price on Indiegogo — a Chinese company says you can buy a device to do just that. And a related video is most convincing. The Czur scanner from CzurTek uses a speedy 32-bit MIPS CPU and fast software for scanning and correction. It comes with a foot pedal and even offers WiFi support. Create a book cloud for your DIY digital library? Imagine the possibilities for Project Gutenberg-style efforts, schools, libraries and the print-challenged as well as for booklovers eager to digitize their paper libraries for convenient reading on cellphones, e-readers and tablets. Even at the $400 expected retail price, this could be quite a bargain if the claims are true. I myself have ordered one at the $199 price.

7 of 221 comments (clear)

  1. Welcome to 2006 by ShooterNeo · · Score: 4, Insightful

    You've been able to do this for years and years a different way.

    1. Get a sheet fed scanner like a Fujitsu Snapscan ($400)
    2. Cut the binding off the book
    3. Place the stack of pages into the scanner
    4. Get a coffee

    And you're done, the thing's 600 DPI and does both sides in the same pass. It creates a PDF directly, and you then want to OCR the PDF, running a sharpen filter on the text, and decide on how much you want to compress the PDF. A 1000 page textbook ends up being about 700 megabytes, in crystal clear quality.

    1. Re:Welcome to 2006 by DavidRothman9947 · · Score: 5, Insightful

      Thanks, but what about those of us who might prefer nondestructive scanning? Also consider other factors--for example, the speed and quality of the scans, as well as the price. The Czur appears to be several times faster than a $600 model from Fujitsu that allows nondestructive book scans. If you're scanning lots of books, that won't be a trivial detail. As for quality, the Fujitsu is good but not nirvana. Let's see if the Czur will do better.

  2. The actual big news here: by tlambert · · Score: 4, Interesting

    The actual big news here: The company doing the indiegogo is located in Shenzhen, China.

    This is the first one of these I've seen. It struck me as very odd that the video narrator was an almost perfect midwest accent, but had terrible grammar and word choice, but when looking at the location of the startup, it became more obvious that this was actually an Indiegogo out of China.

    Anyway, good on them; I expect that we will be seeing a lot more people doing crowd-sourcing from non-U.S. locations, given that VC thends to be pretty tight outside of specific regions of the U.S. (which is, in turn, why most startups that go anywhere are U.S. based, rather than being in Europe, or elsewhere, where the funding climate is pretty terrible).

  3. OCR is the main problem by DrXym · · Score: 4, Interesting
    I read a lot of books from OpenLibrary (an awesome resource for old books). Most e-books are offered for download in EPUB and PDF format. The PDF is a direct book scan, the EPUB is OCR'd from the scan. Invariably the EPUB is filled with errors caused by OCR - hyphenated words not joined back together, page numbers appearing in the middle of text, words autocorrected to something else, chapter headings screwed up etc. Sometimes the OCR gives up entirely.

    It's simply easier to read the PDF although the file size is enormous and you're basically looking at images of some yellowing old book which means lots of panning and zooming particularly on small devices. And forget reading it on an e-reader.

    So yeah I think you could automate scanning of books, but the second step of getting it into EPUB format is the tricky part.

  4. Perhaps this entry should be marked as an Ad by Rob+Lister · · Score: 5, Informative

    Since this product gets free placement here at /., I figure it is okay to put in a word for the good folks at Distributed Proofreaders.

    Books are scanned and [sometimes roughly] OCR'd.
    Each and every word, period, hyphen, and ellipsis on each and every page is scrutinized by at least three proofreaders.
    Each bold, italic, underline and indent is evaluated by at least two formatters.
    The work is finalized in HTML, proofread as a whole, and published to Project Gutenberg in various formats, txt, pdf, html and epub.

    The resulting publication typically has far fewer publishing errors than the original book. This is especially true of books from the 17th century where drinking was part of a typesetter's expectation.
    Be a part of it.
    Sign up at http://www.pgdp.net/c/

  5. Re:CCD on a stick by naughtynaughty · · Score: 4, Informative

    A digital camera on a tripod PLUS ... Proper lighting Foot pedal interface Lots of software to take the pictures, manipulate the images and stitch them all together into an eBook So a bit more than just a digital camera and a tripod

  6. Is this a Cloud-only system? by timg11 · · Score: 5, Insightful

    The indigogo site says "Your sketches, paintings, and notes can be scanned and stored in the Czur cloud".
    Do we have the option to use our choice of server (maybe local)?
    What if I don't want everything that I scan going to a company in China?
    What if one day the "Czur cloud" is gone - is the scanner then unusable?

    Has anybody tracked down these answers? The product seem appealing if non-cloud, independent operation is allowed.