Slashdot Mirror


Scan a Book In Five Minutes With a $199 Scanner? (teleread.com)

New submitter David Rothman writes: Scan a 300-page book in just five minutes or so? For a mere $199 and shipping — the current price on Indiegogo — a Chinese company says you can buy a device to do just that. And a related video is most convincing. The Czur scanner from CzurTek uses a speedy 32-bit MIPS CPU and fast software for scanning and correction. It comes with a foot pedal and even offers WiFi support. Create a book cloud for your DIY digital library? Imagine the possibilities for Project Gutenberg-style efforts, schools, libraries and the print-challenged as well as for booklovers eager to digitize their paper libraries for convenient reading on cellphones, e-readers and tablets. Even at the $400 expected retail price, this could be quite a bargain if the claims are true. I myself have ordered one at the $199 price.

12 of 221 comments (clear)

  1. Welcome to 2006 by ShooterNeo · · Score: 4, Insightful

    You've been able to do this for years and years a different way.

    1. Get a sheet fed scanner like a Fujitsu Snapscan ($400)
    2. Cut the binding off the book
    3. Place the stack of pages into the scanner
    4. Get a coffee

    And you're done, the thing's 600 DPI and does both sides in the same pass. It creates a PDF directly, and you then want to OCR the PDF, running a sharpen filter on the text, and decide on how much you want to compress the PDF. A 1000 page textbook ends up being about 700 megabytes, in crystal clear quality.

    1. Re:Welcome to 2006 by DavidRothman9947 · · Score: 5, Insightful

      Thanks, but what about those of us who might prefer nondestructive scanning? Also consider other factors--for example, the speed and quality of the scans, as well as the price. The Czur appears to be several times faster than a $600 model from Fujitsu that allows nondestructive book scans. If you're scanning lots of books, that won't be a trivial detail. As for quality, the Fujitsu is good but not nirvana. Let's see if the Czur will do better.

    2. Re:Welcome to 2006 by Maxo-Texas · · Score: 3, Insightful

      Several comments:

      1 So do VCR's.

      2 Most books are available within a day on multiple sites.

      3 Most books are available within days at libraries.

      4 This only slightly speeds up/makes the process easier. Anything you can read can be transcribed.

      5 80 people can transcribe 80 different books quickly.

      Who knows- they might try- but it seems like a waste of their money to me.

      --
      She was like chocolate when she drank... semi-sweet at first and then increasingly bitter.
    3. Re:Welcome to 2006 by Panoptes · · Score: 3, Interesting

      There are two curses of modern book publishing that cause problems whatever hardware you use. The first is so-called 'perfect binding' in which the folds of page gatherings, through which the sections are traditionally sewn together, are instead sliced off and glued to make a rigid spine with an exceedingly narrow angle of opening; the second is the use of low-grade, thin paper with high show-through that mucks up the scan.

      The best software I've found to scan and collate is Softi ScanWiz. With it you may scan one stack of pages, flip the stack and scan the other side - the program then shuffles the page images into the correct order. It also automatically adjusts brightness and contrast so as to minimise ink show through.

  2. The actual big news here: by tlambert · · Score: 4, Interesting

    The actual big news here: The company doing the indiegogo is located in Shenzhen, China.

    This is the first one of these I've seen. It struck me as very odd that the video narrator was an almost perfect midwest accent, but had terrible grammar and word choice, but when looking at the location of the startup, it became more obvious that this was actually an Indiegogo out of China.

    Anyway, good on them; I expect that we will be seeing a lot more people doing crowd-sourcing from non-U.S. locations, given that VC thends to be pretty tight outside of specific regions of the U.S. (which is, in turn, why most startups that go anywhere are U.S. based, rather than being in Europe, or elsewhere, where the funding climate is pretty terrible).

  3. OCR is the main problem by DrXym · · Score: 4, Interesting
    I read a lot of books from OpenLibrary (an awesome resource for old books). Most e-books are offered for download in EPUB and PDF format. The PDF is a direct book scan, the EPUB is OCR'd from the scan. Invariably the EPUB is filled with errors caused by OCR - hyphenated words not joined back together, page numbers appearing in the middle of text, words autocorrected to something else, chapter headings screwed up etc. Sometimes the OCR gives up entirely.

    It's simply easier to read the PDF although the file size is enormous and you're basically looking at images of some yellowing old book which means lots of panning and zooming particularly on small devices. And forget reading it on an e-reader.

    So yeah I think you could automate scanning of books, but the second step of getting it into EPUB format is the tricky part.

  4. Perhaps this entry should be marked as an Ad by Rob+Lister · · Score: 5, Informative

    Since this product gets free placement here at /., I figure it is okay to put in a word for the good folks at Distributed Proofreaders.

    Books are scanned and [sometimes roughly] OCR'd.
    Each and every word, period, hyphen, and ellipsis on each and every page is scrutinized by at least three proofreaders.
    Each bold, italic, underline and indent is evaluated by at least two formatters.
    The work is finalized in HTML, proofread as a whole, and published to Project Gutenberg in various formats, txt, pdf, html and epub.

    The resulting publication typically has far fewer publishing errors than the original book. This is especially true of books from the 17th century where drinking was part of a typesetter's expectation.
    Be a part of it.
    Sign up at http://www.pgdp.net/c/

  5. Re:CCD on a stick by naughtynaughty · · Score: 4, Informative

    A digital camera on a tripod PLUS ... Proper lighting Foot pedal interface Lots of software to take the pictures, manipulate the images and stitch them all together into an eBook So a bit more than just a digital camera and a tripod

  6. Re:that's like 40 ebooks to break even by jolyonr · · Score: 3, Insightful

    There are a lot of things that simply aren't available on ebooks. And if I purchased the book and I'm using the pdf for my own use then it's not piracy. At least it's not morally wrong to me, and that's the only thing that matters as far as I am concerned.

    --


    Please read my Canon EOS tech blog at http://www.everyothershot.com
  7. Re:reading by Chris+Mattern · · Score: 3, Informative

    Hint: That's why thousands of bookstores are closed, because people prefer eBooks over paper ones.

    No, thousands of bookstores are closed because people can select from a much wider selection from Amazon. Paper book sales increased 2.4% last year.

  8. Re:Finally a foot pedal for hands free application by aitikin · · Score: 3, Informative

    The only reason devices that can display printed sheet music like tablets and e-ink readers are not popular is that they are essentially useless for sight reading. A foot pedal for page turns could easily create a reader for musicians. It would catch on like wild fire and the music publishers could finally start to distribute good editions again. I have been saying this for years and no one listens, it is the usual routine with industry not seeing the forest for the trees that are still being cut to print music.

    You clearly have done zero research. There's a number of options, the most popular I've come across is the AirTurn, although the Cicada works well too from what I've heard.

    --
    "Don't meddle in the affairs of a patent dragon, for thou art tasty and good with ketchup." ~ohcrapitssteve
  9. Is this a Cloud-only system? by timg11 · · Score: 5, Insightful

    The indigogo site says "Your sketches, paintings, and notes can be scanned and stored in the Czur cloud".
    Do we have the option to use our choice of server (maybe local)?
    What if I don't want everything that I scan going to a company in China?
    What if one day the "Czur cloud" is gone - is the scanner then unusable?

    Has anybody tracked down these answers? The product seem appealing if non-cloud, independent operation is allowed.