Slashdot Mirror


Software To Flatten a Photographed Book?

davidy writes "I have photographed some pages of a book for reading on my PDA. This is much faster than scanning and I don't have to carry the heavy books. However, the photographed books are not as nice: curved, skewed, and shadowed, as opposed to the much flatter, cleaner scanned books. I have searched for software that can flatten the pages for better reading on the PDA. So far I have come across Unpaper and Scan Tailor. Unpaper doesn't seem to have a windows GUI, and Scan Tailor doesn't unskew well. I remember reading about Google's technique of converting books to e-books with a camera and a laser overlay. Is there any home user software that can do a similar job without the need for a laser overlay or other sophisticated (and patented) technology?"

172 comments

  1. Snapter by brusk · · Score: 4, Informative

    Snapter is a bit cumbersome but that's what it does.

    --
    .sig withheld by request
    1. Re:Snapter by DingerX · · Score: 2, Informative

      Okay, as per my previous post, I'm trying Snapter. It might have crashed, for alll I know. I'm at 3 bars (out of about 20) on the left side of the first page, and one processor is pegged. We'll see if it comes out.

    2. Re:Snapter by DingerX · · Score: 3, Informative

      restarted. 30 minutes later, it threw a fatal exception.

      My short review: FAIL.

    3. Re:Snapter by Heymoe · · Score: 1

      Narrowly defined, Snapter works as advertised. It really does take the warp out of images, and it will also take out fingers on the sides if you need to hold the book open while photographing. The pdf files it saves are just images, not OCR'd text. That's a big drawback.

    4. Re:Snapter by brusk · · Score: 3, Informative

      I include that in my definition of cumbersome.

      --
      .sig withheld by request
    5. Re:Snapter by crath · · Score: 2, Interesting

      I trialed the software using files from a Pentax istDS (6 megapixel) DSLR, and since it worked quite well I purchased a copy. However, when I attempted to use it with files from a D700 (12 megapixel), it failed completely. So, I would only recommend it for 6 mp resolution files or less.

      Also, Snapter hasn't released a new version in over a year; so, it's almost abandonware---it's not possible that they haven't received a bug report or two that needed to be fixed, over the past 12 months.

      Net conclusion: try the demo with the actual files you plan to convert. If it works, great; if, not, don't buy it. If your files are >6mp in resolution; don't bother even toying with it.

    6. Re:Snapter by stephanruby · · Score: 1

      Have you tried qipit???

      It's supposed to work directly from your phone camera. And if your phone is not directly supported, it should still work if you email the picture to: copy AT kipit . com

    7. Re:Snapter by DingerX · · Score: 4, Informative

      Okay Cool. I found out what my problem was:

      1. The book must be on a uniform surface.
      2. All the edges of the book must be in the frame.
      3. Only hold the book down from the side.
      4. The photograph must be taken directly over the book.
      5. Use a dSLR for best results.

      Okay, so now try holding a dSLR directly over an open book that you're holding with another hand, from the side, and at a range where the entire book fits in the frame. At that point, you might as well build that book scanning rig.

      In short: FAIL.

    8. Re:Snapter by windex82 · · Score: 1, Funny

      If only there were some kind of device to hold a camera. Maybe something with three legs that retract for easy storage and some hinges for holding the camera at different angles....

    9. Re:Snapter by byung9 · · Score: 1

      you can just restart-- .sig withheld by request definitely

  2. sooo.... by Anonymous Coward · · Score: 1, Interesting

    after photographing your book you have these huge image files that are barely readable, and now you want to spend MORE time trying to make em legible, wouldn`t it just be faster to scan em?, after OCR they would be much smaller and you could edit em/annotate to your hearts delight too.

    seems like you made a problem looking for a solution rather than just scanning em in the first place

    1. Re:sooo.... by Anonymous Coward · · Score: 2, Funny

      Aye, this Ask Slashdot sounds more like "Doctor, it hurts whenever I do this." The traditional response to such nonsense is, "well, then don't do that!"

      Next question?

    2. Re:sooo.... by LarrySDonald · · Score: 1

      For one book, this kind of thinking makes sense. If this is a regularly occurring issue, it'd make sense to find a faster way then a flatbed scanner to get through it. I have no idea how feasible this is in real life. Working data entry we'd usually chop the back and put them in a sheet feed, but then you have a destroyed book and also sheet feed scanners aren't exactly commonplace. It's pretty interesting that there are at least some solutions around, I may just be too out of the loop to realize how much things have advanced.

    3. Re:sooo.... by Yvan256 · · Score: 4, Funny

      He can easily get his tiny 10 megapixels camera into the book store, but he would be stopped immediately if he tried to bring his scanner instead.

    4. Re:sooo.... by Anonymous Coward · · Score: 0

      I've taken my camera into remote parts of the library to copy things (and it was legal to do so). A few minutes of photography got me a half-dozen articles which took several days to wade through and analyze.

    5. Re:sooo.... by Anonymous Coward · · Score: 0

      we'd usually chop the back and put them in a sheet feed

      Exactly.

      you have a destroyed book

      Meh. You can always punch a hole and bind 'em together for future use. For non-rare books, it's a small cost and you haven't really destroyed much. Unless you're scanning library books, this is the way to go.

      sheet feed scanners aren't exactly commonplace

      They're not exactly expensive these days. Maybe $250.

    6. Re:sooo.... by Anonymous Coward · · Score: 0

      Huge images that are barely readable? You need a camera with a higher resolution. One of my aging cameras has better vision than the human eye, and I can photograph two pages and later zoom in enough to see the dots in non-glossy photos.

    7. Re:sooo.... by Anonymous Coward · · Score: 0

      No. Take the picture, image correct, OCR. *Much* faster than scanning/OCR. Scanning is mechanical. A picture is high-enough res and an order of magnitude or two faster.

    8. Re:sooo.... by Nefarious+Wheel · · Score: 1
      Is there a better way to bind a hardcopy book than the way we've done it for the last oh, eon or so? With so many different minds to tap, can't we come up with a better way to hold paper together than the common edge binding we're used to? Binding in signatures is elegant, but it's really only optimised for carrying the book - not reading (or scanning) it.

      I like paper books. Still.

      --
      Do not mock my vision of impractical footwear
    9. Re:sooo.... by b0bby · · Score: 1

      I have a book I'd like to photograph - it's a large, hand-typed bound family history, done in the 70s by a relative. You can't get it on a scanner without breaking the binding. I tried years ago to scan & ocr it with a handheld scanner, but the tools available then were just too painful I'm going to give it another shot now. So, some people may well find this discussion useful.

    10. Re:sooo.... by Mozk · · Score: 1

      I was going to post something similar. Just chop the binding off so you have a pile of flat pages that you can photograph. Then get the book rebound with coil/spiral binding.

      --
      No existe.
  3. Anonymous Coward by Anonymous Coward · · Score: 4, Informative

    Get a thick, heavy piece of glass and lay it atop the pages to flatten them out before you photograph them. Use ambient light and avoid the flash.

    1. Re:Anonymous Coward by polymeris · · Score: 3, Informative

      Also use a zoom lens and take the shot from as far as possible, to reduce curvature. The longer the focal distance, the flatter the picture will appear.

    2. Re:Anonymous Coward by Yvan256 · · Score: 4, Funny

      Also, avoid being seen by the bookstore clerks.

    3. Re:Anonymous Coward by Anonymous Coward · · Score: 2, Informative

      It doesn't have to be glass. Target stores have these nice plexiglass photo boxes. An advantage of them over glass is that the edge of the box helps hold the opposing page up.

    4. Re:Anonymous Coward by bazim2 · · Score: 1

      Many zoom lenses curve the field the opposite way when zoomed all the way in. Most lenses have barrel distortion when zoomed out and pincushion distortion when zoomed in. There'll probably be somewhere between where the field is flat. A few experimental shots of a grid will quickly find this out. Here's some examples from SLR lens reviews: http://www.photozone.de/canon_eos_ff/426-canon_28300_3556is_5d?start=1 http://www.photozone.de/canon-eos/295-tamron-af-24-135mm-f35-56-ad-aspherical-if-sp-lab-test-report--review?start=1 (there's many more on that site) Smaller compact cameras lenses often show hugely higher field distortion than decent SLR lenses.

    5. Re:Anonymous Coward by miggyb · · Score: 1

      Just rip the book's spine off. At this point, it's probably easier and faster to learn how to re-sew a book than to do all the software voodoo (unskewing, OCR) to get a somewhat usable ebook.

      --
      This signature serves no purpose other than to help you see which posts were made by me.
    6. Re:Anonymous Coward by timeOday · · Score: 1

      Using a zoom also allows you to use a flash. Flashing from too close creates less even illumination and, on the cameras I've tried, overexposure.

    7. Re:Anonymous Coward by Hurricane78 · · Score: 1, Insightful

      I think such a device is called a "scanner" :P

      --
      Any sufficiently advanced intelligence is indistinguishable from stupidity.
    8. Re:Anonymous Coward by Anonymous Coward · · Score: 1, Funny

      Yeah, a dumb fucking computer nerd thinks software is the best solution for every fucking problem. sheesh.

    9. Re:Anonymous Coward by rolfwind · · Score: 1

      If money isn't an issue and you can rip the book apart, the scansnap series of scanner is nice and fast. Just drop in 50 sheets at a time, and, depending on the settings, a 500 page book will be scanned in under 20 minutes. Pages stay flat, and you'll have an automatic PDF too - no conversion necessary except probably for small devices (iPhone).

      I tend to think photographing pages is slow, require either an expensive set-up, or you just get half-assed results that will drive you nuts when you actually sit down and read it.

    10. Re:Anonymous Coward by Jerry+Coffin · · Score: 1

      Better yet, avoid zoom lenses and get a dedicated macro lens. Nearly every camera manufacturer makes at least one macro lens, and often two or three of different focal lengths. For books, a ~100mm macro works quite reasonably.

      The biggest problem is that a macro lens can be somewhat on the expensive side. If you want to stay cheap, Cosina makes a 100mm f/3.5 macro that looks and feels cheap, but has quite decent quality optics. This is widely available under various other names (Promaster, Quantaray, etc.) In fact, nearly any current, off-brand, 100mm f/3.5 macro lens is likely to be made by Cosina.

      Compared to zooms, dedicated macro lenses are nearly always sharper, and (particularly) have extremely low distortion. FWIW, most of them work pretty decently as portrait lenses too.

      --
      The universe is a figment of its own imagination.
    11. Re:Anonymous Coward by Anonymous Coward · · Score: 2, Informative

      That's not what polymeris is getting at. Wide angle lenses create strong perspective foreshortening. That's why there is a sweet spot for portrait photography: too wide makes noses look big, too long leaves no perspective. Lens distortion is easily removed because it is inherent to the lens, so you only need to calibrate once and can use the profile for all pictures shot at the same focal length. Perspective distortion depends on the scene, so there is no "calibrate once, correct all" option without creating a repeatable setup.

    12. Re:Anonymous Coward by jonbryce · · Score: 2, Informative

      Barrel distortion can be easily fixed in photoshop, and once you get the right settings for your first pic, you can batch process the rest of them.

    13. Re:Anonymous Coward by fractalspace · · Score: 1

      Doesn't work.You get glare and/or uneven light no matter what you do.

    14. Re:Anonymous Coward by Anonymous Coward · · Score: 1, Informative

      Also use a zoom lens and take the shot from as far as possible, to reduce curvature. The longer the focal distance, the flatter the picture will appear.

      I'm sure you mean a tele lens. Zoom just means variable focal length. and could even a wide angle one.

    15. Re:Anonymous Coward by Hadlock · · Score: 2, Interesting

      If you're doing fixed height/lighting camera photography, you might as well just buy a cheapo screw mount macro lens + screw mount adapter.

      --
      moox. for a new generation.
    16. Re:Anonymous Coward by stephanruby · · Score: 1

      Or if you have an extra DVD player lying around, you could just take it apart use that to make the macro lens

    17. Re:Anonymous Coward by clone53421 · · Score: 1

      The flash still doesn't typically work well when photographing something behind glass, though.

      --
      Alexander Peter Kristopeit bought his basement from his mommy for one dollar.
    18. Re:Anonymous Coward by Lumpy · · Score: 1

      using the right flash allows you to use a flash.

      I take closeups within 8=12 inches of a persons face. I get perfectly illuminated flash photography.

      standing back and zooming works great for low end cameras, upgrade to a real flash with a bounce or diffuser and power setting and you can do far more. Upgrade to a pair of light boxes and adjustable stand flashes and you can do everything.

      --
      Do not look at laser with remaining good eye.
    19. Re:Anonymous Coward by Khashishi · · Score: 1

      Actually, lens distortion also depends on the scene, but usually the scene is much farther than the focal length so it doesn't change very much. But for close up shots, you'd do better with a macro lens.

  4. No it wouldn't be faster by DingerX · · Score: 1

    Seriously, have you ever compared the time photographing a book vs. scanning it? The fastest scanners run like photocopiers. With a book, all you need is to set up a decent or ghetto rig for the camera and turn the pages. Until now, I've been shooting with a DSLR at the same lighting/camera settings for each shot, and applying a batch transform process followed by a universal levels setting, finishing up with a PDF assembly. But I'll report back on how Snapter works on the same files.

    1. Re:No it wouldn't be faster by Anonymous Coward · · Score: 1, Informative

      Seriously, have you ever compared the time photographing a book vs. scanning it? The fastest scanners run like photocopiers. With a book, all you need is to set up a decent or ghetto rig for the camera and turn the pages. Until now, I've been shooting with a DSLR at the same lighting/camera settings for each shot, and applying a batch transform process followed by a universal levels setting, finishing up with a PDF assembly. But I'll report back on how Snapter works on the same files.

      Exactly, the document scanners used in libraries and archives are pretty much high resolution cameras on an adjustable stand. They don't work like flatbed desktop scanners where you have to squash the book flat on a plate of glass. As a result they are much faster, easier on the books and you get better quality scans for OCR processing.

    2. Re:No it wouldn't be faster by ZERO1ZERO · · Score: 1
      Flat bed scanners - well slow. So slow it's not even worth bothering with

      Orbital/Planetary scanners - much faster, essentially the same as your DSLR hanging above the page. Less than a second for a 'scan', rate limited by speed of turning the pages. A 5 secs to turn an scan that's 12 images a min or 720 an hour.

      What the GP was talking about: Duplex feeder scanners. If you can cut the spine of the book and feed in into one of these then your laughing. Some of them will do 2-300 full colour pages a minute with no sweat. That's 10,000 scans per hour..... Orders of magnitude faster.

    3. Re:No it wouldn't be faster by DingerX · · Score: 1

      Yeah. Totally sweet. We have tons of them where I work too. Only thing is, the IT department decided that scanning to PDF would be impossible to bill and invite copyright abuse, so they disabled any action that didn't result directly in printing to paper.

    4. Re:No it wouldn't be faster by plasmacutter · · Score: 1

      sounds like its time to find a place where the legal department doesn't run IT?

      when you leave tell them their tools are "defective by design" and you won't support them.

      --
      VLC FOR MAC IS DYING! IF YOU DEVELOP, PLEASE SAVE IT!!
    5. Re:No it wouldn't be faster by slim · · Score: 1

      Flat bed scanners - well slow.

      AFAIK "flat bed scanner" simply means a scanner where you place the item to be scanned on a flat plane of glass to be scanned. The technology that actually does the scanning is irrelevant.

      In which case, the Xerox printer/scanner in my office does flat bed scanning in the blink of an eye. I ran it with the lid up once to prove to myself it wasn't just a camera - and sure enough a sensor bar whips across the page.

      Of course, it's not as practical for scanning bound books as the Google solution.

    6. Re:No it wouldn't be faster by ZERO1ZERO · · Score: 1
      Good point.

      I suspect that Xerox machine you talk of will actually be moving just a light/mirror which is reflecting into a camera. (or possibly a CCD arrangement, but usually they are in cheaper machines) But either way scanning a book on a 'flatbed' whereby the book has to be placed face down is a major chore. turning the page means lifting the book, turning the page, then turning the book back over and repositioning it etc etc. Even if the scan takes a blink all that manual work will make it tedious at best.

    7. Re:No it wouldn't be faster by bronney · · Score: 1

      dude, hopefully you shot both left and right page open into 1 file. If that's the case, drag one into photoshop, record actions while you tweak. Then auto/batch the whole series with that action and see what's up.

    8. Re:No it wouldn't be faster by Anonymous Coward · · Score: 0

      Wow you guys have morons running your IT department. Scan to PDF eliminated 80% of the paper in the office here. We are a small company with only 30,000 employees in 3 locations. Finance department does everything exclusively as PDF, even contracts are scanned and the original destroyed as legal has deemed that a PDF scan of a signed document is as legally binding and secure as the actual paper. It eliminated the 1000 sq feet of filing cabinets we had.

      maybe you should recommend that the uneducated people running the IT department be replaced with some that actually know what they are doing.

    9. Re:No it wouldn't be faster by Blakey+Rat · · Score: 1

      Flat bed scanners - well slow. So slow it's not even worth bothering with

      Any decent mid-range multi-function copier will be able to feed its flatbed scanner from the input tray, meaning you can do 25+ pages at a time (depending on the model). And while it's not as fast as a drum scanner, it's not nearly as slow as you're implying here.

      Of course you have to destroy the book to use that option. But the equipment isn't hard to find: you likely have one in your office right now.

    10. Re:No it wouldn't be faster by Anonymous Coward · · Score: 0

      We are a small company with only 30,000 employees in 3 locations.

      And how big is a mid-size company in your opinion? 1,000,000 employees? Geez! The "small company" that I used to work for had 8 employees. 8! When your small company employs two times the entire population of my city, it's no longer a small company...

  5. Contact Scan Tailor Author? by syousef · · Score: 4, Insightful

    At version 0.9.6, perhaps Scan Tailor is 96% of what you want and it's F/OSS. If you *politely* contact the author(s) and lay out your concerns perhaps you can get what you need AND help make a project better. Worth a try.

    --
    These posts express my own personal views, not those of my employer
    1. Re:Contact Scan Tailor Author? by Anonymous Coward · · Score: 1, Informative

      Whoo, Mod +1 Funny!

      Any time I've ever done that I've either gotten crickets or flames.

    2. Re:Contact Scan Tailor Author? by Seth+Kriticos · · Score: 1

      From the screens on the site it looks like the author does not value Windows very much (the screens are done on Linux).

      If you want him to do development for the Windows platform, then you will have to ask nicely with some motivational argument (ask him how much he wants for making it work on the Windows platform, pay half in advance).

    3. Re:Contact Scan Tailor Author? by Anonymous Coward · · Score: 1, Informative

      Or, you know, if you find that there is a Linux only app that is exactly what you're looking for, you could just use Linux for it. Many of us are stuck with a Windows box, partition, or VM for the same reason. With Linux you can even run it off a CD or USB drive.

    4. Re:Contact Scan Tailor Author? by Al+Dimond · · Score: 1

      The home page says there is both a Windows and Linux version. Just because the screenshots are Linux doesn't mean the author doesn't value Windows; they have to have been made on one platform or the other, whoever made them happens to have done them in Linux.

      If the Windows version was broken for some reason it still might be possible to build the Linux version under Cygwin.

    5. Re:Contact Scan Tailor Author? by AmiMoJo · · Score: 1

      Or just check the Pirate Bay^H^H^H^H Mininova for a .pdf version and save yourself some hassle.

      Seriously, if fair use allows format shifting, does that mean you are required to do the shift yourself?

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  6. You need the proper kit by Anonymous Coward · · Score: 1, Interesting

    There are perfectly good machines that will do this for you.

    They have suction systems to turn the pages of the book, and hold the book partially open so that the pages are more or less flat. There's one camera for each page, and the software that comes with the system deals with the curving, and obviously gets the lighting right to avoid shadowing etc.

    OK, so maybe these machines aren't exactly cheap ... ... but at least one publisher is using them to photograph books (ones that are out of copyright, obviously) and put them into a print-on-demand programme. The deal is that if you suggest a book to the publisher, and they take you up on your suggestion, you get a free copy.

    1. Re:You need the proper kit by ZERO1ZERO · · Score: 1
      software that comes with the system deals with the curving

      This is what the OP is asking about. Plenty of these systems ship with the combined hardware/software pack and cost thousands or even tens of thousands of dollars.

      It sounds like the OP has taken a few shots of a handy book maybe at a friends house or whatever and would like to just 'sort' them out before finally archiving them on to his system. Kind of like how some people like to tag all the MP3s before 'committing' them to their system. Free, and FOSS options for this type of software are limited, I would be interested in some kind of thing as well.

    2. Re:You need the proper kit by markdavis · · Score: 1

      Yeah, I don't know why people keep suggesting such expensive and complex stuff for his "project". It isn't a project- he is just photographing a few pages.

      You could just fix them in Gimp.

      Anyway, as others have suggested, there is Scan Tailor, which is FOSS and multiplatform (and GUI)

      http://sourceforge.net/project/screenshots.php?group_id=227253&ssid=90796

    3. Re:You need the proper kit by Anonymous Coward · · Score: 0
  7. Rather than photograph or scan each page... by Anonymous Coward · · Score: 0

    Cut the binding off and run it through a duplex scanner with a document feeder.

  8. ahhh - book scanning by ZERO1ZERO · · Score: 3, Informative
    As with most scanning and other things, you can save your self immense amounts of hassle time and money later, by spending a fraction of that time up front sorting out the 'input'. A bit of glass over the book, using a scanner, or even getting a friend to hold a book will mean that your source image will be much better to start with.

    Not everyone has 5-10mm thick peices of book sized glass lying around and it can be hard to take that sort of thing about the place in case of requiring to photo a book.

    There is software called Book restorer that does this removes curves 'geometrical correction' etc but it's pricy.

    i've tried un paper and it's pretty decent for what it does but it does have some limitations and it's not the most convenient to use.

    Deskewing, cropping, filling, etc etc are all easily done and I've even written imagemagick batch scripts in windows to do these things. The major trick is the curve removal.

    There's various ways you can determine the curve from a scanned image. If you have the edge of the page, you can calculate the movement required to straighten that, and then apply it to the whole image. You can use text based curve removal, similar to well known deskew algorithms for text, but takes into account different parts of the text may be 'more' skewed. i.e. rather than a rotational deskew a 'sliced' deskew. This needs to be done from the top to the middle and the bottom to the middle.

    If you have a good 'shape' of the page, and know the true size of the page, you can use a kind of morph operator to morph the corners back to th eright position and hope the image follows.

    Using a Greyscale/colour source will work better than a black and white source image in general.

    the other option is if the scanned / photoed page is actually of reasonaly good quality but if just a bit squint, then OCR it to a PDF and generate a new document using the OCR text, which will be pin sharp accurate, compress a lot better and be easier to use, although may not be ideal if there are too many errors.

    1. Re:ahhh - book scanning by DingerX · · Score: 1

      Yup. They want an email address just to transmit a PDF info sheet (I'd apologize now to the sysadmin at mailer-daemon.styx.hel, but I have a funny feeling it's not going to help me much in the long run), and don't give the price anywhere = either $2k or (more likely) the mission-critical purchase price of $50-100k.

    2. Re:ahhh - book scanning by Anonymous Coward · · Score: 0

      $2k is awful cheap for an "if you have to ask..." price. In the cases I have asked the price has always been at least $10k for the most basic version. It was a different niche market, though.

    3. Re:ahhh - book scanning by edschurr · · Score: 1

      A 6mm 12"x9.5" piece of regular (not anti-glare) glass costs less than $10 to get cut. However, I haven't figured out a good way yet to use the glass without damaging the book's spine (while also operating the entire contraption quickly). That's why I want to use "heavy" image manipulation myself.

  9. Re:And you need a GUI why? by Teun · · Score: 1

    Command line!, it looks like you revel in illegal activities...

    --
    "The likes of Facebook and WhatsApp are free to those whose privacy is of zero value."
  10. Some cheap cameras can do it by has2k1 · · Score: 1

    I borrowed a simple 5 Mega Pixel camera from a friend 3 years ago and it had a built-in flat picture taking mode. It was one of the common brands probably a cybershot (though not really sure).

    1. Re:Some cheap cameras can do it by JavaManJim · · Score: 1

      That's a Nikon CoolPix 5200. p38 in the manual.
      "Scene Mode" "Copy"

      "Copy provides clear pictures of text or drawings on a white
      board or in printed matter such as a business card."

      "Colored text and drawings may not show up well in the final picture"

  11. a nifty new program by Anonymous Coward · · Score: 0

    There's this nifty new program that just came out that's called Photography Shoppe. It lets you adjust the skew on images, slong with a whole bunch of other things. It's one to watch out for. It'll be really popular some day!

    1. Re:a nifty new program by Anonymous Coward · · Score: 0

      Sounds great. One second per page to photograph, one minute to edit. I know an even better way: Print out the photographed pages, bend them until the image looks flat and photograph them again.

    2. Re:a nifty new program by Anonymous Coward · · Score: 0

      Guess you aren't aware that many actions can be scripted. Quickly click and define the page borders, run action to fix.

    3. Re:a nifty new program by DingerX · · Score: 1

      Speaking of which, how do you automate a task in GIMP? In Photoshop it's dead easy: just hit record, type in a name, and you're off.

    4. Re:a nifty new program by Anonymous Coward · · Score: 0

      You have to write it in Script-Fu. Basically you need to learn a mini programming language to get it to work. Fine if you're a programmer. Not good if you're a graphics person. But don't expect them to ever change that.

    5. Re:a nifty new program by doti · · Score: 1

      The macro recording is more or less planned for GIMP 3.0

      http://www.gimp.org/docs/userfaq.html#Macro

      --
      factor 966971: 966971
    6. Re:a nifty new program by Taxman415a · · Score: 2, Informative

      Basically you look up the internal command names such as file-jpeg-save and what arguments it takes then program either a plugin for gimp or a non interactive script. You can do it in scheme which they refer to as script-fu, or you can write them in python and it's called python-fu. The former is lightly documented and the latter barely at all. The only way I've found to look up all the command names for the python interface is to run gimp, then go to filters -> python-fu -> console then hit the browse button. But yeah basically it's either learn to program in scheme or python at this point.

  12. Viewing images on iPod Touch by Anonymous Coward · · Score: 0

    I scanned my books in so I did not have the page curve problem of davidy, but when I loaded the images up through iPhoto to my iPod they were compressed further and unreadable.
    Anyone have any suggestions on what I did wrong?
    Sorry for the question hijack, thanks!

    1. Re:Viewing images on iPod Touch by Anonymous Coward · · Score: 0

      Don't use iShit. (bad experience with iunes wiping one of my ipods and automatically starting to convert my files into m4a without asking)

      For more helpful advice: I would look at the settings, to try and turn off auto compression, i would hope it is in there, and then try again. Unless it compressed the originals, then you are out of luck, and have to scan them in again.

    2. Re:Viewing images on iPod Touch by Anonymous Coward · · Score: 0

      You did wrong when you put them in the iPod, apparently. I'm not familiar with iPhoto, but I suspect that it knew the destination was an iPod so it resized the images for an iPod's screen. Hijacking back to the original topic: Perl seems able to do anything; maybe there's a Windows GUI interface which can be used as a front end for a command line tool.

  13. Look at some Google books by frovingslosh · · Score: 3, Insightful

    I remember reading about Google's technique of converting books to e-books

    My suggestion is that you look at some of the Google books that are on-line. I have, and they show the problems that you mention and more, curved pages, dark areas, and even text that is distorted and harder to read than most captchas. Whatever you have read (and yea, I remember reading it too), it doesn't seem to actually be viable in practice. Sure, photographs are easier than scanning, particularly if you do it fast and cheap, but the result is poorer. If you can scan the book without damaging them I suggest you go back and do that.

    --
    I'm an American. I love this country and the freedoms that we used to have.
  14. Re:And you need a GUI why? by DragonMantis · · Score: 0

    What about Unpaper in a VM so that it has a GUI?

  15. Solution by Anonymous Coward · · Score: 0

    Learn to use the command line.

  16. Depending on the book of course... by FlyHelicopters · · Score: 1
    Cut the spine off and run the pages through a scanner's page feeder... OCR if desired, then enjoy...

    I'm surprised that Google doesn't do this, it would be SO much faster than scanning each page one at a time.

    Another option, see if Amazon sells the book in digital format. Sometimes a few bucks saves a world of headache.

    Now if these are expensive textbooks or reference books, or don't belong to you, the above may not apply, just my first thought on the subject.

    1. Re:Depending on the book of course... by s4m7 · · Score: 1

      I'm surprised that Google doesn't do this, it would be SO much faster than scanning each page one at a time.

      Except that they use high-quality book scanners that can go through a hundred pages in a few seconds flat, and it would cost a fortune for them to do it this way, and no libraries would let them touch any archival materials which is half the point.

      --
      This comment is fully compliant with RFC 527.
    2. Re:Depending on the book of course... by Anonymous Coward · · Score: 2, Funny

      Cut the spine off and run the pages through a scanner's page feeder... OCR if desired, then enjoy...

      I'm surprised that Google doesn't do this, it would be SO much faster than scanning each page one at a time.


      Yeah, I don't see why they don't just slice the spine off that one-of-a-kind 16th century book so they can scan it in. That's such an easier way to do it. And I have NO idea why a library would have an issue with that.



      Are people around here really this dumb?

    3. Re:Depending on the book of course... by HappyEngineer · · Score: 1

      Cut the spine off and run the pages through a scanner's page feeder... OCR if desired, then enjoy... I'm surprised that Google doesn't do this, it would be SO much faster than scanning each page one at a time.

      You just described a major plot point in Vernor Vinge's book "Rainbows End" except that instead of cutting off the spine they actually used paper shredders to cut up the books and then used computers to put the books back together. It was incredibly fast (no need to cut spines and feed pages, just shred the book and suck the results in through a hose and let the computer scan the particles as they fly past.

      The downside was that a small subset of people wanted physical books to stay around and were willing to perform sabotage where necessary in order to prevent the destruction of the books.

      If google did this they might just have to face a cadre of dangerous rogue librarians!

  17. Casio Exilim digital cameras built-in mode by AwaxSlashdot · · Score: 3, Interesting

    Digital cameras from Casio (Exilim serie) have a dedicated to take picture of sheets of paper, whiteboards and visit cards. It detects the content boundaries, crops and unskew it. You could also save time (and money since time = money) by looking for and buying the electronic version of the book you want to read.

    --
    Sig (appended to the end of comments you post, 120 chars)
  18. Use a homemade book scanner. by s4m7 · · Score: 4, Informative

    If you have ~$300 to drop on the project, Make has plans for a nice book scanner: http://blog.makezine.com/archive/2009/04/how-to_book_scanner_on_the_cheap.html It seems to hold the pages at an angle so there's little-to-no distortion on the page.

    --
    This comment is fully compliant with RFC 527.
    1. Re:Use a homemade book scanner. by DingerX · · Score: 1

      Awesome. Can I borrow your tools?

    2. Re:Use a homemade book scanner. by s4m7 · · Score: 1

      I don't even let people I *like* borrow my tools.

      --
      This comment is fully compliant with RFC 527.
    3. Re:Use a homemade book scanner. by Anonymous Coward · · Score: 0

      Good idea. Might work better and be easier if everything was inverted. I.e. book on top of an inverted glass V . . . each page will then always be as close to the same position relative to the camera as possible, with the same distortions.

    4. Re:Use a homemade book scanner. by edesio · · Score: 1

      Not a great idea. You will have to lift the book to change pages.

    5. Re:Use a homemade book scanner. by RickTucker · · Score: 1

      I've seen an inexpensive scanner designed for books that scans to the edge, the Plustek OpticBook. Anyone have experience with it? I may buy one.

  19. Why Bother by Anonymous Coward · · Score: 0

    By the time you've spend all this time digitizing, couldn't you have just read the book?

  20. Decapod project by phr1 · · Score: 1

    Take a look at decapod-project.org for a complete system. Note that software dewarping is quite a hard problem, but it is part of decapod.

    1. Re:Decapod project by Anonymous Coward · · Score: 0

      Note too that decapod is using FOSS so it won't cost anything but your time to get setup :)

    2. Re:Decapod project by Anonymous Coward · · Score: 0

      Yes, DECAPOD is intended to address this problem. A first release should be available some time next year.

  21. If you had a Mac... by ctmurray · · Score: 1

    You could consider Prizmodo. It includes OCR should you desire for $40.

  22. Hugin could help by Graemee · · Score: 4, Interesting

    http://hugin.sourceforge.net/tutorials/scans/en.shtml Here's a tutorial to stitch scans together the slight curve of the page is minimized where the scan joins. Might be what you are looking for.

  23. Here you go by arndawg · · Score: 2, Funny
  24. Hmmm by Anonymous Coward · · Score: 0

    What is this PDA of which you speak?

  25. Re:And you need a GUI why? by Anonymous Coward · · Score: 0

    What is even sadder is that in 2009, you HAVE to resort to the CLI for anything if you prefer not to. CLI is often faster, granted. I develop under linux every day and could not imagine not having a CLI, but if I prefer not to type to run a program dealing with images, I should not need to. I would expect typing for a GUI, but I should not have to type to resize and distort images. The fact that you complain about the windows jockeys who want the GUI is exactly the reason Linux is an OS for hobbyists, and not a mainstream desktop alternative.

  26. Re:And you need a GUI why? by DragonMantis · · Score: 0

    I happen to like Linux, but GUI's are convenient -- particularly for images, as you said. Personally, I'd like it if a few more of the Windows configs were easily editable plain text. But you are also right, to *need* to use a text edit or a command line is different than choosing to do so.

  27. What does "and patented" have to do with it? by wjsteele · · Score: 0

    Really, if you are doing this for yourself and have no intention of selling your product, then you are free to use their method all you want. Patent's protect the original inventor from companies who would profit from that invention. If you are not profiting (in ANY way except knowledge) then you are free to build your own system. There are several instances of this, for example the "balancing scooters" that resemble the Segway. There are even plans and software available on the internet for building your own. Bill

    --
    It's my Sig and you can't have it. Mine! All Mine!
    1. Re:What does "and patented" have to do with it? by purduephotog · · Score: 1

      Exactly.

      How you stitch together a book doesn't matter if it's for your own personal use.

      Now, if you want to go commercial you've got quite a few things to figure out.

      Regardless, what you're wanting to do is basically orthorectification. There is an open source package out there that does that. Figuring out how to do so would be left to you, but I'd recommend using some sort of yellow projection grid (or red from a red laser) to map the distortion and correct it by treating it as a DEM.

      Poor man method- so long as you only want bw scans :)

    2. Re:What does "and patented" have to do with it? by Dachannien · · Score: 3, Informative

      Really, if you are doing this for yourself and have no intention of selling your product, then you are free to use their method all you want.

      35 U.S.C. 271 (a) Except as otherwise provided in this title, whoever without authority makes, uses, offers to sell, or sells any patented invention, within the United States, or imports into the United States any patented invention during the term of the patent therefor, infringes the patent.

      Yes, it's extremely unlikely that anyone would ever sue you for infringing a patent in the privacy of your own home because the damages would be minuscule and it would be very difficult to prove infringement, but it's still an infringement.

    3. Re:What does "and patented" have to do with it? by Anonymous Coward · · Score: 0

      That seems so wrong. So you could do something in the privacy of your own home without affecting anyone else and be in breach of the law? Words fail me.

    4. Re:What does "and patented" have to do with it? by KnownIssues · · Score: 1

      That seems so wrong.

      At first, I thought so too. Then I started thinking of it in terms of, "if I assume it is right, how could it be right?" If I copy a book or CD, I can buy the original, make an exact copy, and use that copy to avoid damaging the original. The writer of the book has lost no money on a sale.

      A patent is essentially a free blueprint to build something. I can see why, if I invented something, I wouldn't want millions of people to be able to build for free what I invented in their own homes.

    5. Re:What does "and patented" have to do with it? by Anonymous Coward · · Score: 0

      What if I want to treat it as a REP?

  28. OCR? by Anonymous Coward · · Score: 0

    Wouldn't OCR (http://en.wikipedia.org/wiki/Optical_character_recognition) be a much more practical option for this?

    You could easily put it on your PDA, etc. and wouldn't have to worry about geometrically correct pages or reducing flashes.

  29. Allow me to extract the informative part of this.. by plasmacutter · · Score: 2, Insightful

    Unpaper may work for you if you're not afraid to deal with a CLI.

    There's no harm in giving it a look. Assuming it's properly designed I can see it being quite elegant.

    --
    VLC FOR MAC IS DYING! IF YOU DEVELOP, PLEASE SAVE IT!!
  30. How about a $300 home-built scanner? by plover · · Score: 5, Informative

    Some guy posted a great instructables on building your own high speed book scanner, purposely designed to rapidly photograph book pages without curves. He even includes a software stream that OCRs the contents and sticks them into PDFs.

    It's been quite popular -- so much so that he's created an online forum at http://www.diybookscanner.org/ dedicated to discussions from DIY book scanners all over the place, where they talk about builds, parts, and software.

    I've been very tempted to build one myself just to avoid carrying heavy books around in my backpack.

    --
    John
    1. Re:How about a $300 home-built scanner? by Anonymous Coward · · Score: 0

      This is a very cool and ambitious project. Not a trivial build, just complex.

      Now that Instructables has gone to a pay for download content model, maybe someone with a pro account wouldn't mind hosting the PDF of the build somewhere? (Apparently sharing is perfectly withing the realm of their TOS...)

    2. Re:How about a $300 home-built scanner? by MisterSquid · · Score: 2, Interesting

      Something the narrator in the bookscanner video said at the end of his video really resonates with me, which is that

      powerful tools invariable have powerful effects. If everyone had a book scanner in their home, it might force the publishing industry to rethink an obviously broken business model.

      Lately, I've been posting like crazy about digital print and related topics such as the conversion of paper print to digital audio. Google started their project several years back and publishers are suing to stop the threat to the paper print business model represented by a millions-of-volumes digital book corpus. If these publishers (copyright holders) are successful, it won't be too long before efforts such as the one depicted in the Instructables video are multiplied on a mass scale, what Clay Shirky calls the mass amateurization of a formerly professional field (book publishing).

      My best guess is that media incumbents will not adapt, and that what happened to the music industry will happen to all media incumbents. It's not that publishers can't change, but rather something about their culture, their collective belief in entitlement and the "rightness" of legacy media structures prevent them from pioneering the transition.

      You can't say we didn't try.

      --
      blog
    3. Re:How about a $300 home-built scanner? by plover · · Score: 1

      Follow the http://diybookscanner.org/ link. He says he's migrating it to there.

      --
      John
    4. Re:How about a $300 home-built scanner? by AmiMoJo · · Score: 1

      I think the day when eBook readers can flick pages in 0.1 seconds is the day when the shit really hits the fan for publishers. Trying to find information in textbooks when it takes 2 second per page turn is not very practical at the moment.

      People will want to buy the latest Stephen King novel, but they don't really care about the publisher at all. If they can get it directly from his web site, they will. At best there will be a load of out-of-work editors willing to work on a per-book contract.

      The key is going to be for publishers to realise what the real value of their product is - you get a (hopefully) concise, checked and edited text written by an expert. Obviously that applies to textbooks rather than novels. It is more valuable than the same information on the internet, which could be spread around different sites and is hard to verify. People will pay for that.

      --
      const int one = 65536; (Silvermoon, Texture.cs)
      SJW, n: "Someone I don't like, and by the way I'm a fuckwit" - AC
  31. Much faster than scanning? by MartinSchou · · Score: 1

    Does that include the time needed to now fix the artefacts that scanning doesn't get you?

    If you have a scanner, then why don't you just use that? And if you do not have a scanner, why even bother with the speed comparison and not settle for "I don't have a scanner"?

    There's a reason that scanning takes time compared to just pointing a camera at a book and snapping a picture. You've now found one of those reasons. Congratulations.

    Now you just have to find out if the up front time savings are greater than the post processing time costs, and since you're going to spend time reading through Slashdot and trying out the suggestions, I'm going to say "no, the savings aren't bigger". As someone else said, even Google's scanned/photographed books have issues, so don't be surprised when you do as well.

    1. Re:Much faster than scanning? by Anne+Honime · · Score: 1

      It is, but you need a (possibly home made) reproduction stand, uniform lightening (not necessarily colour corrected, if you intend to go grayscale only in the end, can be home made too) and a V shaped holder for the book. Be sure your CCD plan is mostly parallel to your page plan.

  32. I want I want by ethicalBob · · Score: 2, Insightful

    I want a solution to my problem that lets me be incredibly lazy, so I don't have to scan the books in (which i know will work VERY well if I just take the time to do it)...

    I want software that will do it. For free.

    Can I do it without a camera, too?

    Actually I'd like it if there were some way I could get paid for using the software.

    Can i just put my iPhone/PDA on the book and have it all sucked in via osmosis?

    and then have the book read back to me w/ Morgan Freeman as the narrator?

    Is there software that will turn the pages for me too? Oh wait - Morgan is going to narrate for me, that's right...

    (sigh)

    I know I'm probably getting modded Troll for this one; but there isn't always always an easy (magic!) software solution for every little thing. Sometimes you still have to put the work in if you want quality.

    --
    Politics will sooner or later make fools of everybody... - Dick Armey
    1. Re:I want I want by LihTox · · Score: 4, Insightful

      It never hurts to ask; if there were an easy magic software solution to do X, wouldn't you rather find out about it now, instead of after doing X?

    2. Re:I want I want by Odinlake · · Score: 2, Insightful

      This guy didn't ask for 'magic' but for software to perform a conceptually simple task. He got two or three concrete examples above. There are nuttier posts on /. you can go and make fun of.

    3. Re:I want I want by m.alessandrini · · Score: 0

      I wonder I wonder.... Why would a Slashdot section called "Ask Slashdot" be there in your opinion?

    4. Re:I want I want by noidentity · · Score: 1

      I know I'm probably getting modded Troll for this one; but there isn't always always an easy (magic!) software solution for every little thing. Sometimes you still have to put the work in if you want quality.

      Good thing we haven't had any enlightened lazy people in the past who actually solved problems similar to this. If they had, we'd be able to keep things cool without having to constantly refill our iceboxes, calculate things without having to do them by hand or mentally, or even have a machine write things for us just by pressing keys. Oh, wait...

    5. Re:I want I want by Anonymous Coward · · Score: 0

      This is why I never, ever use existing software to get my work done. I just start from the basics. So what if someone already did it? I didn't invent that wheel, mister.

  33. Try a heavy piece of non-glare glass by KharmaWidow · · Score: 2, Insightful

    Try a heavy piece of non-glare glass

    1. Re:Try a heavy piece of non-glare glass by Anonymous Coward · · Score: 1, Insightful

      or try a flatbed scanner instead of a camera

    2. Re:Try a heavy piece of non-glare glass by thexile · · Score: 1

      Try not reading the book.

    3. Re:Try a heavy piece of non-glare glass by GigsVT · · Score: 1

      You could hire an illegal immigrant midget to follow you around and carry the book.

      --
      I've had enough abrasive sigs. Kittens are cute and fuzzy.
  34. i remember and instructable... by Anonymous Coward · · Score: 0

    i never built one of these, but if you have time, this might be your answer.

    http://www.instructables.com/id/DIY-High-Speed-Book-Scanner-from-Trash-and-Cheap-C/

  35. Try using one of the Planon Pen Scanners by ArcticBirdman · · Score: 2, Interesting

    I went out and got one of the Planon Pen Scanners. Depending on what you want, they can go all the way to 600 dpi in color. Check out their web site. You can also get bargains on refurbished earlier models if you only need B&W and up to 200 dpi. http://www.planonstore.com/SearchResults.asp

    1. Re:Try using one of the Planon Pen Scanners by ZERO1ZERO · · Score: 1

      Yes. Forgot about these. They are pretty neat for just getting a couple of pages out a book. Scan quality is pretty good aswell

  36. patented tech is teh evil by Kohath · · Score: 1

    Of course, neither his digital camera nor his PDA use any patented technology.

  37. Re:And you need a GUI why? by Neil+Hodges · · Score: 1

    I don't know; for many purposes ImageMagick is perfect for my needs. Unlike the GIMP (as of 2.6.7), it supports 16-bit channels in images, which are useful for grayscale work on my side (256 grays is too limiting). It's also a lot faster for doing work over a lot of images in sequence.

  38. Using a camera would be MUCH better. by Futurepower(R) · · Score: 1

    Also, those scanners are VERY expensive. Using a camera would be MUCH better, if the problems can be solved. It doesn't matter how big the camera images are, since the ABBYY FineReader PDF-making software we use OCRs the image and makes searchable PDFs.

    The Fujitsu fi-6230 Sheet-Fed and Flatbed Scanner gets good reviews and the flatbed scanner is fast, but it costs $1,200, and the sheet-fed and flatbed scanners are weirdly and unnecessarily connected.

    Less expensive Fujitsu scanners lack TWAIN or ISIS driver support. Fujitsu uses proprietary drivers for the less expensive scanners, meaning that it can make them obsolete for some future operating system merely by not providing drivers.

    The Fujitsu ScanSnap S1500 Instant PDF Sheet-Fed Scanner is excellent for what it does, we have one, but it doesn't do books, of course.

  39. Low nerd-factor approach by Cruciform · · Score: 1

    Not as fun as figuring out some massive kluge to do the job, but if it's a book that you can easily find used copies of just cut the binding off or remove the pages with a razor blade, and photograph them flat.
    Some of the ebook-torrent scans use that method. It destroys the book but makes for cleanly readable scans.

  40. where are my mod points when I need them ? by Anne+Honime · · Score: 1

    Hugins certainly can do what's required. But OTOH, it can be painfully slow, so beware. A few seconds of attention when taking the pictures can save hours of struggle with any software later in the post-processing.

  41. How about putting the book under glass? by jafo · · Score: 1

    I would think you could correct for many of these issues by laying down several inches of foam, putting the book in place, and then pressing it down with a sheet of glass. Maybe with one more piece of foam to prop up the short side when you are at the beginning or end of the book. You know, kind of like what a scanner does...

    Of course, you'll still have to deal with lens effects like trapezoidal issues, or skew, but these I imagine are much easier to deal with than curl.

    Personally, I've switched to mostly reading Gutenberg books, there's a lot of good stuff there. I've finished Adventures of Sherlock Holmes, Around The World in 80 Days, and am working on A Study in Scarlet. They are books I can get in the format I want, at 3am when I'm looking for a new book to read before going to bed. I realize you may need books like textbooks that aren't available there, but if you have the option...

    Sean

  42. OpenGL by Ironsides · · Score: 1

    Load your raster files into OpenGL, apply them as a surface, then warp the surface until it appears flat to you. This is nothing more than what Google is doing automatically with lasers.

    --
    Fly me to the moon Let me sing among those stars Let me see what spring is like On jupiter and mars
  43. use a calibration sheet by Tristfardd · · Score: 1

    For any given book it is not that hard to make a thin transparent sheet with lines on it. Lay out the lines in such a way that you can put the sheet on a page and the lines will not overlay the text. Since the sheet will conform to the page, you just need to have software that straightens out the lines and thus straightens out the text. The sheet needn't have a lot of lines, just enough to let the software work. It is easy enough to remove the lines in the final image.

  44. In Russia, book scans you! by Anonymous Coward · · Score: 0

    The best resource I've found for book scanning is (use Google translate). In particular, the ScanKromsator software does pretty much everything, and does it well.

    1. Re:In Russia, book scans you! by Anonymous Coward · · Score: 0

      Sorry, link disappeared - should be http://www.djvu-soft.narod.ru/

  45. Sorry, but... by Anonymous Coward · · Score: 0

    >Is there any home user software that can do a similar job without the
    >need for a laser overlay or other sophisticated (and patented)
    >technology?"

    I generally try to help out instead of criticizing others, but...
    Google almost always uses existing technoloy when it fits the bill. The fact they they spend the time/effort/money to develop their book scanning technology means they had to... so no, there is nothing acceptable on the market, much less for free.

  46. Lens by danahyatt · · Score: 1

    I do the same but use the pages for references. Sometimes I up-load pages to EVERNOTE for searching on words, like indexing. THE ANSWER TO YOUR QUESTION IS THE LENS SHAPE. The lens is concave and gives you a sloping-up contrast on the edges. In a software program one will write a logarithm to flatten the lens. However, if you roll-up the page's sides before taking the picture it will work too.

  47. Great Information! by cjzlducls · · Score: 1

    do these programs have the "FIND" option? like on the MicroSoft programs??? that would be awesome! I really love "FIND" options on the programs;)

  48. PDF v. paper contracts by gd2shoe · · Score: 1

    ... even contracts are scanned and the original destroyed as legal has deemed that a PDF scan of a signed document is as legally binding and secure as the actual paper.

    Wow, you have a dumb legal department.

    It is "as legally binding" only if it can be used to coerce the other party into admitting that they signed the document. A wise but immoral signer could take the opportunity to say they signed something else and that you must have manipulated the file. There's nothing you can do to prove that you didn't. At best you can only show that it is unlikely that you did. I hope they keep the physical paper for any truly important contracts they have.

    --
    I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.
    1. Re:PDF v. paper contracts by Anonymous Coward · · Score: 0

      Wrong. A scan of a signed document is just as legally binding as the original paper.

  49. QT3 by gd2shoe · · Score: 2, Informative

    No need. At a quick glance, Scan Tailor is programmed in QT3 (a superset of C++, used by KDE). This is a multi-platform environment, making it very easy to fix something on all supported platforms at once. If unskew doesn't work well, then that should be addressed in both versions. Fixing the Linux version will fix the Windows version too (unless he's relying on platform specific libraries in addition to QT).

    --
    I won't join Slashcott. OTOH, If Beta goes live, I just won't be back until it's fixed. Sorry Dice.
  50. A diferent way to do this... by dalthaus · · Score: 1

    I don't know of any software that will do this as well as just biting the bullet and scanning it. If you have a scanner, it's a no-brainer. If you have access to an advanced copier, you can scan the pages to a PDF file. Used to do this all the time.

  51. So where was the magic by SuperKendall · · Score: 1

    The link was very interesting, but gave no clue as to how you take the perfect image of a book with a crease down the middle (which is the original starting point of the question), and then get an image with no crease. They demonstrated a result with no crease, that was very convincing - but I didn't see just how they got rid of it. Simple cropping of each side?

    --
    "There is more worth loving than we have strength to love." - Brian Jay Stanley
    1. Re:So where was the magic by theCoder · · Score: 1

      I don't think there was a crease in that set of images. It appeared like it because the left edge of the right image was near the edge of the scanner, and didn't get a lot of data (maybe? I'm not really sure about that). I think this tutorial is more about how to combine two scanned images that are each part of the full image. Notice how control points were dropped that indicate that the images share an overlap. They aren't two separate pages, just a left and right portion of the same page/image.

      As to the "magic" that removed what looks like a book crease, I'm fairly new to Hugin, so I don't really know all the details, but I think the "stitcher" program that puts together all the images doesn't use a "first available" method (like stacking pictures on top of each other), but actually uses the dynamic range properties of each image in an area to determine what to put in the final picture. Thus, because the image "underneath" had better dynamic range than the image "on top", that information contributed to the final product. The preview window (with the artifact) doesn't do that, most likely because it's too computationally difficult to do on the fly.

      That said, I've found that this can make output product appear blurry if the alignment isn't spot on. I don't know if there's a way to disable this compositing feature and go with a simpler first available method.

      As to the original question, Hugin may be very useful because it allows you to straighten out pictures taken at a skew. It probably even allows you to straighten parts of the image in relation to others (like part of the page bending to the inside of the book). The disadvantage is that it doesn't seem very automated, so it may take some time to layout control points and do the processing. An advanced user may be able to figure out how to use the underlying tools (autosift, nona, enblend, etc) to do the work in an automated script, though.

      --
      "Save the whales, feed the hungry, free the mallocs" -- author unknown
    2. Re:So where was the magic by Wodin · · Score: 1

      See also this LWN article: http://lwn.net/Articles/351053/

      He shows how he straightened a picture he took and also mentions the SIFT algorithm which is disabled by default because of software patents. He says:

      Your editor, being a daring sort of person, decided that he wanted to find out just what sort of functionality is being denied to hugin users by the oppressive US software patent regime. As it happens, Fedora users can get around patent-based repression by installing the autopano-sift-C package from the rpmfusion repository and tweaking the program preferences to use the real autopano tool. The difference is striking: with autopano-sift-C installed, the program proceeds immediately from image selection to a preview window; the whole "control points" and "optimization" process just sort of goes away. This package does a great job of finding control points, at least on your editor's sample image set.

      --
      -- Wodin
  52. for books you can moderately mangle by uniquegeek · · Score: 1

    For some cases, this might be suitable:
    i) take the book into a small print shop and have them chop off the spine with a paper guillotine (about $2)
    ii) scan the pages through a high-capacity feeder (or just scan individually at home)
    iii) re-bind the book with a plastic coil (about $2)

    I do steps i) and iii) with all my music books so they lay flat while I'm trying to read them.. Might do it with some textbooks for a course I'm taking next year, too.

  53. What you get out of it by Triv · · Score: 1

    I'm sorry but this question is just ridiculous, or if not actually ridiculous then wasteful.

    Let me see if I understand this correctly: The author has photographed a selection of pages from a book that he wants to have available to read at his leisure. He doesn't want to carry the book around because it's too heavy, so what he wants to do, because the photographs aren't as pretty as scanned copies but scanned copies have artifacts of the scanning process, is to develop a home rig to scan the pages he's interested and run them through some piece of software to straighten them out before loading them onto a pda so that they're available whenever and wherever he wants them.

    Right?

    Here's my issue: This is a problem that's been solved by a library, a photocopier and a pocketful of nickels for 30 years - no matter how efficiently he manages to get his whole basement book scanning setup working (which, let's be honest here, is going to be a mess of trouble and a timesink of not insignificant proportions) the benefit (PDA-stored documents) can't possibly be outweighed by the hours he'll end up spending researching and implementing a system that can't compete with a manila folder with a stack of photocopied pages in it.

    I admire the thought experiment, and I bet there're a whole bunch of nerds here who are going to get all in a tizzy about cameras and lighting and whatnot, but this, and I mean it kindly, is a waste of time.

    And if you think photocopying is too time-consuming, suck it up and bring the book with you - there are worse fates than slinging a backpack over your shoulder.

    1. Re:What you get out of it by o'reor · · Score: 1
      Let me rephrase this in caveman terms:

      Caveman Gaor hears fellow caveman Ourgah has piled buffalo meat on few branches tied together.

      Ourgah asks if there be some invention he calls "weal" to fix under his brancard and carry meat around with no effort.

      Gaor says problem solved long time ago: just pick a big piece of leather, wrap the meat in it, carry it around on your shoulder. Problem solved. Anything else, waste of time.

      There be worse fates than carrying heavy bag on your shoulders.

      Did I miss something ?

      --
      In Soviet Russia, our new overlords are belong to all your base.
  54. There is Prizmo if you are using a Mac by rse_infinity · · Score: 1

    http://www.creaceed.com/prizmo Prizmo straightens perspective, does lens distortion correction, and OCR. Book curvature straightening is a feature we'd like to have not too far away...

  55. Orowan by orowan · · Score: 1

    Google - and libraries - use copiers designed to allow you to photograph a book page without mangling or destroying the book, and with the ability to flatten a page so that the resulting copy is clean. These copiers aren't hugely expensive. If someone can build one on their own, as referred to in an earlier post, that's probably the best solution.

  56. Try PostProcessor by Anonymous Coward · · Score: 0

    I haven't tried the software, but it's recommended by the guys at "make" zine who assembled a DIY book scanner:

    http://diybookscanner.org/forum/viewtopic.php?f=3&t=34

    Alejo

  57. eBook? by BlueF · · Score: 1

    Are we talking books that are not available in eBook format? Wouldn't it be exceptionally easier to buy ebooks?

    I imagine the time required to photograph/scan a book of any length, as well as searching for and tweaking a application to correct these images is significant. Say we're talking a PDA which doesn't support common ebook formats, in that case wouldn't it be easier and much better quality to convert ebooks into a compatible format (even if removing DRM was required). That, or drop $100 on a user eBook. Finally, if money is the concern, where time is not, there's always the download route. Again, unless we're talking books where someone hasn't already done all the work to scan to nice electronic format...

    All that being said and done, best of luck. Sounds like an interesting project for it's own sake.

  58. Bkrpr.org by Anonymous Coward · · Score: 0

    Photograph them with a rig from bkrpr.org. Plans are there to build one yourself on the cheap.

  59. ABBYY FineReader? by grikdog · · Score: 1

    A bit expensive, but an accurate pdf (jpg, etc.) to text OCR. Urban legend has it, this is old Russian software developed by the KGB, and back in the Yeltsin era there was a free loss-leader demo making the rounds for Macintosh. I doubt that's still available, but even at $400 a pop, ABBYY FineReader is pretty good stuff, IMHO.

    --
    ``Tension, apprehension & dissension have begun!'' - Duffy Wyg&, in Alfred Bester's _The Demolished Man_
  60. Just use an '09 Malibu... by Anonymous Coward · · Score: 0

    Surprisingly enough, an '09 Chevy Malibu seems to be able to flatten things you wouldn't even dream they possibly could.

  61. Don't forget the copyright issue by Dj_fishlover · · Score: 0

    If any program or technology makes it too easy to copy parts or entire works of literature, and many people start to do this, the book industry will start organizations that work against this, and try to ban programs, issue campaigns to tell the public what is morally right and in the end, try to limit your (or copyright infringers) access to public libraries.

  62. UGGGHHH by PhreakinPenguin · · Score: 1

    So you took pictures of a book instead of scanning and now you want software to flatten the images like a scanner would? /facepalm

    --


    My sig of choice is Marlboro
    1. Re:UGGGHHH by Anonymous Coward · · Score: 0

      > So you took pictures of a book instead of scanning and now you want software to flatten the images like a scanner would? /facepalm

      Yes, that's what he wants. We know it's possible, so what's wrong with asking how it can be done?
       

  63. Re:And you need a GUI why? by hmar · · Score: 1

    right click the network interface and choose "repair". It won't do all interfaces at once, but it combines release and renew into a single step.

  64. For $deity's sake by mrjb · · Score: 1

    Visit freecycle/freegle and arrange a free scanner instead of using the wrong software to fix problems caused by using the wrong hardware. Seriously. You'll be glad you did. In fact, I have a redundant scanner here, which I'll gladly mail to you at no more than shipping costs.

    --
    Visit http://ringbreak.dnd.utwente.nl/~mrjb/growingbettersoftware to download your free copy of the book
  65. how to photograph a flat page well.... by seekertom · · Score: 1

    Easiest way is just keep the page flat! For many cases, a thin piece of metal, plastic etc clipped behind the page being photoed would do it. Of course it's work, but if there are just a few pages, it's worth the effort. 'sides, it works and it's cheap! In case you aren't home where the metal is, just use another book's cover. thanks for lis'nin' seekertom

  66. scanning,saving,converting files...... by ccrasher · · Score: 1

    The best solution that I have found after long/painful research is to use ABBYY OCR software for OCR and conversions and then finish the file (if needed) with either Photoshop or your preferred Adobe application. ABBYY full version also includes the best screenshot reader I have ever used. The program can scan to ocr, an image file, a pdf, a word or exel document and a few others. If you captured the pages with a camera, you can flatten(optimize) the images in PS, Acrobat or InDesign if it is available for web which can take a huge file of images and compress them for web. This will create a very small file for your PDA. Keep in mind that if your objective is to create a tiny/flattened file for your PDA the pages will not be suitable for printing. For a PDA you can also reduce the files resolution quite a bit since a PDA usually only needs about 320x240 instead of 800x600 min. for a computer monitor. This is a huge tradeoff. A scan withh ABBYY can also correct the problem of page edges, dark spots etc because you choose the exact area to capture. I use a stoneage antique model Epson 2450 photo scanner to ABBYY or PS which works great for just about anything The best part about ABBYY software is that if you can see anything on your monitor, you can capture and convert it to anything you want. Program security cannot disable it.

  67. scanning,saving,converting files... by ccrasher · · Score: 1

    The best solution that I have found after long/painful research is to use ABBYY OCR software for OCR and conversions and then finish the file (if needed) with either Photoshop or your preferred Adobe application. ABBYY full version also includes the best screenshot reader I have ever used. The program can scan to ocr, an image file, a pdf, a word or exel document and a few others. If you captured the pages with a camera, you can flatten(optimize) the images in PS, Acrobat or InDesign if it is available for web which can take a huge file of images and compress them for web. This will create a very small file for your PDA. Keep in mind that if your objective is to create a tiny/flattened file for your PDA the pages will not be suitable for printing. For a PDA you can also reduce the files resolution quite a bit since a PDA usually only needs about 320x240 instead of 800x600 min. for a computer monitor. This is a huge tradeoff. A scan withh ABBYY can also correct the problem of page edges, dark spots etc because you choose the exact area to capture. I use a stoneage antique model Epson 2450 photo scanner to ABBYY or PS which works great for just about anything The best part about ABBYY software is that if you can see anything on your monitor, you can capture and convert it to anything you want. Program security cannot disable