Book-Digitizing Robots
Makarand writes "Robotic digitization systems are the new help available to complete voluminous scanning tasks. Robots that can turn the pages of books and newspaper volumes and attain scanning speeds of more than 1000 pages/hour are now available. They even use puffs of compressed air to separate sticky pages!"
Actually, I've seen this robot operate in person and it is a work of art. The way the arms move makes you think it's going to rip the book to pieces, yet somehow it manages to pick up exactly one page (it detects if it's picked up two pages and drops the extra page) and flip it.
I was the lead developer for the software side that actually does the crunching on the images. However, I'm not sure exactly how much I am allowed to talk about it, so I won't. Basically, the software side of it produces PDFs, JPGs and TXT files from the OCR performed on the images.
- Tempestdata
Regarding your vinyl recording idea, couldn't you just hook up a record changer (yes, they do make these; they have a big spindle and an arm) to a DAT or similar digital recording device, and then use some audio software to cut tracks at blank space?
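For what it's worth, the "cut tracks at blank space" part is simple enough to sketch. This is only a rough illustration in Python, not how any particular audio package does it: the function name `split_tracks` and the `threshold` and `min_silence` parameters are made up for the example, and it assumes the recording has already been read into a plain list of integer samples.

```python
def split_tracks(samples, threshold=500, min_silence=3):
    """Return (start, end) index pairs for stretches of sound separated by silence.

    A gap only counts as a track break if at least `min_silence`
    consecutive samples have absolute amplitude below `threshold`.
    `end` is exclusive, so samples[start:end] is the track.
    Both defaults are illustrative guesses, not tuned values.
    """
    tracks = []
    start = None   # index where the current track began, or None between tracks
    quiet = 0      # length of the current run of quiet samples
    for i, s in enumerate(samples):
        if abs(s) >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet >= min_silence:
                # The track ended where this quiet run began.
                tracks.append((start, i - quiet + 1))
                start = None
                quiet = 0
    if start is not None:
        # Recording ended mid-track; trim any trailing quiet samples.
        tracks.append((start, len(samples) - quiet))
    return tracks


if __name__ == "__main__":
    demo = [0, 0, 900, 800, 0, 700, 0, 0, 0, 0, 1000, 1200, 0]
    print(split_tracks(demo))
```

On a real recording you would set `min_silence` to a second or two's worth of samples, and surface noise on vinyl would probably force a higher `threshold` than a clean digital source would need.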
Actually, the primary thing holding up Project Gutenberg is the Sonny Bono Copyright Term Extension Act. The copyright law was recently extended so that nothing created later than the early 1920s is going into the public domain.
There is a large body of great 20th-century works that will not enter the public domain for many years. Stuff by F. Scott Fitzgerald, Joseph Conrad, Arthur Conan Doyle, Rudyard Kipling, Willa Cather, Wallace Stevens, Yeats, Virginia Woolf, et al.
It's a shame. I actually enjoy reading literature, and I am forced to go to the library for anything newer than 1923.
Not too long ago I had to do a research paper for a college class. No big deal, I've done many of them, but I was not looking forward to this one. Well, I went to the Houston Public Library in Downtown (which I hadn't been to in many, many, many, you get the idea, years). I got the library card that gave me access to some computer terminals and the computer card catalogue. I was amazed at what they had converted electronically and at the links to other sites that had dictated material. I was also amazed that I could get all this same access from home using the information printed on the library card. So I go home (I have a Road Runner cable modem) and get to work on my research instead of being trapped in the library. I find electronic versions of lots and lots of textbooks, magazines, government docs, and many, many more. What put me a notch or two down from my high horse was that I even found that they had radio talk shows transcribed (which I used in my research paper), and that helped a lot!
There is a lot of information ALREADY converted from text and audio sources at your fingertips that was unfathomable a few years ago. And all of this is free from the public library's website (and its links to other sources). Talk about your one-stop shop.
Using air to separate and move paper is not new. Heidelberg platen presses (you may remember them from high school graphic arts classes) have had this feature for about fifty years.