Slashdot Mirror


Automated OCR for Forms Processing?

Oscar Carrillo asks: "We have to do a large NIH grant which collects tons of data. And much of that data is in the form of questionnaires. The forms will be available on the web, but it's mostly not feasible to have the subjects sit in front of the computer all day (not to mention that people get annoyed sitting in front of a computer all day). The study is being conducted at several universities and institutions around the country. Using Linux/JSP/Struts/PostgreSQL will take care of most of our needs. But it would save a lot of data entry, if all forms could be scanned at each site, images uploaded to the website, and then automatically put through OCR (Optical Character Recognition) to get only the relevant raw data that subjects wrote. Does anyone know of something that can handle this? Are there any open source projects that can handle this? Any good commercial alternatives?"

2 of 30 comments (clear)

  1. another lowly subject by tps12 · · Score: 4, Funny

    it's mostly not feasible to have the subjects sit in front of the computer all day

    Then I guess somebody forgot to tell my boss.

    --

    Karma: Good (despite my invention of the Karma: sig)
  2. Fla. by Strange+Ranger · · Score: 3, Funny


    Doesn't the State of Florida has a forms tallying system they're looking to unload?

    --

    Operator, give me the number for 911!