Open Source OCR Packages?
MoonCat asks: "Are there any Open Sourced OCR (Optical Character Recognition) packages available which are comparable to their commercial counterparts in terms of recognition performance? "
← Back to Stories (view on slashdot.org)
there was one project in New Zealand(they even offered a job for that but no one took it ... great shame ... one of those holes in open source apps ... just like VIDEO editing and sound sequencing .
There's Qui-ne-faut. I haven't studied it in detail, but on a trial run it was at least isolating character shapes well (though the actual identification of characters from them left much to be desired - probably there's some training needed).
Are there any "payware" OCR programs for linux either? I have a Linux supported scanner, and would like this functionality, free or not..
Of course, Open Source is prefered, but right now I don't know much about the field, so couldn't hack on it for my self very much anyway.
Blessed are the pessimists, for they have made backups.
Check out the project that reads all of our
USPS mail:
http://www.pnwsoft.com/linux.htm
I imagine John Taves, the consultant behind this
masterpiece, could tell you a thing or two about
OCR on Linux. His email address is at the bottom
of the article.
Having worked for years on OCR, trust me when I
say the success of this project is a minor miracle!
Hi all,
:-)
The department of Computer Science at Waikato University in New Zealand is offering the position. It's a great place to live, they pay well and you get to write GPL'd software. They even pay to fly you there!
If you want more information write to Professor Ian Witten and ask about the GNU OCR job! It's a University so they're not after people with 10's of years experience, they're after hackers that can get stuff done
The National Institute of Science and Technology (NIST) has been developing OCR software for some time now (mainly for reading census forms as I understand it). Anyways, since it's a public agency, the code and data they produce are entirely public domain. (At least in the US, I don't know if there are any issues outside the country) You can actually write them and they will send you a set of CDs with the source, sample data, and printed instructions on compiling and installing it. Last I knew it would build on U*IX, but looked like an awful lot of porting to go anywhere else. Oh yeah, a quick search using "NIST+OCR" turned up the site on yahoo. Good luck.
Granted, it is under Windog, but if the engine is the same, it should work beautifully.
John Gunkel