Slashdot Mirror


Google Releases Tesseract as Open Source

An anonymous reader writes "Google recently released Tesseract as open source. Originally developed at the HP Labs from 1985-1995, it has been touted as one of the most accurate Optical Character Recognition (OCR) programs available. Having sat on the shelf gathering dust for so many years, Google cleaned up some of the more outdated portions of the code and released it for general consumption. You can download Tesseract over at Sourceforge.

16 of 251 comments (clear)

  1. improvements by Anonymous Coward · · Score: 5, Funny

    Google cleaned up some of the more outdated portions of the code
    i.e., added AdSense to the OCR output.

    1. Re:improvements by Anonymous Coward · · Score: 1, Funny

      I hope this isn't the same OCR Google Books is using. They managed to mangle one of the most famous chapter titles in literature.

  2. Re:As much as I like open source software ... by illuminatedwax · · Score: 5, Funny

    You're right! Let us never delve into research that could conceivably overturn weak software security! Some things man was never meant to discover! Turn back, before we fly too close to the sun and our wings melt!! O, Prometheus, why hast thou given us this OCR technology??

    --
    Did you ever notice that *nix doesn't even cover Linux?
  3. Finally! by nihilatron · · Score: 3, Funny

    Now I can finally see how to tell the difference between the 'A'-ness of 'A' and the 'P'-ness of 'P'!

    (Credit to S.G.)

  4. I'm sorry Dave... by macadamia_harold · · Score: 4, Funny

    Originally developed at the HP Labs from 1985-1995, it has been touted as one of the most accurate Optical Character Recognition (OCR) programs available.

    Yeah, but how is it on lip-reading? That's when we really need to worry.

  5. Re:Hosting by larry+bagina · · Score: 5, Funny

    Yes. They need the 99.9999% uptime (6 9s) that only sourceforge can provide.

    --
    Do you even lift?

    These aren't the 'roids you're looking for.

  6. Re:Sonny Bono pwned Gutenberg by Anonymous Coward · · Score: 1, Funny
    Once all notable pre-1923 books are scanned, OCR'd, and cleaned up, then what does PG do?

    Maybe they could ... I dunno ... READ the books?
  7. Re:As much as I like open source software ... by binarybum · · Score: 3, Funny

    careful, statements like that are likely to get you voted governor in some states.

    --
    ôó
  8. HP decided to got out of the OCR business? by Frosty+Piss · · Score: 5, Funny
    In 1995 it was one of the top 3 performers at the OCR accuracy contest organized by University of Nevada in Las Vegas. However, shortly thereafter, HP decided to get out of the OCR business...

    Actually, shortly thereafter, HP decided to get out technology innovation business, and into the printer ink business.

    --
    If you want news from today, you have to come back tomorrow.
  9. W0W1 by Anonymous Coward · · Score: 3, Funny

    TH18 IS GRLAT NEWf4 FOR TH0Sj OF US U$1NZ BA) O(R RLCOGN1+ION!

    THAHKS, G00GLL!1!!!

  10. Re:As much as I like open source software ... by ajs · · Score: 2, Funny

    That's no problem! All I really need it to do is allow all of those geeks out there to share those great Playboy articles with me over p2p networks! I'm tired of just getting the filler photography! ;-)

  11. Re:Hosting by Leto-II · · Score: 4, Funny

    I think you need to recalibrate your sarcasm detector.

    --
    Do not anger the worm.
  12. Re:Music OCR by Scaba · · Score: 2, Funny
    I'm sick and tired of a piece of dust being interpreted as a meter change.

    You're just not avant-garde enough.

  13. Re:NFB owns you by maxwell+demon · · Score: 2, Funny

    Of course you can resort to other, harder to calculate questions like: "What is the answer to life, the universe and everything?" Oops, Computers seem to have become much faster since Deep Thought! :-)

    --
    The Tao of math: The numbers you can count are not the real numbers.
  14. Re:Totally OT response to sig. by Anonymous Coward · · Score: 1, Funny

    and don't forget the ADA, the Dyslexics Association of America

  15. Re:NFB owns you by indifferent+children · · Score: 1, Funny
    'm a big fan of asking the user a simple random question, such as "what is 2 + 5".

    I'm tired of all of the anti-Americanism on /. If you want to exclude Americans from your site, go ahead; but don't rub our noses in it.

    --
    Censorship is telling a man he can't have a steak just because a baby can't chew it. --Mark Twain