Slashdot Mirror


reCAPTCHA Hard At Work, Rescuing Fading Texts

sciencehabit writes "Computer scientists have developed a program, called reCAPTCHA, which is being used in lieu of CAPTCHA by several sites, to help digitize old books and newspapers. The reCAPTCHA takes entries from old and faded texts that optical scanners and digital-text readers have trouble with. So every time you solve that string of crooked letters, you may actually be helping historians digitally reconstruct a page from the 1908 New York Times." The Science Now story links to the longer and more informative article at Ars Technica. (We last mentioned this program last year — and now it's good to get some sense of how well it's working.)

5 of 112 comments (clear)

  1. DMCA Violation by Nymz · · Score: 5, Funny

    The feature known as FADING was designed to protect copyright works from being pirated by becoming illegible before the work could fall into the public domain.

  2. Re:Not new by grahamd0 · · Score: 5, Funny

    Facebook uses reCAPTCHA. I guess you can make something useful out of the millions of useless teenagers wasting their time on Facebook.

    That's not fair.

    Plenty of useless adults waste their time on Facebook.

  3. Re:Cool possible uses by burgundysizzle · · Score: 5, Funny

    Or perhaps SLASHDOT-READER:

    OVERWEIGHT

    GEEK

    SPENDS-TO-MUCH-TIME-USING-COMPUTERS

    ALL-OF-THE-ABOVE

    I fit into the category ALL-OF-THE-ABOVE. The only generalisation that is missing about slashdotters is the one about girlfriends.

  4. Recaptcha doesn't recapture context by Mumei+no+koshinuke · · Score: 5, Interesting

    When solving these I sometimes find that there's more than one possibility for an illegible word, yet I can't tell which it is without knowing the context.
    For example, in some fonts "cost" and "cast" might be indistinguishable in the image shown. But given the context of the sentence it's trivial for a human to tell the difference.
    Suppose that they found these words on which people disagreed and had another captcha system which showed the full sentence. I'd guess they could improve their accuracy significantly in this case. Since they could prescreen for ambiguous words using the current captcha system, even if fewer people were willing to solve the "large" captcha, they would still get all the solutions they needed.

  5. Use to hide your own email addy by RJFerret · · Score: 5, Informative

    You can also use reCaptcha for your own email address, and be more willing to provide it "publicly" since they'd have to answer the reCaptcha to get to the mailto... reCaptcha mailhide