Slashdot Mirror


Breakthrough In Automatic Handwritten Character Recognition Sans Deep Learning (technologyreview.com)

subh_arya writes: Researchers from NYU, UToronto and MIT have come up with a technique that captures human learning abilities for a large class of simple visual concepts to recognize handwritten characters from World's Alphabet. Their computational model (abstract) represents concepts as simple programs that best explain observed examples under a Bayesian criterion. Unlike recent deep learning approaches that require thousands of examples to train an efficient model, their model can achieve human-level performance with only one example. Additionally, the authors present several "visual Turing tests" probing the model's creative generalization abilities, which in many cases are indistinguishable from human behavior.

10 of 66 comments (clear)

  1. human-level performance by Anonymous Coward · · Score: 2, Funny

    their model can achieve human-level performance with only one example

    Yeah? Well, I've never encountered a human - myself included! - who can read my handwriting, so suck it, you AI mofos!

    1. Re:human-level performance by davester666 · · Score: 2

      Now all doctors will have to go back to school and learn how to write even worse than they do now. I presume it'll be something along the lines of:

      -just scribble for 3-4" on the paper, using a felt marker, making no attempt to actually move in the shape of any known letter
      -wet tip of finger
      -rub the scribble for several seconds

      --
      Sleep your way to a whiter smile...date a dentist!
  2. Timely discover considering nobody writes anymore by JoeyRox · · Score: 5, Funny

    Maybe they'll also invent a better way to untangle corded phone cables.

  3. Great by sexconker · · Score: 3, Insightful

    I'll never solve the new captchas.

    1. Re:Great by locopuyo · · Score: 2

      I just use a browser plugin for it.

  4. Well that's cleverer by Zero__Kelvin · · Score: 3, Insightful
    Seriously? Cleverer? Oodles ? What editor left those in the paper? Slashdot editors must be working for these guys on the side. I know somebody will say cleverer is technically correct, and while that may be true, it is a disaster aesthetically.

    "“The real inflection point in AI is going to come when machines can actually understand language,” he says. “Not just doing mediocre translations, but really understanding what you mean.”"

    Until we understand what it means to understand, how can we possibly know if we have taught these systems to understand? Even if it responds intelligibly, and what it says makes complete sense, is that the same as understanding? I suppose as Billy C. once said: "It depends on what the meaning of the word 'is' is".

    --
    Guns don't kill people; Physics kills people! - John Lithgow as Dick Solomon on Third Rock From The Sun
  5. Improvements to OCR? by pipedwho · · Score: 4, Interesting

    I hope this heralds in some significant improvements to basic OCR. It amazes me that OCR against a printed document still doesn't always yield 100% success. Even worse are OCRs on printed music manuscripts. The recognition and transcription quality is atrocious.

    And yet, these guys can recognise handwriting with incredible accuracy.

    I keenly await when these algorithms can be expanded to general OCR / document recognition. Even if there need to be specific models for each type of document.

    1. Re:Improvements to OCR? by Richard+Kirk · · Score: 3, Interesting

      Suppose you had a bit of your handwriting that you could not read. How do you figure out what you wrote. One thing that I do, and you may do too, is to try and imagine writing the thing, and work out the rhythm of what you are writing. If you can get some sense of how your hand is writing, you may see that what was a 'u', or maybe an 'n' or half of am 'm' makes sense because of the way it joins up to other stuff. We seem to have some sort of kinematic two-and-a-half axis model for writing. We use different muscles if we are writing with a pen (fingers and wrist), a blackboard (wrist and upper arm), a spray-can (upper and lower arm), or a tiny engraving tool (just fingers) and yet our handwriting remains much the same. So some computer that can try and fit the same kinematic model should make better guesses for a word it has not met before than anything that just trained on the shape.

      This does not directly transfer to OCR. If you have a page of fixed-width text, then every letter has its own little rectangle, and you can either recognize that using the traditional OCR model, or you can't. However, there is something we can do along the same lines. Suppose you have a document that you guess was rendered from PostScript. If you have a guess for a particular word, and the font it was rendered in; you could render that part of text. You can then degrade that rendered image to mimic the properties of the printing and scanning, and check the fit. The best solution will probably be the one that achieves the best fit with the shortest, and hence most probable bit of PostScript. When you have more text, you can pick up hints from the spacing, the justification, and other larger page layout structures.

      I actually worked on OCR, and tried both of these once. It might have worked with a large software team, but I hadn't got one.

  6. Oblig link: source code by Anonymous Coward · · Score: 2, Informative

    If only the /. editors would do some minimal investigation... Oh wait, this is still /.

    https://github.com/brendenlake/BPL

  7. Re:Considering nobody writes cursive anymore by richy+freeway · · Score: 5, Funny

    It's cretin, you cretin.