Slashdot Mirror

← Back to Stories (view on slashdot.org)

Kinect's AI Breakthrough Explained

Posted by Soulskill on Saturday March 26, 2011 @09:58AM from the expensive-hacker-toys dept.

mikejuk writes "Microsoft Research has just published a scientific paper (PDF) and a video showing how the Kinect body tracking algorithm works — it's almost as impressive as some of the uses the Kinect has been put to. This article summarizes how Kinect does it. Quoting: '... What the team did next was to train a type of classifier called a decision forest, i.e. a collection of decision trees. Each tree was trained on a set of features on depth images that were pre-labeled with the target body parts. That is, the decision trees were modified until they gave the correct classification for a particular body part across the test set of images. Training just three trees using 1 million test images took about a day using a 1000-core cluster.'"

3 of 97 comments (clear)

Min score:

Reason:

Sort:

Sounds like vision, all right by liquiddark · 2011-03-26 10:11 · Score: 4, Interesting

Layered classification nets have always struck me as the right approach, particularly as we learn more about how human senses work - it seems like a lot of our "thinking" is done much closer to our sense organce than we might have once imagined. Interesting that the less "organic" type, decision trees, were used rather than neural nets. One wonders if maybe it was more a matter of ease of phrasing/training/debugging than of classification itself that decided which type to use.
1. Re:Sounds like vision, all right by Game_Ender · 2011-03-26 10:52 · Score: 4, Interesting
  
  Yep, it's not exactly an AI break through but it's really cool to see a practical application of machine learning in the consumer arena.
2. Re:Sounds like vision, all right by Twinbee · 2011-03-26 14:12 · Score: 4, Interesting
  
  Yes, now all they need to do is fix the lag which can be quite high, maybe even 200ms:
  http://www.youtube.com/watch?v=weZOjotbuSU
  Something really low like 16ms or better is needed so that we don't notice, according to this article:
  http://www.sussex.ac.uk/Users/km3/hfes.pdf
  
  --
  Why OpalCalc is the best Windows calc