Seeing the Forest For the Trees
swframe writes "A new object recognition system developed at MIT and UCLA looks for rudimentary visual features shared by multiple examples of the same object. Then it looks for combinations of those features shared by multiple examples, and combinations of those combinations, and so on, until it has assembled a model of the object that resembles a line drawing. Popular Science has a summary of the research. I've been working on something similar and I think this accomplishment looks very promising."
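For readers who want something concrete, the "features shared by multiple examples, then combinations of those features" idea in the summary can be sketched roughly as below. This is only an Apriori-style frequent-set toy and not the MIT/UCLA method. The function name `shared_compositions`, the `min_support` threshold, and the string feature labels are all assumptions made for illustration.

```python
from itertools import combinations

def shared_compositions(examples, min_support=2, max_levels=3):
    """Return one list per level of feature combinations that occur in at
    least min_support examples. Hypothetical toy, not the published model."""
    examples = [frozenset(e) for e in examples]

    def support(combo):
        # Number of examples that contain every feature in the combination.
        return sum(1 for e in examples if combo <= e)

    # Level 1: single features shared by enough examples.
    current = {frozenset([f]) for e in examples for f in e}
    current = {c for c in current if support(c) >= min_support}

    levels = []
    while current and len(levels) < max_levels:
        levels.append(sorted(sorted(c) for c in current))
        # Next level: merge two combinations from this level when the
        # union is exactly one feature larger, then keep the shared ones.
        size = len(next(iter(current))) + 1
        candidates = {a | b for a, b in combinations(current, 2)
                      if len(a | b) == size}
        current = {c for c in candidates if support(c) >= min_support}
    return levels

levels = shared_compositions([
    {"wheel", "window", "door"},
    {"wheel", "window", "roof"},
    {"wheel", "leg"},
])
```

With these three toy examples, only "wheel" and "window" survive the first level, and their pair is the only combination that survives the second.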
David Marr proposed the idea of a primal sketch as the first stage of converting the two-dimensional image on the retina to a full understanding of what is being looked at. This work culminated in a paper published in 1980 called "Theory of edge detection."
Marr was a faculty member at MIT, so it is appropriate for this work to have been done there.
For more information, see:
http://en.wikipedia.org/wiki/David_Marr_(neuroscientist)
and
http://www.amazon.com/Vision-Computational-Investigation-Representation-Information/dp/0716715678
-Todd
Omne ignotum pro magnifico.
I'VE been working on a similar project as well!
Maybe if enough of us with the same project interests get together, we can create an accurate summary of the parent!
You see? We could look at each other's projects for combinations of features shared by multiple examples, and combinations of those combinations, and so on!!??
This is amazing!
Honda just gets on with implementing it. Oh, look, it's even got an automobile analogy: Asimo just did a drive-by on your research.
If you were blocking sigs, you wouldn't have to read this.
I've heard of various other approaches -- to two different things, and I'm not sure which one the researchers are mainly going for. Is the goal here to produce a useful vision system for AI, or to get a better understanding of how the brain works? It seems like while these are compatible goals, it's helpful to distinguish them and decide which you care more about.
Revive the Constitution.
Even neural nets have to be programmed at some level to exhibit behaviour that the programmers think will allow them to learn the task at hand, unless these guys used some sort of genetic algorithm. The article doesn't mention it. Does anyone know?
Also, it doesn't explain whether the system just recognises pictures similar to what it's seen before (e.g. this picture looks like object type 123, which to a human would be a horse's rump) or whether it can combine all views of an object and recognise them all as that object (e.g. this picture looks like a horse). If it's the latter, how does it do it? Does it have to be shown the object from a large number of angles, or can it infer from a couple of angles what the object would look like from many others?
You know, two people or groups can arrive at the same conclusion, because it was obvious in the first place. And why is it so appropriate? What if the work had been done elsewhere, would that be inappropriate or offensive?
I should have been more specific in my first post.
David Marr's vision book (published in 1982 after his early death in 1980) is considered a seminal work in understanding human visual processing.
Marr was trying to describe how humans see. The new work at MIT is trying to allow computers to see. David Marr would be glad to see the developments, whether at MIT or elsewhere.
-Todd
Omne ignotum pro magnifico.
Anyone else notice how closely that line drawing of the stag resembled Palaeolithic cave art?
I certainly haven't worked in this area, but for years I have wondered how people, including fairly young children, recognize a dachshund, a bulldog and a Great Dane as dogs, and other things as goats, cats, etc. Dogs are amazingly varied in shape, size and color. It seems like a VERY hard problem.
Metaphors, i.e. legs, flat surfaces, etc.
Take the examples of a car and a horse: you could say the wheels of a car and the legs of a horse are similar because both provide functionality for movement. Even though they are differently shaped, they have similar relationships. I'm sure you could compare functions by using relationships in the same way.
Isn't this essentially the implementation of the ideas described by Jeff Hawkins in his book "On Intelligence"?
There's no way, just from looking at pictures of dogs, that you could tell they're all the same species. There are some characteristics that some breeds have in common with others (other than the obvious 4 legs etc.), but they don't all overlap. With something like this it's more than a simple case of pattern recognition: it's acquired knowledge.
Okay, program-- this is a quark. Do you see it? Do you recognize it? Great. Get to work, and I'll be back in a few weeks to see how you're doing.
UTF-8: There and Back Again
Hierarchical models of object recognition are decades old, as are attempts to implement them. This work doesn't yet work better than other engineering solutions, and it isn't obviously any more plausible than other approaches. So, it's a nice start, but it isn't a breakthrough.
I was impressed that Asimo guessed, "Maybe toy car?" when looking at the hand-sized Mini Cooper model.
I did want to see them work on overlapping categories, like: is-a(toy) and is-a(car).
So now we've got an AI that can classify Justin Bieber as a girl too!
I've abandoned my search for truth; now I'm just looking for some useful delusions.
When you try to judge people that provide information instead of flames, it comes off as incredibly pretentious.
According to the article, it performs as well as current systems. In other words, very badly.
The paper, which should have been linked in TFA from the start:
http://people.csail.mit.edu/leozhu/paper/RCM10cvpr.pdf
The main achievement claimed is that no image labeling or any additional data like viewport position was needed; the learning process was completely automated.
Interestingly enough, very young children will typically generalize and call anything with 4 legs a "doggy" until they are corrected or shown the distinctions. That ability to generalize is one of the things that makes the human mind what it is. So yeah, it's amazing that we can take an exemplar and understand that other similar things of varied shapes, sizes and colors are related, yet still understand species distinctions. My guess is that we're naturally very good at recognizing three-dimensional features from many limited viewpoints.
The sending of this message pretty much inconveniences everyone involved.
...will it sort my porn collection according to the models?
When Fascism comes to America, it will call itself Anti-Fascism, and tell you to give up your guns.
automated crappy clipart!
Can someone please post a car analogy, or reference it to Soviet Russia?
One obstacle to progress in this field is how to define an object. Are electrons, atoms, molecules, proteins, cells, leaves, trees, forests and planets all considered to be objects? And who gets to decide: a bunch of undergrad test subjects who draw lines around pictures and give names to each image segment? This algo separates objects and parts, but from what I can tell (having read the article but not the paper), there's no big reason to say one thing is an object and another thing is a part. It seems pretty arbitrary from a philosophical standpoint, but no doubt its practical value is appreciable.
He once inserted random mutations into his code, just so he could have the experience of debugging.
I wrote a solitaire player, and then I attempted to teach the computer how to play "legal" solitaire by looking at the moves. I intersected every move with every other move, and I could have gone higher (in the intersections), but my poor PC was not capable enough. This treats moves the way you might construct an inheritance tree, moving common items to a higher level in the tree.
John
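A rough sketch of the "intersect every move with every other move" idea described above, for anyone curious. It assumes each observed legal move is recorded as a set of attribute strings; the attribute names and the `common_cores` helper are made up for illustration and are not taken from the poster's program.

```python
from itertools import combinations
from collections import Counter

def common_cores(moves):
    """Count the non-empty pairwise intersections of observed moves.
    Each intersection is a candidate 'parent' in an inheritance-style tree."""
    cores = Counter()
    for a, b in combinations(moves, 2):
        shared = frozenset(a) & frozenset(b)
        if shared:
            cores[shared] += 1
    return cores

# Hypothetical attribute labels for three observed legal moves.
moves = [
    {"to:tableau", "alt_color", "rank-1"},
    {"to:tableau", "alt_color", "rank-1", "from:waste"},
    {"to:foundation", "same_suit", "rank+1"},
]
cores = common_cores(moves)
```

Going "higher" would mean intersecting the resulting cores with each other again. The number of pairs grows quadratically at each level, which fits the remark about the PC running out of steam.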
...how long it takes before Big Brother decides he can use this to track all of the "troublemakers" in large crowds (everyone is a troublemaker, according to the Gov't).
Apparently the Flamebait mod is now given to people who disagree with the Party...
Disclaimer: The opinions and actions of the US Gov't are in no way representative of those held by this author or its ci
ah ha ha. ...or maybe just to those comments consisting of knowingly over-simplified-to-falsehood, generalizing, argument-inducing statements that don't contribute to the conversation and are about off-topic subjects, particularly "government"?
It's an extremely valid concern. If I disagree with the Gov't's actions, I can't exactly take my business elsewhere, can I? Sure, I can go find another one, but I can't go without one, or start my own (with the idea of doing it right). A private company can't fine me, put me in jail, or execute me. The government can. That's why, with really great technology (and it is really cool), I usually direct my concerns at the government. I don't want it turned against me, and if the government turns it against me, it's far more likely to be successful.
Disclaimer: The opinions and actions of the US Gov't are in no way representative of those held by this author or its ci
Can I get a mod up? Someone is trying to mod me into oblivion.