AI Astronomer Aids Effort To Analyze Galaxies
kkleiner writes "Scientists are teaching an artificial intelligence how to classify galaxies imaged by telescopes like the Hubble. Manda Banerji at the University of Cambridge, along with researchers at University College London, Johns Hopkins, and elsewhere, has succeeded in getting the program to agree with human analysis at an impressive rate of more than 90%. Banerji used data from Galaxy Zoo, a massive online project that has used more than 250,000 volunteers to analyze more than 60 million galaxies. The new automated astronomer will help with even larger analytical projects on the horizon, taking care of trivial classifications and leaving the tough cases to humans."
I wonder how this program compares to a human doing the same job If given the same "training" I wonder how many humans would get a 90% agreement rate looking at the same data.
I would think with a name like Al Astronmer your career choices would be limiting. I guess I was right.
If it is making a "guess as to the most likely classification", it sounds like there is a measure of confidence. Perhaps the system is capable of presenting questionable cases to human experts.
Insert self-referential sig here.
The paper discussing this work is http://arxiv.org/abs/0908.2033. They appear to be using a pretty standard neural network approach (disclaimer: I don't have much background in neural nets at all. I'm just going off of how they were described in the last class I took that discussed them.) This is part of a very general pattern where programs have done a lot of work that we would think could only be done by people. Other examples include the computerized proof of the Robbins conjecturehttp://en.wikipedia.org/wiki/Robbins_conjecture. TFA lists a few examples as well which are in more applied areas.
Let's say the system of scientific paradigms and their rise and fall is about finding outliers and surprising results that cannot easily be explained by current models.
If this is the case, then using a statistical system to classify observations has the danger that these "outliers" simply get classified in existing categories and whatever abnormality they represent thereby ignored.
An "AI researcher" must therefore have very explicit programming to set anything with even the slightest degree of abnormality aside for human evaluation. If it's set to classify anything and everything according to preset rules, it's actually mostly destructive to good science.
I wonder if it has any Tauntauns!
Careful What You Wish For....
R. Daneel Olivaw
"Kill 'em all and let Root sort 'em out"
If this technologies works for classifying galaxies, perhaps next we could put it to work classifying porn on the web!
I've abandoned my search for truth; now I'm just looking for some useful delusions.
So Galaxy Zoo doesn't need me anymore? That is the one activity where I was contributing to science to benefit all mankind.
Oh well, I guess I'll go back to trying to beat Mario 64 or something equally pointless....
PRO:
Using neural networks allows for graceful degradation when classifying galaxies by indicating to what degree it believes this galaxy is similar to other galaxies of this type (that it has been trained on). A threshold can be set so that if confidence falls below this threshold, the image is flagged for human intervention.
CONS:
Neural nets are largely black boxes. They use learned statistical relationships to classify images, but they're unable to provide an explanation as to why they made the decision that they did.
I'm surprised they're just now getting around to this. It's a straightforward pattern classification problem, and there is a huge set of training examples to be used for training a neural network or other Learning Classifier System technologies.
Sheesh, evil *and* a jerk. -- Jade
Artificial Agent Aids Astral Analysis
It breaks my pluginses, my precious!
There are a couple of answers to your question. The first is the answer to the more general question, "Why study the universe at all?" and the answer is "Because it's there." We want to understand the processes by which the universe we see around us was formed, what it's like now (to the degree that "now" has any meaning on cosmological scales) and where it's going. It is an awe-inspiring place, and becomes more so the more we learn about it.
The second, with respect to the study of the Milky Way, is that we learn a lot about our galaxy by studying other galaxies. We don't have a good vantage point for studying the Milky Way, for obvious reasons. Hell, it wasn't until quite recently that we even knew what shape it was (barred spiral vs. plain spiral.) With the enormous number of galaxies out there, many of them similar to our own, at a variety of viewing angles from Earth, we can get a much better idea of what's going on in our own neighborhood than we could by restricting our observations to the Milky Way alone.
The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Because the farther away they are, the farther back in time we're looking. By collecting images of galaxies at different stages of evolution (and different types of collisions) cosmologists are able to form a much better picture of how galaxies (and the universe in general) form and evolve.
No folly is more costly than the folly of intolerant idealism. - Winston Churchill
Ah, ok. This makes sense. Thanks!
This makes me very happy on one level and very sad on another.
At the amateur end, the advances in technology have meant that what use to be done by a professional with mind blowingly expensive equipment or what was not at all possible because it hadn't been invented can now be done by a dedicated amateur with a reasonable but largish hobby budget. For the amount of money some spend on recreational vehicles and holiday homes an amateur can now do spectroscopy, deep imaging, even adaptive optics. It's not open to everyone - you need to have good circumstances - a job that both pays well and puts somewhere within driving distance from less light polluted skies. But it can be done..
On the other hand the technology has meant at the professional end what was cutting edge a few decades ago is now obsolete and not an area of interest. What use to be done on an individual basis is being taken over by surveys etc.
What this means is that there are only a handful of ways in which an amateur can contribute real science. Mostly this revolves around tasks that are either considered not important enough to dedicate professional resources to, or areas that aren't easily automated or taken over by sky surveys. Stuff like variable star observing and galaxy zoo. Now those areas are dwindling too as the automation gets better. The amateurs have done a wonderful job especially with variable star observing - with records extending back hundreds of years - this is data that professionals did not have the time to gather themselves nor the technology to gather in bulk....until now. With projects like Pan-STARRS coming online, how long will this be a useful way to contribute? The records will improve but the opportunity to contribute will dwindle.
Also there's the nagging feeling that automation, while good for most things, can't completely replace human curiosity. For the Galaxy Zoo project, I wonder if this method would detect anomalous objects like Hanny's Voorwerp. That was only discovered because a schoolteacher bothered to ask "what the heck is that smudge" instead of simply dismissing it as a photographic error. This led to Galaxy Zoo 2 including a button to report such objects.
So overall I think we'll make great progress - much greater than human only efforts - but I do wonder what discoveries we'll miss.
These posts express my own personal views, not those of my employer
Pictures showing galaxies that are billions of light-years away make nice posters, but it seems totally pointless to put too much effort into these things, when there's so much we don't know about the stuff inside our own galaxy.
But to learn about the stuff inside our galaxy, and how it came to be, we need to see how it looked in the past. ;}
Since you haven't gotten around to making that time machine yet, we can't do it that way
Instead we look at light from galaxies that have been traveling in space for an amount of time equal to how far back in time we want to see, and we discover such things as galaxy formation.
This is ONLY possible to do by looking at distant and thus older galaxies. And it does teach us more about our own.
How do we know what the best use for observation resources is? why are ones that are closer to us any better than the ones far away? How do you determine where the good discoveries will be ? This seems like a politician's approach to science.
That's what I was asking, because on the surface it seemed to me to make more sense studying nearby galaxies only. However, some other helpful responders pointed out that far-away galaxies allow us to see farther back in time (essentially, what we see of the far-away galaxies is how they appeared billions of years now, not how they appear now), and see how galaxies form and collide, and this might lead to insight into how our galaxy came into being.
The difference between my questioning and the politicians is that the politicians, being lawyers, aren't honestly looking for answers to their questions. They already have their minds made up and are trying to twist things around to benefit themselves. Normal questions from laymen like myself, when given appropriate answers, yield more understanding for all laymen.
further on, once we have a good idea about what happened in the universe, we can start checking that against the various sets of "laws of the universe" that we can generate, and decide which are valid (i.e. "is string theory ok, or do we need something else?").
Afterwards, we can try to use the "correct" model of the universe to generate cool stuff (like quantum physics was used to properly describe semiconductors, and we got miniaturized electronics).
I wasn't trying to exagerate. Usually in science, each geek gets excited about a different thing; some years later, a less geeky person tries to do something practical, but is geeky enough to understand what the other geeks were doing, and succeeds in putting together all the information to come up with something useful. Generally, we have to try to finance all the excited geeks, because we can't properly decide which of them will come up with something useful.
new sig
Sorry to burst the bubble, but automatic classification of galaxies from sky survey data using machine learning techniques was accomplished in the early '90s by the SKICAT system developed at JPL and Caltech. http://adsabs.harvard.edu//abs/1995PASP..107.1243W is a good overview of the system and its accomplishments as of 1995.
There's no sense in being precise when you don't even know what you're talking about. -- John von Neumann