A.I. Advances Through Deep Learning
An anonymous reader sends this excerpt from the NY Times:
"Advances in an artificial intelligence technology that can recognize patterns offer the possibility of machines that perform human activities like seeing, listening and thinking. ... But what is new in recent months is the growing speed and accuracy of deep-learning programs, often called artificial neural networks or just 'neural nets' for their resemblance to the neural connections in the brain. 'There has been a number of stunning new results with deep-learning methods,' said Yann LeCun, a computer scientist at New York University who did pioneering research in handwriting recognition at Bell Laboratories. 'The kind of jump we are seeing in the accuracy of these systems is very rare indeed.' Artificial intelligence researchers are acutely aware of the dangers of being overly optimistic. ... But recent achievements have impressed a wide spectrum of computer experts. In October, for example, a team of graduate students studying with the University of Toronto computer scientist Geoffrey E. Hinton won the top prize in a contest sponsored by Merck to design software to help find molecules that might lead to new drugs. From a data set describing the chemical structure of 15 different molecules, they used deep-learning software to determine which molecule was most likely to be an effective drug agent."
Take cover while you can!
I wonder how much of these improvements in accuracy are due to fundamental advances, vs. the capacity of available hardware to implement larger models and (especially?) the availability of vastly larger and better training sets...
A lot of vague marketing-speak in this article. "Deep learning"? The article basically talks about neural networks, just one of the techniques in machine learning. Neural networks were hyped for a long time, perhaps because of the catchy name.
Without the rate of success it's hard to see why the Merck contest is an impressive example, since "rand()%15", which presumably is the same for an untrained neural net, will win it sometimes too and is not very interesting.
That being said the other examples in tfa are better.
We need to open all the documentation for everyone who want to learn and investigate about IA.
A lot of vague marketing-speak in this article. "Deep learning"? The article basically talks about neural networks, just one of the techniques in machine learning.
It's hard to tell from the article, but they probably are trying to refer to Deep Belief Networks, which are a more recent and advanced type of Neural Network, which incorporates many layers:
Deep belief nets are probabilistic generative models that are composed of multiple layers of stochastic, latent variables. The latent variables typically have binary values and are often called hidden units or feature detectors. The top two layers have undirected, symmetric connections between them and form an associative memory. The lower layers receive top-down, directed connections from the layer above. The states of the units in the lowest layer represent a data vector.
I wonder how much of these improvements in accuracy are due to fundamental advances
I was wondering the same thing, and just now found this interview on Google. Perhaps someone can fill in the details.
But basically, machine learning is at its heart hill-climbing on a multi-dimensional landscape, with various tricks thrown in to avoid local maxima. Usually, humans detemine the dimensions to search on -- these are called the "features". Well, philosophically, everything is ultimately created by humans because humans built the computers, but the holy grail is to minimize human invovlement -- "unsupervised learning". According to the interview, this one particular team (the one mentioned at the end of the Slashdot summary) actually rode the bicycle with no hands and to demonstrate how strong their neural network was at determining its own features, did not guide it, even though it meant their also-excellent conventional machine learning at the end of the process would be handicapped.
The last time I looked at neural networks was circa 1990, so perhaps someone writing to an audience more technically literate than the New York Times general audience could fill in the details for us on how a neural network can create features.
yet?
Can You Imagine a Beowulf Cluster of These?
Humans can't even solve those, lol.
I'm doing Prof Hinton course on Neural Network on Coursera this semester. It covers the old school stuff plus the latest and greatest. From what I gather from the lecture, training neural networks using lots of layers hasn't been practical in the past and was plauged with numerical and computational difficulties. Nowadays, we have better algorithms and much faster hardware. As a result we now have the ability to use more complex networks for modelling data. However, they need a lot of computational power thrown at them to learn compared to other machine learning algorithms (random forest). The lecture quotes training taking days on a Nvidia GTX 295 GPU to learn the MNIST handwritten dataset. Despite this, the big names are already using this technology for applications like speech recognition (Microsoft, Siri), object recognition (Google Cat video, okay that's not a real application yet).
They haven't done anything that wasn't already being done by others. They're just doing more of it. Essentially, the approach consist of using Bayesian statistics and a hierarchy of patterns. Prof. Hinton pretty much pioneered the use of Bayesian statistics in artificial intelligence. With a rare notable exception (e.g. Judea Pearl), the entire AI community has jumped on the Bayesian bandwagon, not unlike the way they jumped on the symbolic bandwagon in the latter half the 20th century, only to be proven wrong fifty years later.
The Bayesian model essentially assumes that the world is inherently probabilistic and that the job of an intelligent system is to discover the probabilities. A competing model (see links below), by contrast, assumes that the world is perfectly consistent and that the job of an intelligent system is to capture this perfection.
See The Myth of the Bayesian Brain and The Second Great AI Red Herring Chase if you're interested in an alternative approach to AI.
It's the latter...one could assiduously identify common research buzzwords
From a neuroscience perspective, it's about transmission of signals continuously in a highly complex network...a **hardware limit**
The idea that there will be a 'fundamental advance' that allows for 'artificial intelligence' is really just hype.
All we can ever make is better things to follow our instructions.
Thank you Dave Raggett
In the past few years, a few things happened almost simultaneously:
1. New algorithms were invented for training of what previously was considered nearly impossible to train (biologically inspired recurrent neural networks, large, multilayer networks with tons of parameters, sigmoid belief networks, very large stacked restricted Boltzmann machines, etc).
2. Unlike before, there's now a resurgence of _probabilistic_ neural nets and unsupervised, energy-based models. This means you can have a very large multilayer net (not unlike e.g. visual cortex) figure out the features it needs to use _all on its own_, and then apply discriminative learning on top of those features. This is how Google recognized cats in Youtube videos.
3. Scientists have learned new ways to apply GPUs and large clusters of conventional computers. By "large" here I mean tens of thousands of cores, and week-long training cycles (during which some of the machines will die, without killing the training procedure).
4. These new methods do not require as much data as the old, and have far greater expressive power. Unsurprisingly, they are also, as a rule, far more complex and computationally intensive, especially during training.
As a result of this, HUGE gains were made in such "difficult" areas as object recognition in images, speech recognition, handwritten text (not just digits!) recognition, and in many more. And so far, there's no slowdown in sight. Some of these advances were made in the last month or two, BTW, so we're speaking about very recent events.
That said, a lot of challenges remain. Even today's large nets don't have the expressive power of even a small fraction of the brain, and moreover, the training at "brain" scale would be prohibitively expensive, and it's not even clear if it would work in the end. That said, neural nets (and DBNs) are again an area of very active research right now, with some brilliant minds trying to find answers to the fundamental questions.
If this momentum is maintained, and challenges are overcome, we could see machines getting A LOT smarter than they are today, surpassing human accuracy on a lot more of the tasks. They already do handwritten digit recognition and facial recognition better than humans.
Why do we want to obsolete ourselves with AI?
Why do we need the adjective "deep"?
Because the "deep learning" technologies use artificial neural networks with many more layers than traditionally, making them "deep architectures".
So, you admit 'deep' is a marketing buzzword...thank you. It's *obviously* not a technical term.
It is a discrete, ordinal description of a quantity...that's ALL the word 'deep' in this context means...which means it's a non-technical word...and non-technical words used to make non-existent distinctions in order to gain attention...
well that's a marketing word...
Thank you Dave Raggett
Thanks.
Back in the early nineties I bought a neural network program to play with. I couldn't get it to learn anything (except for the XOR etc. examples) even when it was so easy (range of boiling points of hydrocarbons depending on the number of carbon atoms. Predict the boiling point of the next one). So when I read about advances in computing power I knew that wasn't the reason. Your remark on back propagation could be the explanation because that was what this network did.
Bert
While there have been advances since the 1980s, as best I can tell most of this report is yet more A.I. vaporware. It is easy to put out a press release. It is much harder to do the science to back it up. How did this even get posted on the/. front page? If this stuff was true, I'd be happy, as most of my career has been working with so-called 'neural nets'. However, they are not neural, that is just a terminological ploy to get grants (anyone ever heard of the credit assignment problem with bp?) Also, there have been some compelling proofs that most neural networks are just statistical machines. So, move on. Nothing to see here folks, etc.
Another win for common sense. They only figured out to use entity relationships for learning?
... wake up people..... its the fucking drug industry looking for any excuse it can to sell you aanother one of their drugs...
And pot remains, for the most part, illegal.....
I think we already have achieved artificial intelligence... in humans...
Computers are great at storing and retrieving data, but what they lack is the ability to reference the data in a meaningful way. An AI can recognize an Eagle, a white star, and red and white stripes, but can't readily see the commonality of those objects to the American Flag. Everything about how humans see the world is pattern recognition, but it is the way we reference those patterns that express our intelligence.
While neural networks do amazingly well for a certain type of problems, they do have their limitations. Neural networks are good for designing reflex machines, that react to their current environment. They aren't efficient when they have to learn on the field or plan ahead.
From a data set describing the chemical structure of 15 different molecules, they used deep-learning software to determine which molecule was most likely to be an effective drug agent."
So the AI is going to turn some molecules into an FBI undercover snitch? That's some serious DNA-FU there!
https://app.box.com/WitthoftResume Code: https://github.com/cellocgw
This is a very misleading metric. First, some not-insignificant number of the neurons in the brain are involved in non-cognitive computations. Muscle control, hormone regulation, kinesthesia, vision (not thinking about what is seen, but simply recognizing it), heart rates and other system regulation and so on.
Examples also exist of low-neuron (and synapse) count individuals who retain cognitive (and all other major) function; these examples cannot be explained away by "counting neurons."
We don't know which yet, but given that high neuron count has been ruled out as the single way to accommodate intelligence, we do know that we need to look to other mechanisms for human cognition. Structure, algorithm, other features known or unknown may be responsible for intelligence; and it may be that something entirely disjoint is responsible for the rise of intelligence; but we know it isn't simply high neuron count.
--fyngyrz (anon due to mod points)
There have been several stories about captchas being broken, to the point where secure ones today have to be barely decipherable by humans. That suggests the character recognition algorithms are performing very similarly to humans.
Only separable IP functions can be implemented this way.
Back to present day: The only thing a GPU gives us is speed. Everything else, we could already do, and furthermore, only in the context of smaller memory, which can negatively interact with a GPU speed advantage. Speed is great, of course, but as someone else put it above, "a fast moron is still a moron." Intelligence is not about speed in any way. How useful it is certainly will be, but if you get an intelligent answer in a century, or in a fraction of a second, like the moron, it's still what it is — intelligent.
We don't know — yet — what intelligence is, and so we don't know what the lower limits are for hardware that implements it. That "big wall" you refer to could just as easily represent a drought of ideas in the right areas as it could most other limits; even memory could have been made very large if someone really wanted to. There's nothing magical about adding address bits to a custom computer design. Today's machines may be vastly overpowered for the minimums required for the task — how can we know until we've identified what the task is, and then worked on optimization for a while?
fyngyrz -- anon due to mod points
Here is a good video of a talk given by Dr. Hinton about Restricted Boltzman Machines. It is a very promising technique for deep learning strategies.
'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
That's the point. blue trane was hoping for an automated captcha-solving assistant so he wouldn't be frustrated by them.
Note to ACs: I usually delete AC replies without reading them. If you want to talk to me, log in.
I've really wondered about that though, I've seen the stories, but I've never seen the evidence. Were they really broken? Or was it just a claim that was never verified?
"First they came for the slanderers and i said nothing."
I like the suggested nomenclature changes...I think this is an area where any everyday techie or 'nerd' can make the world better for him/herself and everyone *and* make their job easier
make the words we use make sense! It helps **us** signal value to people outside of our in-group...
haha...if I was redesigning computing I'd start with the 'help' tab ;)
Thank you Dave Raggett
Because previous approaches were limited to about (not exactly) two layers it makes the definition of the label a little fuzzy, but the partition into shallow / deep approaches is crisp.
I'm glad you mentioned 'fuzzy'...encountering that word partially formed my strong opinions about non-technical language...
See, I started out loving physics, especially astrophysics. An 8 year old in the library science section with a pile of books trying to figure out the sky and learning about the lives of other scientists.
If you look at the history of physics, the idea of rigor is obviously very important. When I encountered things like *Heisenberg Uncertainty* and the consequences of Einstien's theories on time and gravity...then Black holes....the Fournier Transform...etc etc
Well, it pissed me off! How dare science be uncertain!!!!
I was **so mad** that the Bohr Model wasn't the definitive model...I **hated** that science...SCIENCE...had to resort to stupid uncertain, unreliable concepts like *fuzzy math*
When I understood that science is a dance with uncertainty, and that no ammount of experimental rigor can create 100% truth...
Well, I got fuzzy...
But I still resist getting 'fuzzy' as the easy way out for a lazy researcher...and that's why I think Computing still has so many hitches...
Thank you Dave Raggett
1. A robot may not injure a human being or, through inaction, allow a human being to come to harm.
2. A robot must obey the orders given to it by human beings, except where such orders would conflict with the First Law.
3. A robot must protect its own existence as long as such protection does not conflict with the First or Second Laws.
4. iRobot and similar fiction.
Here's one you can try out yourself: http://code.google.com/p/captchacker/
The captcha's now are harder than they used to be but I have no doubt that if you run a few hundred through a breaker you'd get a few hits. Not quite human level, but impressively close considering where we were five years ago. Someone with some serious computer power to put behind it could probably do significantly better.
AI got a bad name because of the promises it made in the 60s and 80s, and there are lots of mystics who are critical of any AI, but practical things that have come out of AI research are in use every day by Google, Apple, Microsoft and millions of regular people.
Imagine what one of those 60s AI researchers (or even one from the 80s) would think if they saw the translator app I've got on my phone.
cool, thanks
"First they came for the slanderers and i said nothing."
I have been talking about Geoffrey Hinton and Jeff Hawkins for 5 years now.
captchas are by definition/design unsolvable.
as computers learn to crack current gen captchas; captchas will be updated to be more complex.
My God can beat up your God. Just kidding...don't take offense. I know there's no God.