A.I. Advances Through Deep Learning

Posted by Soulskill on Saturday November 24, 2012 @04:28PM from the skip-the-lesson-on-killing-all-humans dept.

An anonymous reader sends this excerpt from the NY Times: "Advances in an artificial intelligence technology that can recognize patterns offer the possibility of machines that perform human activities like seeing, listening and thinking. ... But what is new in recent months is the growing speed and accuracy of deep-learning programs, often called artificial neural networks or just 'neural nets' for their resemblance to the neural connections in the brain. 'There has been a number of stunning new results with deep-learning methods,' said Yann LeCun, a computer scientist at New York University who did pioneering research in handwriting recognition at Bell Laboratories. 'The kind of jump we are seeing in the accuracy of these systems is very rare indeed.' Artificial intelligence researchers are acutely aware of the dangers of being overly optimistic. ... But recent achievements have impressed a wide spectrum of computer experts. In October, for example, a team of graduate students studying with the University of Toronto computer scientist Geoffrey E. Hinton won the top prize in a contest sponsored by Merck to design software to help find molecules that might lead to new drugs. From a data set describing the chemical structure of 15 different molecules, they used deep-learning software to determine which molecule was most likely to be an effective drug agent."

46 of 162 comments

Sources of improvements? by drooling-dog · 2012-11-24 16:44 · Score: 4, Insightful

I wonder how much of these improvements in accuracy are due to fundamental advances, vs. the capacity of available hardware to implement larger models and (especially?) the availability of vastly larger and better training sets...
1. Re:Sources of improvements? by PlusFiveTroll · 2012-11-24 16:49 · Score: 2, Informative
  
  from TFA
  " Modern artificial neural networks are composed of an array of software components, divided into inputs, hidden layers and outputs. The arrays can be “trained” by repeated exposures to recognize patterns like images or sounds.
  These techniques, aided by the growing speed and power of modern computers, have led to rapid improvements in speech recognition, drug discovery and computer vision. "
  Sounds like both.
2. Re:Sources of improvements? by Anonymous Coward · 2012-11-24 17:06 · Score: 2, Insightful
  
  Don't forget that it's not impossible to build a specially designed processor to do a particular task; such as the digital orrery. Such devices created to do nothing but neural net simulations would be more efficient than using a general purpose computer. It would be linked to such to provide a convenient interface but do most of the heavy lifting itself.
3. Re:Sources of improvements? by iggymanz · 2012-11-24 17:27 · Score: 4, Insightful
  
  no, that first sentence pretty much sums up digital neural nets over two decades ago. So more likely the over two orders of magnitude processing power per chip improvement since then, with addressable memory over three orders of magnitude bigger....
4. Re:Sources of improvements? by Anonymous Coward · 2012-11-24 17:49 · Score: 2, Informative
  
  Glad they were able to make it work so quick, but drug discovery has been done like this for over a decade. I worked at an "Infomesa" startup that was doing this in Santa Fe in 2000.
5. Re:Sources of improvements? by Prof.Phreak · 2012-11-24 17:57 · Score: 5, Informative
  
  The ``new'' (e.g. last decade or so) advances are in training hidden layers of neural networks. Kinda like peeling an onion, each layer getting progressively coarser representation of the problem. e.g. if you have 1000000 inputs, and after a few layers, only have 100 hidden nodes, those 100 nodes are in essence representing all the ``important'' (some benchmark you choose) information of those 1000000 inputs.
  
  --
  "If anything can go wrong, it will." - Murphy
6. Re:Sources of improvements? by PlusFiveTroll · 2012-11-24 18:15 · Score: 3, Insightful
  
  Article didn't say, but if I had to make a guess, this is where I would start.
  http://www.neurdon.com/2010/10/27/biologically-realistic-neural-models-on-gpu/
  "The maximal speedup of GPU implementation over dual CPU implementation was 41-fold for the network size of 15000 neurons."
  This was done on cards 7 years old now. The massive increase of power in GPUs in the past few years, along with more features and better programming languages for them, means the performance increase could possibly be many hundreds of times. An entire cluster of servers gets crunched down into one card; put multiple cards in one server, build a cluster of those, and you can quickly see that the amount of computing power available to neural networks is much, much larger now. I'm not even sure how to compare the 6800 GT to a modern GTX 680 because of their huge differences, but the 6800 did 54 GFLOPS and the 680 does 3090.4: a 57x increase. How far back do we have to go to find CPUs that are 57 times slower? If everything scales the same as in the paper's calculations, it would mean over a 2000x performance increase on a single computer with 1 GPU. In 7 years.
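  The back-of-the-envelope estimate above can be redone in a few lines. This is only a sketch: it takes the quoted throughput figures at face value (assumed to be GFLOPS), and it assumes the 41-fold speedup from the linked article scales linearly with raw GPU throughput, which real workloads rarely do. The function and constant names are made up for illustration.

```python
# Figures quoted in the comment above; units assumed to be GFLOPS.
GFLOPS_OLD_CARD = 54.0      # the 6800-series card, as quoted
GFLOPS_NEW_CARD = 3090.4    # the GTX 680, as quoted
PAPER_SPEEDUP = 41.0        # GPU vs. dual-CPU speedup quoted from the linked article


def combined_speedup(old_gflops, new_gflops, base_speedup):
    """Return (GPU-to-GPU ratio, naive overall speedup), assuming linear scaling."""
    gpu_ratio = new_gflops / old_gflops
    return gpu_ratio, gpu_ratio * base_speedup


gpu_ratio, overall = combined_speedup(GFLOPS_OLD_CARD, GFLOPS_NEW_CARD, PAPER_SPEEDUP)
print(f"GPU-to-GPU ratio: {gpu_ratio:.1f}x")
print(f"Naive overall speedup: {overall:.0f}x")
```

  Under those assumptions the ratio comes out a little above 57x and the combined figure a little above 2,300x, consistent with the "over 2000x" estimate.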
7. Re:Sources of improvements? by Black+Parrot · 2012-11-24 18:30 · Score: 5, Informative
  
  I wonder how much of these improvements in accuracy are due to fundamental advances, vs. the capacity of available hardware to implement larger models and (especially?) the availability of vastly larger and better training sets...
  I'm sure all of that helped, but the key ingredient is training mechanisms. Traditionally networks with multiple layers did not train very well, because the standard training mechanism "backpropagates" an error estimate, and it gets very diffuse as it goes backwards. So most of the training happened in the last layer or two.
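  A toy illustration of why the backpropagated error gets diffuse: in a chain of sigmoid layers, the chain rule multiplies in one factor per layer, and for a sigmoid unit with weight 1 that factor is never larger than 0.25. The one-unit-per-layer network below is an assumption made purely to keep the arithmetic visible; it is not any particular published model.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def backprop_signal(depth, weight=1.0, x=0.5):
    """Magnitude of d(output)/d(input) for a chain of `depth` one-unit sigmoid layers.

    Toy model: each layer computes a = sigmoid(weight * a_prev), so the chain
    rule contributes one factor weight * a * (1 - a) per layer.
    """
    activation = x
    signal = 1.0
    for _ in range(depth):
        activation = sigmoid(weight * activation)
        signal *= weight * activation * (1.0 - activation)
    return abs(signal)


for depth in (1, 2, 4, 8):
    print(f"depth {depth}: gradient magnitude {backprop_signal(depth):.2e}")
```

  With weight 1 the signal shrinks by at least a factor of four per layer, so the early layers of a deep chain receive almost no training signal.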
  This changed in 2006 with Hinton's work on deep belief nets built from Restricted Boltzmann Machines, and someone else's insight that you can train one layer at a time using auto-associative methods.
  "Deep Learning" / "Deep Architectures" has been around since then, so this article doesn't seem like much news. (However, it may be that someone is just now getting the kind of results that they've been expecting for years. Haven't read up on it very much.)
  These methods may be giving ANN a third lease on life. Minsky & Papert almost killed them off with their book on perceptrons in 1969[*], then Support Vector Machines nearly killed them again in the 1990s.
  They keep coming back from the grave, presumably because of their phenomenal computational power and function-approximation capabilities.[**]
  [*] FWIW, M&P's book shouldn't have done anything, since it was already known that networks of perceptrons don't have the limitations of a single perceptron.
  [**] Siegelmann and Sontag put out a couple of papers, in the 1990s I think, showing that (a) you can construct a Turing Machine with an ANN that uses rational numbers for the weights, and (b) using real numbers (real, not floating-point) would give a trans-Turing capability.
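  The point in footnote [*] can be made concrete with XOR, the textbook function that no single perceptron can represent but a small network of perceptrons can. The weights below are hand-picked for illustration rather than learned, and the function names are made up.

```python
def step(x):
    """Threshold activation of a classic perceptron."""
    return 1 if x > 0 else 0


def perceptron(inputs, weights, bias):
    return step(sum(w * i for w, i in zip(weights, inputs)) + bias)


def xor_network(a, b):
    """Two hidden perceptrons (OR, NAND) feeding one output perceptron (AND)."""
    h_or = perceptron((a, b), (1, 1), -0.5)
    h_nand = perceptron((a, b), (-1, -1), 1.5)
    return perceptron((h_or, h_nand), (1, 1), -1.5)


for a in (0, 1):
    for b in (0, 1):
        print(a, b, xor_network(a, b))
```

  The four input pairs produce 0, 1, 1, 0, which is XOR.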
  
  --
  Sheesh, evil *and* a jerk. -- Jade
8. Re:Sources of improvements? by phantomfive · 2012-11-24 18:34 · Score: 2
  
  I think this quote says it all:
  
  Referring to the rapid deep-learning advances made possible by greater computing power, and especially the rise of graphics processors, he added: “The point about this approach is that it scales beautifully. Basically you just need to keep making it bigger and faster, and it will get better. There’s no looking back now.”
  
  I'm sure they've come up with a few incremental advances, but it looks primarily like they've just taken advantage of hardware improvements. You can see from the numbers in the article the results are about what you'd expect from improved hardware (as opposed to actually solving the problem):
  
  [some guy] programmed a cluster of 16,000 computers to train itself to automatically recognize images in a library of 14 million pictures of 20,000 different objects. Although the accuracy rate was low — 15.8 percent — the system did 70 percent better than the most advanced previous one.
  
  --
  "First they came for the slanderers and i said nothing."
9. Re:Sources of improvements? by ShanghaiBill · 2012-11-24 19:03 · Score: 3, Informative
  
  Why build a special processor when ATI and Nvidia already do? Probably at a much lower cost per calculation than a custom machine.
  A GPU can run a neural net much more efficiently than a general purpose CPU, but specialized hardware designed just for NNs could be another order of magnitude more efficient. Of course GPUs are more cost effective because they are mass market items, but if NN applications take off it is likely that everyone will want one running on their cellphone, and then customized NN hardware will be mass market too.
10. Re:Sources of improvements? by phantomfive · 2012-11-24 19:12 · Score: 2
  
  using real numbers (real, not floating-point) would give a trans-Turing capability.
  What on earth is trans-Turing capability?
  
  --
  "First they came for the slanderers and i said nothing."
11. Re:Sources of improvements? by Black+Parrot · 2012-11-24 19:20 · Score: 2
  
  using real numbers (real, not floating-point) would give a trans-Turing capability.
  What on earth is trans-Turing capability?
  Can compute things that a TM can't.
  I think the paper was controversial when it first came out, but I'm not aware that anyone has ever refuted their proof.
  
  --
  Sheesh, evil *and* a jerk. -- Jade
12. Re:Sources of improvements? by timeOday · 2012-11-24 19:26 · Score: 3, Insightful
  
  You can see from the numbers in the article the results are about what you'd expect from improved hardware (as opposed to actually solving the problem)
  "As opposed to actually solving the problem"? Your brain has about 86 billion neurons and around 100 trillion synapses. It accounts for 2% of body weight and 20% of energy consumed. Do you think these numbers would be large if they didn't need to be?
  I think the emphasis in computer science on focusing so exclusively on polynomial-time algorithms has really stunted it. Maybe most of the essential tasks for staying alive and reproducing don't happen to have efficient solutions, but the constants of proportionality are small enough to brute-force with several trillion neurons.
13. Re:Sources of improvements? by Tagged_84 · 2012-11-24 19:44 · Score: 4, Informative
  
  IBM recently announced success in simulating 2 billion of their custom designed synaptic cores, 1 trillion synapses apparently. Here's the pdf report
14. Re:Sources of improvements? by PhamNguyen · 2012-11-24 20:06 · Score: 3, Interesting
  
  I work in this area. It is mainly the latter, that is bigger data sets and faster hardware. At first, people thought (based on fairly reasonable technical arguments) that deep networks could not be trained with backpropagation (which is the way gradient descent is implemented on neural networks). Now it turns out that with enough data, they can.
  On the other hand there have been some theoretical advances by Hinton and others where networks can be trained on unsupervised data (e.g. the Google cats thing).
15. Re:Sources of improvements? by HalfFlat · 2012-11-24 20:10 · Score: 2
  
  [...] using real numbers (real, not floating-point) would give a trans-Turing capability.
  Given that almost every real number encodes an uncountable number of bits of information, I guess this isn't especially surprising in retrospect. The result though should make us suspicious of the assumption that the physical constants and properties in our physical theories can indeed take any real number value.
16. Re:Sources of improvements? by smallfries · 2012-11-24 22:02 · Score: 2, Insightful
  
  The problem comes when you try larger inputs. Regardless of constant factors, if you are playing with O(2^n) algorithms then n will not increase above about 30. If you start looking at really weird stuff (optimal circuit design and layout) then the core algorithms are O(2^2^n), and then if you are really lucky n will reach 5. Back in the 80s it only went to 4, but that's Moore's law for you.
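  To put numbers on that, here is a small sketch that prints raw step counts for both growth rates, ignoring constant factors. The function names are made up for illustration.

```python
def steps_exponential(n):
    """Step count for an O(2^n) algorithm, ignoring constant factors."""
    return 2 ** n


def steps_double_exponential(n):
    """Step count for an O(2^(2^n)) algorithm, ignoring constant factors."""
    return 2 ** (2 ** n)


for n in (10, 20, 30, 40):
    print(f"2^{n} = {steps_exponential(n):,}")

for n in (3, 4, 5, 6):
    print(f"2^(2^{n}) = {steps_double_exponential(n):,}")
```

  Going from n = 4 to n = 5 in the double-exponential case takes the count from 65,536 to about 4.3 billion, and n = 6 is already around 1.8 x 10^19.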
  
  --
  Slashdot: where don knuth is an idiot because he cant grasp the awesome power of php
17. Re:Sources of improvements? by Anonymous Coward · 2012-11-24 23:39 · Score: 2, Interesting
  
  The way they are trained is very different, and it's this change that improves the performance. It's more than just making them faster, a fast idiot is still an idiot.
18. Re:Sources of improvements? by maxwell+demon · 2012-11-24 23:58 · Score: 5, Informative
  
  Given that almost every real number encodes an uncountable number of bits of information, I guess this isn't especially surprising in retrospect. The result though should make us suspicious of the assumption that the physical constants and properties in our physical theories can indeed take any real number value.
  The number of bits needed to represent an arbitrary real number exactly is infinite, but not uncountable.
  
  --
  The Tao of math: The numbers you can count are not the real numbers.
19. Re:Sources of improvements? by aaaaaaargh! · 2012-11-25 00:08 · Score: 2
  
  He meant that an ANN with real numbers is a hypercomputer, which is true.
  The problem is that like most conceivable hypercomputers neural networks with real numbers would violate natural laws, e.g. the laws of thermodynamics.
20. Re:Sources of improvements? by HalfFlat · 2012-11-25 00:43 · Score: 2
  
  Indeed you are right.
21. Re:Sources of improvements? by Anonymous Coward · 2012-11-25 01:34 · Score: 3, Informative
  
  A garden snail has about 20,000 neurons, a cat has 1 billion neurons, a human has 86 billion neurons.
  http://www.guardian.co.uk/science/blog/2012/feb/28/how-many-neurons-human-brain
22. Re:Sources of improvements? by timeOday · 2012-11-25 03:16 · Score: 2
  
  When you talk about O() you're talking about the worst case for finding an exact solution. Brains don't find exact solutions to anything.
Deep Belief Networks by Guppy · 2012-11-24 17:04 · Score: 5, Informative

A lot of vague marketing-speak in this article. "Deep learning"? The article basically talks about neural networks, just one of the techniques in machine learning.
It's hard to tell from the article, but they probably are trying to refer to Deep Belief Networks, which are a more recent and advanced type of Neural Network, which incorporates many layers:

Deep belief nets are probabilistic generative models that are composed of multiple layers of stochastic, latent variables. The latent variables typically have binary values and are often called hidden units or feature detectors. The top two layers have undirected, symmetric connections between them and form an associative memory. The lower layers receive top-down, directed connections from the layer above. The states of the units in the lowest layer represent a data vector.
Automatic creation of features by michaelmalak · 2012-11-24 17:06 · Score: 4, Insightful

I wonder how much of these improvements in accuracy are due to fundamental advances
I was wondering the same thing, and just now found this interview on Google. Perhaps someone can fill in the details.
But basically, machine learning is at its heart hill-climbing on a multi-dimensional landscape, with various tricks thrown in to avoid local maxima. Usually, humans determine the dimensions to search on -- these are called the "features". Well, philosophically, everything is ultimately created by humans because humans built the computers, but the holy grail is to minimize human involvement -- "unsupervised learning". According to the interview, this one particular team (the one mentioned at the end of the Slashdot summary) actually rode the bicycle with no hands: to demonstrate how strong their neural network was at determining its own features, they did not guide it, even though it meant their also-excellent conventional machine learning at the end of the process would be handicapped.
The last time I looked at neural networks was circa 1990, so perhaps someone writing to an audience more technically literate than the New York Times general audience could fill in the details for us on how a neural network can create features.
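For readers who have not seen it, the hill-climbing picture above can be sketched in one dimension. The landscape function is invented for illustration, and random restarts are just one common trick for escaping local maxima; this is not a description of what the team in the article did.

```python
import math
import random


def landscape(x):
    """Hypothetical 1-D landscape with several local maxima; highest peak at x = 0."""
    return math.cos(3.0 * x) - 0.1 * x * x


def hill_climb(f, start, step=0.01, max_iters=10_000):
    """Greedy ascent: move to the better neighbouring point while it improves f."""
    x = start
    for _ in range(max_iters):
        best = max((x - step, x + step), key=f)
        if f(best) <= f(x):
            break  # no neighbour is better: a (possibly local) maximum
        x = best
    return x


def hill_climb_with_restarts(f, restarts=20, low=-5.0, high=5.0, seed=0):
    """Restart from random points and keep the best result found."""
    rng = random.Random(seed)
    candidates = [hill_climb(f, rng.uniform(low, high)) for _ in range(restarts)]
    return max(candidates, key=f)


single = hill_climb(landscape, start=4.0)
best = hill_climb_with_restarts(landscape)
print(f"single run from x=4.0 stops at x={single:.2f}, f={landscape(single):.3f}")
print(f"best of 20 restarts is x={best:.2f}, f={landscape(best):.3f}")
```

A single climb started at x = 4.0 gets stuck on a nearby lower peak, while the restarted version has a good chance of landing in the basin of the highest one. In this picture the "features" are the coordinates being searched over; here there is only one.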
1. Re:Automatic creation of features by Daniel+Dvorkin · 2012-11-24 18:01 · Score: 3, Insightful
  
  the holy grail is to minimize human involvement -- "unsupervised learning"
  Unsupervised learning is valuable, but calling it a "holy grail" is going a little too far. Supervised, unsupervised, and semi-supervised learning are all active areas of research.
  
  --
  The correlation between ignorance of statistics and using "correlation is not causation" as an argument is close to 1.
Re:Deep learning? by AthanasiusKircher · 2012-11-24 17:27 · Score: 3, Insightful

A lot of vague marketing-speak in this article. "Deep learning"?
Agreed. Why do we need the adjective "deep"? Perhaps it's because a lot of AI jargon uses "learning" when they really just mean "adaptive" (as in, "programmed to respond to novel stimuli in anticipated ways"), whereas normal human "learning" is much more fluid.

The article basically talks about neural networks
Yet another victory for marketing. These things have been around for at least 25-30 years, and the connection to what little we actually have deciphered about how the brain encodes, decodes, and processes information has always been incredibly tenuous. There always seem to be these AI strands of "cognitive science" or "neural modeling," which are often nothing more than somebody's pet algorithm or black box dressed up with words that make it sound like it has some scientific basis in actual neurophysiology or something.
Don't get me wrong -- I'm sure some of the examples in TFA have made great advances, partly due to speed and hardware unthinkable 25-30 years ago. And some of the functionality of the "neural nets" might give significantly better results than previous models.
But I really wish people would lay off the pretend connections to humanity. Why can't we just accept that a machine might just function better with a better program or algorithm or whatever, rather than saying that "our research in cognitive science [i.e., BS philosophy of the mind] has resulted in neural networks [i.e., a mathematical model instantiated into programming constructs] that exhibit deep learning [i.e., work better than the previous crap]."
(Please note: I mean no insult to anyone who works in neuroscience or AI or whatever. But I do question the jargon that seems to make unfounded connections and assumptions that the brain works anything like many algorithmic "models." We may succeed in creating artificial intelligence by developing our own algorithms or we might succeed by imitating the brain, but I don't think we're making progress by pretending that we're imitating the brain when we're really just using marketing jargon for our pet mathematical algorithm.)
Can their handwriting recognition solve captchas by blue+trane · 2012-11-24 17:27 · Score: 2

yet?
Re:Can their handwriting recognition solve captcha by slashmydots · 2012-11-24 17:56 · Score: 4, Funny

Humans can't even solve those, lol.
Re:Open knowledge by AthanasiusKircher · 2012-11-24 18:06 · Score: 2

We need to open all the documentation for everyone who want to learn and investigate about IA.
Absolutely. It's about time we figured out who really won those caucuses -- and what the heck is up with the ethanol subsidies?
Re:Deep learning? by Prof.Phreak · 2012-11-24 18:15 · Score: 2

Advances are in ways of learning hidden layers that are slightly more clever than backpropagation. For example, let's say you have an image, apply some transform to it (dct, wavelet, whatever, neural net layer, etc.) and save all the important features, but at say 10x less space. Then do the same to those features. Every time reducing the amount of data by 10x. After a few such layers, let's say you're left with 10 bits worth of information---the ``most important'' (according to your benchmark used) ten bits of the whole image.
The ten bits could be anything, such as `this image is a car' or `this image is a face', or `this face looks angry', etc.
The trick is applying the benchmark on the hidden layers---e.g. how do you pick out which features are important after applying a transform. For that, you train another (inverse) transform that recovers original data from the features---the one that gets you closest to the original wins (e.g. let's say you feed 1000 bits into a neural net to get 100 bits out, and then via inverse transform turn those 100 bits into the *original* 1000 bits... that would mean that your 100 bits represented all the information in the input 1000 bits---obviously more often than not you won't get a perfect match but something close---repeat for any number of layers you want).
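The transform-plus-inverse-transform idea is what is usually called an autoencoder. Below is a deliberately tiny sketch of one layer of it: 4 inputs squeezed into 2 hidden features and expanded back, trained only to reduce reconstruction error. It is linear, uses plain stochastic gradient descent, and runs on made-up data, so treat it as an illustration of the benchmark being described rather than the method from any particular paper. All names and hyperparameters are assumptions.

```python
import random


def matvec(matrix, vector):
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]


def reconstruction_error(encoder, decoder, data):
    """Mean squared error between each input and decode(encode(input))."""
    total = 0.0
    for x in data:
        y = matvec(decoder, matvec(encoder, x))
        total += sum((yi - xi) ** 2 for yi, xi in zip(y, x))
    return total / len(data)


def train_autoencoder(data, n_hidden, epochs=1000, lr=0.05, seed=0):
    """Fit a linear encoder/decoder pair by stochastic gradient descent."""
    rng = random.Random(seed)
    n_in = len(data[0])
    encoder = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_hidden)]
    decoder = [[rng.uniform(-0.1, 0.1) for _ in range(n_hidden)] for _ in range(n_in)]
    for _ in range(epochs):
        for x in data:
            h = matvec(encoder, x)   # compressed features
            y = matvec(decoder, h)   # attempted reconstruction
            err = [yi - xi for yi, xi in zip(y, x)]
            # Gradient of 0.5 * ||err||^2 with respect to the hidden features.
            grad_h = [sum(decoder[i][j] * err[i] for i in range(n_in))
                      for j in range(n_hidden)]
            for i in range(n_in):
                for j in range(n_hidden):
                    decoder[i][j] -= lr * err[i] * h[j]
            for j in range(n_hidden):
                for k in range(n_in):
                    encoder[j][k] -= lr * grad_h[j] * x[k]
    return encoder, decoder


# Made-up data: 4 inputs that only carry 2 numbers' worth of information.
data = [(1.0, 0.0, 1.0, 0.0), (0.0, 1.0, 0.0, 1.0),
        (1.0, 1.0, 1.0, 1.0), (0.5, -0.5, 0.5, -0.5)]
untrained_enc, untrained_dec = train_autoencoder(data, n_hidden=2, epochs=0)
encoder, decoder = train_autoencoder(data, n_hidden=2)
initial_error = reconstruction_error(untrained_enc, untrained_dec, data)
final_error = reconstruction_error(encoder, decoder, data)
print(f"error before training: {initial_error:.4f}, after: {final_error:.6f}")
```

Stacking would mean feeding the 2 hidden features of this layer into another encoder/decoder pair trained the same way.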

--
"If anything can go wrong, it will." - Murphy
Neural Network for Machine Learning on Coursera by Anonymous Coward · 2012-11-24 18:32 · Score: 4, Informative

I'm doing Prof. Hinton's course on Neural Networks on Coursera this semester. It covers the old school stuff plus the latest and greatest. From what I gather from the lectures, training neural networks using lots of layers hasn't been practical in the past and was plagued with numerical and computational difficulties. Nowadays, we have better algorithms and much faster hardware. As a result we now have the ability to use more complex networks for modelling data. However, they need a lot of computational power thrown at them to learn compared to other machine learning algorithms (e.g. random forests). The lecture quotes training taking days on an Nvidia GTX 295 GPU to learn the MNIST handwritten digit dataset. Despite this, the big names are already using this technology for applications like speech recognition (Microsoft, Siri) and object recognition (Google Cat video, okay that's not a real application yet).
1. Re:Neural Network for Machine Learning on Coursera by IamTheRealMike · 2012-11-25 00:26 · Score: 2
  
  Actually, Google has already launched neural network based speech recognition. The cat demo was for fun, the underlying technology is already applied to real problems though. I can tell you now based on practical experience as a user that the accuracy boost from it has been amazing. The dictation feature in Android went from being "amusing toy" to "actually useful" almost overnight.
Re:Deep learning? by Black+Parrot · 2012-11-24 18:35 · Score: 3, Informative

Why do we need the adjective "deep"?
Because the "deep learning" technologies use artificial neural networks with many more layers than traditionally, making them "deep architectures".
It's widely accepted that the first hidden layer of an ANN serves as a feature detector (possibly sub-symbolic features that you can't put a name to), and each successive layer serves as a detector for higher-order features. Thus the deep architectures can be expected to have some utility for any problem that depends on feature analysis.

--
Sheesh, evil *and* a jerk. -- Jade
Re:Deep learning? by AthanasiusKircher · 2012-11-24 18:55 · Score: 2

I completely agree that you've justified the use of the adjective "deep" in regard to "deep architectures" (and I got that before writing my post). I still don't get how this "deep" has much to do with "learning," though, in the broader world... and even if we equate the jargony connotations of "machine learning" with "learning," it still seems a stretch to use "deep" as an adjective directly applied to that... but perhaps it's just me.
Just more of the same by qbitslayer · 2012-11-24 19:03 · Score: 2

They haven't done anything that wasn't already being done by others. They're just doing more of it. Essentially, the approach consists of using Bayesian statistics and a hierarchy of patterns. Prof. Hinton pretty much pioneered the use of Bayesian statistics in artificial intelligence. With a rare notable exception (e.g. Judea Pearl), the entire AI community has jumped on the Bayesian bandwagon, not unlike the way they jumped on the symbolic bandwagon in the latter half of the 20th century, only to be proven wrong fifty years later.
The Bayesian model essentially assumes that the world is inherently probabilistic and that the job of an intelligent system is to discover the probabilities. A competing model (see links below), by contrast, assumes that the world is perfectly consistent and that the job of an intelligent system is to capture this perfection.
See The Myth of the Bayesian Brain and The Second Great AI Red Herring Chase if you're interested in an alternative approach to AI.
1. Re:Just more of the same by qbitslayer · 2012-11-25 07:24 · Score: 2
  
  Do you have *actual* arguments comparing Bayesian to these hypothetical alternatives
  The argument is simple. As Judea Pearl (an early proponent of Bayesian statistics for AI who has since changed his mind) explained, humans are not probability thinkers; they are cause/effect thinkers. If you drop a ball, you know it's going to hit the ground. You don't think that there is a probability that it might not. If you read the word Bayesian in this sentence, you know for certain that you did. There is nothing probabilistic about it. Sure we handle probabilistic sensory signals but we build a perfect model of the world in our cortical memories. We simply compare incoming sensory inputs to our perfect internal model and decide which patterns in memory best fit the sensory evidence. This truth will be forcefully demonstrated in the not distant future. Wait for it.
  The Bayesian brain is a myth, a rather dumb one in retrospect.
Re:Deep learning? by AthanasiusKircher · 2012-11-24 19:16 · Score: 3, Interesting

It looks like you are seeing something that is not there. The majority of neural network research is about developing new and/or improved algorithms to solve problems, not to say anything about how the human brain works.
As someone who has read a lot of the founding literature of modern cognitive science and the philosophy of mind in the 1950s through 80s, which was hugely influential in setting up the early approaches to AI (including neural nets), I have to say -- this is where the stuff came from.
And frankly, in a lot of applications in more obscure disciplines, such as AI analysis in the humanities, researchers are still making claims about these models and their relationships to the actual brain. Hell, just a few years ago I heard a leading cognitive scientist claim that he found evidence for a sort of musical "circle of fifths" neural network in an actual circular physical structure of neurons in the brain... a made-up musical model grafted onto a made-up AI brain model, supported by noisy data... I admit this is an extreme example, but it's not unique.
I understand that modern researchers in "pure" AI may want to avoid recognizing the history or the implications of the terminology -- but there's a reason why the Starship Voyager was equipped with "neural gel-packs" that could get anxious and cause a warp-core breach at a temporal anomaly... words like "neural" actually mean something, and these "neural nets" have about as much connection to the biological function of actual neurons as Voyager's bizarre "neural gel-packs." Yet the implicit metaphor made in continuing to use the term should not be underestimated, not just in a general audience NYT article, but in the way fields are subtly shaped by their nomenclature.
It's both by Anonymous Coward · 2012-11-24 19:19 · Score: 5, Interesting

In the past few years, a few things happened almost simultaneously:
1. New algorithms were invented for training of what previously was considered nearly impossible to train (biologically inspired recurrent neural networks, large, multilayer networks with tons of parameters, sigmoid belief networks, very large stacked restricted Boltzmann machines, etc).
2. Unlike before, there's now a resurgence of _probabilistic_ neural nets and unsupervised, energy-based models. This means you can have a very large multilayer net (not unlike e.g. visual cortex) figure out the features it needs to use _all on its own_, and then apply discriminative learning on top of those features. This is how Google recognized cats in Youtube videos.
3. Scientists have learned new ways to apply GPUs and large clusters of conventional computers. By "large" here I mean tens of thousands of cores, and week-long training cycles (during which some of the machines will die, without killing the training procedure).
4. These new methods do not require as much data as the old, and have far greater expressive power. Unsurprisingly, they are also, as a rule, far more complex and computationally intensive, especially during training.
As a result of this, HUGE gains were made in such "difficult" areas as object recognition in images, speech recognition, handwritten text (not just digits!) recognition, and in many more. And so far, there's no slowdown in sight. Some of these advances were made in the last month or two, BTW, so we're speaking about very recent events.
That said, a lot of challenges remain. Even today's large nets don't have the expressive power of even a small fraction of the brain, and moreover, the training at "brain" scale would be prohibitively expensive, and it's not even clear if it would work in the end. That said, neural nets (and DBNs) are again an area of very active research right now, with some brilliant minds trying to find answers to the fundamental questions.
If this momentum is maintained, and challenges are overcome, we could see machines getting A LOT smarter than they are today, surpassing human accuracy on a lot more of the tasks. They already do handwritten digit recognition and facial recognition better than humans.
Re:Deep learning? by Black+Parrot · 2012-11-24 19:24 · Score: 2

I completely agree that you've justified the use of the adjective "deep" in regard to "deep architectures" (and I got that before writing my post). I still don't get how this "deep" has much to do with "learning," though, in the broader world... and even if we equate the jargony connotations of "machine learning" with "learning," it still seems a stretch to use "deep" as an adjective directly applied to that... but perhaps it's just me.
I have a bigger issue with "learning" than with "deep", since with very few exceptions ANNs don't learn anything autonomously, but rather are adjusted by an external algorithm to perform well on a given problem. "Deep training" would make sense for "deep architectures".

--
Sheesh, evil *and* a jerk. -- Jade
Yes, but... no. by Anonymous Coward · 2012-11-25 03:22 · Score: 5, Interesting

This is a very misleading metric. First, some not-insignificant number of the neurons in the brain are involved in non-cognitive computations. Muscle control, hormone regulation, kinesthesia, vision (not thinking about what is seen, but simply recognizing it), heart rates and other system regulation and so on.
Examples also exist of low-neuron (and synapse) count individuals who retain cognitive (and all other major) function; these examples cannot be explained away by "counting neurons."
We don't know which yet, but given that high neuron count has been ruled out as the single way to accommodate intelligence, we do know that we need to look to other mechanisms for human cognition. Structure, algorithm, other features known or unknown may be responsible for intelligence; and it may be that something entirely disjoint is responsible for the rise of intelligence; but we know it isn't simply high neuron count.
--fyngyrz (anon due to mod points)
1. Re:Yes, but... no. by Mr.+Slippery · 2012-11-25 05:01 · Score: 2
  
  Examples also exist of low-neuron (and synapse) count individuals who retain cognitive (and all other major) function; these examples cannot be explained away by "counting neurons."
  The example you cite shows images of a compressed brain. It says nothing about "counting neurons"; the person in question could have roughly the same number of neurons as you or I compressed into a smaller space.
  Also the guy is said to have an IQ of 75. That's "borderline intellectual functioning", and it's incorrect to say that cognitive function has been retained; it's clearly degraded.
  Of course it takes more than a high number of neurons and a high degree of interconnection to perform processing; training the network is vital.
  
  --
  Tom Swiss | the infamous tms | my blog
  You cannot wash away blood with blood
Re:Deep learning? by ceoyoyo · 2012-11-25 03:31 · Score: 2

Actually, it seems your post is the vague one. "normal human "learning" is much more fluid." What does that mean?

Learning: (dictionary.com)
1. knowledge acquired by systematic study in any field of scholarly application.
2. the act or process of acquiring knowledge or skill.
3. Psychology. the modification of behavior through practice, training, or experience.
Many machine learning algorithms "learn" exactly the way you'd teach a child. They see examples, you tell them what the object, word, etc. is, and they remember that answer imperfectly. Repetition improves their accuracy and a breadth of examples improves their generality. After not seeing something for a while, they may forget it.
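To make the "see examples, get told the answer, repeat" loop concrete, here is a minimal sketch, assuming a single-layer perceptron rather than a deep network. The names train_perceptron, predict and AND_EXAMPLES are made up for illustration, and the forgetting mentioned above is not modeled.

```python
# Toy supervised learning: show labeled examples repeatedly and nudge
# the weights whenever the guess is wrong (classic perceptron rule).
# Illustrative only; not taken from any system discussed in the article.

AND_EXAMPLES = [
    ((0, 0), 0),
    ((0, 1), 0),
    ((1, 0), 0),
    ((1, 1), 1),
]


def predict(weights, bias, features):
    """Return 1 if the weighted sum is positive, else 0."""
    total = bias + sum(w * x for w, x in zip(weights, features))
    return 1 if total > 0 else 0


def train_perceptron(examples, epochs=10, lr=1.0):
    """Train on (features, label) pairs; return weights, bias, and the
    number of mistakes made in each pass over the examples."""
    weights = [0.0] * len(examples[0][0])
    bias = 0.0
    history = []
    for _ in range(epochs):
        mistakes = 0
        for features, label in examples:
            error = label - predict(weights, bias, features)
            if error != 0:
                mistakes += 1
                # "Tell it the answer": move the weights toward the label.
                weights = [w + lr * error * x
                           for w, x in zip(weights, features)]
                bias += lr * error
        history.append(mistakes)
    return weights, bias, history


if __name__ == "__main__":
    w, b, history = train_perceptron(AND_EXAMPLES)
    print("mistakes per pass:", history)
    print("predictions:", [predict(w, b, x) for x, _ in AND_EXAMPLES])
```

The per-pass mistake count is not guaranteed to fall on every pass, but for linearly separable data like this it eventually reaches zero, which is the "repetition improves accuracy" part.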
As the other poster pointed out, "deep" describes algorithms that are better able to teach multi-level systems. The changes associated with learning are better propagated to deeper levels, better utilizing all the capacity of the system.
No, it's not just you. There are a lot of people who see the brain as the last bastion of their identity as some kind of special and privileged creature, therefore it must be magical and any attempts to explain how it works are misguided, childish and silly. Whether that's your actual belief or not, that's what your post sounds like. Modern computational neuroscience has actually come a long way. We're even capable of producing chips that can be implanted and replace some parts of the brain. It's not magic.
Restricted Boltzmann Machine by Fnord666 · 2012-11-25 03:38 · Score: 3, Informative

Here is a good video of a talk given by Dr. Hinton about Restricted Boltzmann Machines. It is a very promising technique for deep learning strategies.

--
'The tyrant will always find pretext for his tyranny.' - Aesop's Fables
Re:Deep learning? by mbkennel · 2012-11-25 15:04 · Score: 2

"The term "deep" comes from the idea that the algorithm is trying to learn something deeper than previous algorithms. In fact, the usual set of machine learning algorithms are termed shallow learning now. The difference is that deep learning tries to model P(X) whereas shallow learning (SVM, NN, naive Bayes, etc..) try to learn P(X|Y) where X is your input space and Y is the label space. "
Well, more correctly, deep learning tries to model P(X) as P(X | H) for some set of "hidden" or latent features H, which in some ways is far simpler than the raw data and space of X, and then learns P(Y | H, X) after doing some training for P(X | H).
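One way to write that down, as a sketch only: the prior P(H), the posterior P(H | X), and the sums over a discrete H are assumptions added here to make the factorization explicit; the description above only names P(X | H) and P(Y | H, X).

```latex
\begin{align}
P(X) &= \sum_{H} P(X \mid H)\, P(H) \\
P(Y \mid X) &= \sum_{H} P(Y \mid H, X)\, P(H \mid X)
\end{align}
```

The first line is the unsupervised stage (fit the latent-feature model of the inputs); the second is the supervised stage, which reuses the learned H when predicting the label Y.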
Re:Deep learning? by mbkennel · 2012-11-25 15:08 · Score: 2

"The majority of neural network research is about developing new and/or improved algorithms to solve problems, not to say anything about how the human brain works."
This isn't true---or the connotation of it isn't true. I don't know how to quantify "majority", but there is substantial interest in computational modeling of actual biology across all levels of biological/chemistry fidelity and attention to engineering and statistical problems.
A glance at the work in the journal _Neural Computation_ shows papers both on entirely theoretical statistical computational models and models much more closely tied to experimental neuroscience results.
Many people want to do both: derive useful methods for solving engineering problems and understand biological systems which have shown such abilities. The field is very difficult and deep.