Slashdot Mirror


Toddlers May Learn Language By Data Mining

Ponca City, We Love You writes "Toddlers' brains can effortlessly do what the most powerful computers with the most sophisticated software cannot: learn language simply by hearing it used. A ground-breaking new theory postulates that young children are able to learn large groups of words rapidly by data-mining. Researchers Linda Smith and Chen Yu attempted to teach 28 children, 12 to 14 months old, six words by showing them two objects at a time on a computer monitor while two pre-recorded words were read to them. No information was given regarding which word went with which image. After viewing various combinations of words and images, however, the children were surprisingly successful at figuring out which word went with which picture. Yu and Smith say it's possible that the more words tots hear, and the more information available for any individual word, the better their brains can begin simultaneously ruling out and putting together word-object pairings, thus learning what's what. Yu says if they can identify key factors involved in this form of learning and how it can be manipulated, they might be able to make learning languages easier for children and adults. Understanding children's learning mechanisms could also further machine learning."

21 of 213 comments (clear)

  1. Interesting, but... by gujo-odori · · Score: 5, Informative

    ...I'm not quite sure it's going to change how we think about learning, as they state in TFA. I majored in linguistics, and even way back then, it was well understood by researchers in language acquisition that context played a significant role in both first and second language acquisition, but especially first. A form of data mining may well be part of the mechanics of what was happening in the experiment, but the whole way it was set up, and the way the subjects figured out what word went with what picture, had a lot to do with context. I don't mean to put down their research - this is really quite interesting - but it's also not quite the huge deal TFA seemst o suggest it is.

    1. Re:Interesting, but... by linest · · Score: 5, Interesting

      What they've done is taken the same old thing that wasn't clearly understood and put the label "data mining" on it. Now that it's been labeled, some will feel like we've got a better handle on it than we did before.

    2. Re:Interesting, but... by mrxak · · Score: 4, Interesting

      Yeah, isn't it a lot of almost random trial-and-error, paying attention to non-verbal clues? Like, when a baby smiles, it gets a lot of attention. When a baby manages to put together something simple like ma-ma or da-da, suddenly there's happy parents all jumping up and down with excitement.

    3. Re:Interesting, but... by TapeCutter · · Score: 4, Insightful

      "what seemed far more interesting to me is why when we reach a certain age it becomes significantly harder to acquire languages"

      Someone who is raised with a single language does not even hear certain sounds in other languages because their brain has long since rejected those sounds as irrelevant 'noise'. The same thing applies to vision, a baby sees every meercat face as different but adults don't (without a lot of practice).

      A babies brain actually loses a lot of connectivity between neurons in the first year of life (not so much data minning as connection breaking/forming). In other words we are all programmed by our early environment to exclude irrelevant stimuli, hacking into that 'code' later in life can be extremely difficult.

      --
      And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
    4. Re:Interesting, but... by jp25666 · · Score: 4, Insightful

      That might be part of it, but that's definitely not the whole story. In particular, there are some language errors that babies simply do not make. Likewise, there is a general pattern that all babies follow when acquiring a language. These aspects of acquisition cannot be explained by positive reinforcement alone: they are a result of general cognition or the language faculty, or they are somehow an artifact of the human language learning algorithm.

    5. Re:Interesting, but... by rucs_hack · · Score: 5, Interesting

      When a baby manages to put together something simple like ma-ma or da-da, suddenly there's happy parents all jumping up and down with excitement.

      Actually, the first sound (aside from crying) that a baby is capable of forming is the sound 'ma', and subsequently 'ma-ma'. Unfortunately, all those mothers who believe their child is referring to them are mistaken, although the term rapidly becomes associated with mother anyway, so it gets to be true after a while.

      It should be obvious really, how else would every child ever born (that could vocalise) select the same sound?

      I'm less sure about da-da. I know 'da' is another sound that a child can form earlier, but that's all.

    6. Re:Interesting, but... by gujo-odori · · Score: 5, Interesting

      I give a little more weight to the "ma ma" = "baby is talking to mom" than that. The reason why is my first daughter's first word as an infant was the Vietnamese world for "dad" ( b). She didn't start using the word for "mother" (m ) until much later. Coincidence? Possibly, but she had the tone correct as well, not just the consonant+vowel sound, which is a stronger argument for actual speech rather than coincidence. Additionally, she would say it only when she saw me, not at other times.

    7. Re:Interesting, but... by potpie · · Score: 4, Interesting

      It's actually very interesting how big a role context plays. If a child sees a ball under a table and hears adults referring to it as a ball, the child knows that it is a ball. However, the child cannot be sure exactly what makes it a ball. Does it have to be shiny? Does it have to be round? Does it have to be under a table? Does it have to be in the daytime? Does it have to be a certain color? Does it have to be positioned in a certain way? Does it need to be a certain size? Since the child does not have all this information, overextensions occur. For instance, a child may refer to a dog as a rug because he thinks "rug" means "something furry." Meaningful input is also a huge part of the acquisition mechanism, as you say, but it goes beyond emotional reaction. Actually, children are resistant to correction. If your child keeps making a mistake over and over again, instruction will not help, only time and hearing the correct usage enough. This goes along with the "Active Construction of a Grammar" model.

      --
      Esoteric reference.
  2. Interesting by chuckymonkey · · Score: 5, Insightful

    This is why I've never talked to either of my children in "baby talk". I've always talked to them like they're adults (minus swearing and things like that) and as a result my eldest talks like a six or seven year (she's two) and my youngest... well she's just a few months old but she knows mom and dad. It really is interesting to see the difference between the children that aren't expected to speak and those that are. My eldest has never gotten away with pointing and grunting for things, she always had to at least try to say what she wanted and we'll do the same for the baby when she's around the right age. What kills me though is that the eldest is starting to use sarcasm.... that just blows my mind when she does it. Children's minds are the most amazing things, when people say sponge that doesn't even begin to describe it. Given a lot of patience and a lot of work from the parents children can learn at incredible rates. I only wish that I knew more languages so that I could teach them at this young age so they'd be fluent, I'm really considering taking a job in Europe partially for that reason and cultural exposure for them.

    --
    "Some books contain the machinery required to create and sustain universes."-Tycho
    1. Re:Interesting by elloGov · · Score: 5, Funny

      That's awesome man! I just learned "Da Da" two weeks ago and I am 20 something. I guess the time was just right.

    2. Re:Interesting by kongit · · Score: 5, Funny

      hmm when I somehow come by a kid I will only talk to him in C. Eventually I will start talking in C++ to him. And lastly I will recite perl poetry. After he has mastered these 3 things I will introduce him to shakespeare. hmm maybe I won't be such a good parent, but my kid will be able to hack your kid's computer.

    3. Re:Interesting by Anonymous Coward · · Score: 5, Funny
      I will start talking in C++ to him.

      I'm pretty sure that qualifies as child abuse.

    4. Re:Interesting by xigxag · · Score: 4, Interesting

      I don't know any of the science in this area, but since everywhere I've been in the world, across languages and cultures, parents seem to speak "baby talk" to their kids, I would guess that it has some purpose, evolutionarily speaking. I'm not saying you're doing anything wrong, I'm just saying don't be so sure it is a superior method. Also, I'd venture a guess that at some point, your little ones will more or less know English and that will be that. And eventually other kids will catch up to their level, and maybe surpass them.

      My GF's nephew grew up in a Spanish-speaking household and was basically fluent at 4. But now, at age 13, he seems to have mostly forgotten it in favor of his dominant language, English. Same thing happened with a GF I had when I was much younger. Kids have a tremendous ability to learn things. But also to utterly forget them.

      --
      There are two kinds of people: 1) those who start arrays with one and 1) those who start them with zero.
    5. Re:Interesting by mrogers · · Score: 4, Funny

      "Dada" is easy, it's "surrealism" that trips a lot of kids up.

  3. Effortless? by krazytekn0 · · Score: 5, Insightful

    Have you ever watched a toddler try to talk? Nothing about learning to talk is "effortless" anyone who says so either not a parent, or not thinking clearly.

    --
    Not all life is cyber. Extra Income
  4. no scientific content here by nguy · · Score: 4, Informative

    Data mining is just a new word for discovering statistical associations in data. Of course, children learn words by learning statistical associations between images and speech sounds; that's pretty much a tautology. I mean, what's the alternative? Divine inspiration? Toddlers running around with dictionaries?

  5. Mining but not for data by GodfatherofSoul · · Score: 4, Funny

    My sister had a kid a year ago and the only mining I ever see him doing is in his nose.

    --
    I swear to God...I swear to God! That is NOT how you treat your human!
  6. Taking it a step further. by niktemadur · · Score: 4, Interesting

    Anyone here familiar with the Nicaraguan school for deaf-mute children in the early eighties?

    The first phase of the project was to teach these children the sign-alphabet. After this, I'm not sure if they were going to teach the full english or spanish sign-language (seems there's not an international standard for sign-language), but the point is that after a year, the experiment was deemed a failure and abandoned.

    Then a couple of years later, reports started trickling out of these deaf-mute children exchanging unintelligible gibberish with their hands. A couple of researchers flew in, and were astonished to discover that these kids, using the sign-alphabet as a starting point, had developed a complete, unique language of their own in just two or three years - the first ever documented report of a fully formed, structured language bursting spontaneously into existence. These children are, of course, now adults in their thirties, still in touch with each other and communicating amongst themselves in the language they invented three decades ago.

    And now, for something completely different...

    Terrence McKenna, that lovable old psychonaut, postulated an empirical assumption in the eighties and nineties - language was created over many generations, via deep psilocybin trance rituals, of which the whole tribe partook. One by one, abstract concepts emerged in the back and forth play between members of the tribe, led and refereed of course by the shaman.

    The Nicaraguan kids have poked serious holes into McKenna's whimsical idea. As it turns out, children can develop fully formed languages almost overnight! And so, with concrete data, a new possibility has arisen - languages burst upon the world from the mouths of children, and never mind the psychedelic substances.

    --
    Lil' Thindime, lilting a lacrimose lament, krashes the kwaint konfines of Kokonino Kounty
  7. Multiple languages by Max+Romantschuk · · Score: 5, Interesting

    I'm a parent in a bilingual family. (Finnish & Swedish, two fundamentally different languages.) One of the more interesting things is the way my kids pick up grammar. I speak Swedish to the kids (my first language) and my wife speaks Finnish. The kids (even our younger one and a half year olds) understand both languages more or less perfectly, but they do tend to mess up grammar and sometimes words between them. Every now and then they use the grammar of one language to conjugate a word from the other. It's all pretty interesting.

    But I personally believe that the human brain does a hell of a lot more data mining than we give it credit for. There's a damn good reason why things seem clearer after a good night's sleep. The human brain is designed for massively parallel information processing, and we can't possibly handle it all in a conscious processing context. A lot happens behind the scenes. I'm guessing it's going to be quite some time still until we can fully understand the "inner workings" of the human brain.

    --
    .: Max Romantschuk :: http://max.romantschuk.fi/
  8. Rosetta Stone by KermodeBear · · Score: 5, Interesting

    This is how the Rosetta Stone software works, if anyone was ever curious. Several pictures are shown with a phrase in the foreign language - no translation at all. You have to pick the right one. It goes through permutations of the phrase with different pictures and you eventually learn what each of the words means. It's very effective, much better than the rote memorization that I had to do in school.

    --
    Love sees no species.
  9. Child language acquisition by backformed · · Score: 4, Interesting

    This isn't proof that children acquire language by some sort of data mining process.

    When children start coming up with overregularizations like "goed" instead of "went" or "playses" in place of "plays," that kind of attempt at applying regular morphological rules to irregular items, is when you might say they are acquiring language via data mining. I.e., they hear a form used often enough that it becomes part of their knowledge about words, to the extent that that form is unconsciously applied even to make words they have certainly never heard in adult speech before.

    (Disclaimer:
    1. I will graduate this May with a B.A. in linguistics.
    2. First language acquisition is not wholly understood as of yet, but suffice it to say that it's more complicated and there are many more factors involved than the article makes it seem.
    3. Sorry if I'm misunderstanding what they mean by "data mining.")