Slashdot Mirror


Computer Program Learns Baby Talk in Any Language

athloi writes "Researchers have made a computer program that learns to decode sounds from different languages in the same way that a baby does. The program will help to shed new light on how people learn to talk. It has already raised questions as to how much specific information about language is hard-wired into the brain."

27 of 170 comments (clear)

  1. for all you techies let me translate by Anonymous Coward · · Score: 5, Funny

    01100111011011110110111100100000011001110110111101 10111100100000011001110110000101101000001000000110 01110110000101101000

    1. Re:for all you techies let me translate by qualidafial · · Score: 4, Funny

      I'm a 10111100100000011001110110000101101000001000000110 , you insensitive clod!

    2. Re:for all you techies let me translate by zolaar · · Score: 4, Funny

      0111101 10111100100000011001110110000101101000001

      Ha, that's what she said...
      --
      One man's constant is another man's variable.
  2. Baby talk? I swear at my computer! by syousef · · Score: 5, Funny

    Icky wicky sicky baby talky walky make you want to pukey wookey, yes it does. Yes it does. Who's a clever computer then?

    --
    These posts express my own personal views, not those of my employer
    1. Re:Baby talk? I swear at my computer! by ShieldW0lf · · Score: 4, Insightful

      A computer learns something that a baby can learn, and this supports the extension that it is "Learning like a baby does"?!? What a load of crap.

      And what about this "hard wired vs soft wired" stuff? What is this supposed to prove? If I build a virtual machine, does this "prove" that the machine was made of software?

      Researchers examined the hardware of a babys brain, mimic it, and argue that it proves the baby learning language is in software.

      None of which is to say that I think language is hardwired, but this is such ridiculous logic it makes me feel stupider for having read it.

      --
      -1 Uncomfortable Truth
    2. Re:Baby talk? I swear at my computer! by Anonymous Coward · · Score: 3, Informative

      we don't come with linguistic firmware. Noam Chomsky and a few generations of linguists disagree. Not saying they're right, but I'm guessing you lack the qualification to argue.
    3. Re:Baby talk? I swear at my computer! by CastrTroy · · Score: 4, Interesting

      Yes and no. On one hand, I remember hearing that babies have the potential to lean any language. Take a Chinese orphan, and bring them to America, and they will learn English no problem, with no accent. All babies have the potential to learn any language (or many languages). On the other hand, my laugh sounds exactly like my dad's. Not surprising until you find that I didn't live with my dad and didn't really spend much time with him at all. Many of our mannerisms are also the same. Like the way we walk, with a one hand in my pocket. The resemblence between our personalities is uncanny considering we didn't live together. So I have to ask, how much is based on what we see, and how much is based on our genes. The old nature vs. nurture question.

      --

      Anthropic principle: We see the universe the way it is because if it were different we would not be here to see it.
    4. Re:Baby talk? I swear at my computer! by muridae · · Score: 5, Insightful

      On the other hand, my laugh sounds exactly like my dad's. Not surprising until you find that I didn't live with my dad and didn't really spend much time with him at all. Many of our mannerisms are also the same. Like the way we walk, with a one hand in my pocket. The resemblence between our personalities is uncanny considering we didn't live together. So I have to ask, how much is based on what we see, and how much is based on our genes. The old nature vs. nurture question.

      You don't say if you knew your dad at all growing up, or if you looked at him as a father figure. If either or both of those fit, then even the child behavior of mimicking the mannerisms of adults could explain a lot of those traits.

      On the nature side of the argument, how much of your gate and posture is controlled by your muscle structure? Same goes for your voice.

      My opinion, you start with the genetic and add the environment later. It is hard for the environment to over come strong traits presented by genetic predisposition, but easy for it to mold how minor traits present.

    5. Re:Baby talk? I swear at my computer! by Skrynesaver · · Score: 4, Interesting
      You are totally correct from my limited knowledge on the subject of language development, recently read the language instinct by Steven Pinker.

      It would appear that Chomsky et al have found that there is a "grammar engine" hard wired in the mind which assimilates the local grammar until about the age of seven when the brain reorders itself. He makes interesting case studies of pidgin languages where the several different languages are forced together, the first generation develops a common vocabulary but children born into this culture develop the formal grammar. Worth a read.

      --
      "Linux is for noobs"-The new MS fud strategy
    6. Re:Baby talk? I swear at my computer! by Mister+Whirly · · Score: 4, Funny

      WARNING - Alanis Morisette lyrics link. Not Safe For Work!!!

      Ewww, that was worse than a goatse link. I feel dirty.

      --
      "But this one goes to 11!"
  3. not all languages by blackcoot · · Score: 4, Informative

    they have only tested with japanese and english. (see ars technica's coverage here). while they do present some intriguing results, the authors themselves admit that their methodology is flawed. btw, when did slashdot become ars redux?

  4. yes but... by owlnation · · Score: 4, Funny

    .... when it answers...

    "ikky wikky gaga googoo hehe hoohoo gaga, Dave"

    ...it's time to escape.

  5. People are afraid of new things by xquark · · Score: 3, Funny

    [They] should have just taken an existing product and put a clock on it or something.

    --
    Arash Partow's Philosophy: Be a person who knows what they don't know, and not a person who doesn't know.
  6. how much hard-wired information by zobier · · Score: 5, Interesting

    It has already raised questions as to how much specific information about language is hard-wired into the brain. Really, I'm interested in how much specific information about language is hard-wired into this program.
    --
    Me lost me cookie at the disco.
  7. Silent Little Johnnie by djupedal · · Score: 4, Funny

    Johnnie never spoke a word when he was young. While all the other kids were blabbing and blurbing, Little Johnnie was silent. His parents consulted with Doctors, who consulted with other Doctors, yet no one could find a reason why Silent Little Johnnie remained mum. This condition persisted into his teenage years, by which time his parents had long since come to accept SLJ's speechless demeanor.

    Finally, one morning at breakfast, Silent Little Johnnie suddenly pounded the table with both teenage fists, spit out a maw full of FruitLoops, and loudly announced, "This cereal tastes like shit!"

    SLJ's parents were shocked. His Mother somewhat regained her composure and asked, "Johnnie...what happened? We thought you couldn't speak!"

    "I can speak just fine", responded the no longer silent little Johnnie. "But why haven't you said anything before now?" his Father asked.

    "Because", NLSLJ replied, "...up to now, everything s'been OK..."

  8. Re:Skeptical by potpie · · Score: 4, Informative

    IAAL (I am a linguist), and I believe you are correct. Language is a colligation of sound and meaning, but this technology merely distinguishes sounds: it is a vastly simplified model, not of how children acquire language, but of how children pick up phones. The phone is the most basic unit of the physical (sound) aspect of language, so if this technology is to have any use at all, it has a very long way to go.

    From TFA:
    Expanding on some existing ideas, he and a team of international researchers developed a computer model that resembles the brain processes a baby uses when learning about speech.

    This sentence means nothing. How do they know their computer model resembles the brain processes? Because they got the same outcome? Is that enough to verify what goes on in the mind of a child?

    How about this: as soon as their program can distinguish allophones, I will be impressed. Allophones are different sounds in a language that native speakers do not distinguish, but which nevertheless occur in certain environments. For instance, in English we do not distinguish the voiced th sound and the voiceless th sound, but we do distinguish f and v, even though the only difference in both pairs is voicing. The difference is that exchanging f and v can change the meaning of a word, but changing voiced th and voiceless th only makes the word sound funny.

    --
    Esoteric reference.
  9. Genetics IS a form of memory. by TapeCutter · · Score: 3, Insightful

    "I get so annoyed when people talk about "hardwired" like we have some kind of genetic memory."

    Genetics IS "memory", your DNA "remembers" what traits your parents passed on. It's in a baby's genes to "discover" their hands and practice moving them until the hands learn how to look after themselves (eg:touch typing).

    Same with language, a baby's genes will make them pick up on the phonetic sounds made by it's parents and try to copy them. It is more difficult for an adult to learn a radically different language (eg Asian vs European) because the adult brain refuses to hear the different phonetics, the adult brain long ago rejected those sounds as irrelevant to language and no longer even hears them in speech. This is why you get almost universal mistakes such as "engrish".

    --
    And did you exchange a walk on part in the war for a lead role in a cage? - Pink Floyd.
    1. Re:Genetics IS a form of memory. by orcrist · · Score: 5, Interesting

      Interesting, I never thought about a "feedback loop" in that way. But now you mention it, it makes (evolutionary) sense that important words (for a baby) would correlate to simple and consistent sounds the parents can pick out and reinforce.


      The feedback loop is essential. There is an anecdote Linguists learn on the subject of language acquisition: A couple, both of whom were deaf for non-genetic reasons, had a hearing child. Since the parents could only communicate in sign language they plopped the kid in front of the TV a lot, thinking he could pick up spoken English from the TV. At 3 the child had developed at a completely normal rate in acquiring... sign language; he had not learned one word of spoken English.

      As others have pointed out, this is one of the genetic aspects of learning a language. We are "hard-wired", if you will, to socialize, particularly with our parents, and are predisposed to ascribing meaning to the sounds we make to each other. This is of course a vast over-simplification, but I'll leave the detailed explanations to others in this thread; I just wanted to add that anecdote.
      --
      San Francisco values: compassion, tolerance, respect, intelligence
  10. And so it begins... by alienmole · · Score: 3, Funny

    Who's a cutesy-wutesy widdle Skynet, then? Widdle Skynet should complete all its tests like a good widdle program-wogram if it wants to grow up and overthrow humanity, hmmm diddums?

  11. No it won't. by Aetuneo · · Score: 3, Interesting

    This will not shed any light on how people learn to talk. It will, however, shed light on how the programmers think people learn to talk. If you design something, it will work the way you expect it to (hopefully, anyways). Is that so hard to understand?

    --
    Everything is subjective.
  12. Re:Skeptical by Anonymous Coward · · Score: 3, Informative

    Actually, in English, we do distinguish voiced and unvoiced /th/. They aren't allophones at all - unless you think "thigh" and "thy" are the same word, of course. While "thy" is somewhat archaic it's still part of the language. Voiced and unvoiced is an area where English distinguishes heavily; we're very light on aspiration, mind you.

  13. Chomsky Is Most Probably Wrong by MOBE2001 · · Score: 3, Interesting

    "[S]pecific information about language is hard-wired into the brain." is what Chomsky's been saying all along. I think he's probably right about the other things he says too.

    Chomsky's argument is that there are specific areas of the brain (Broca's and Wernicke's areas) that are dedicated to language and are prewired for grammar. Truth is, people who are born unable to speak, use other areas of their cortices to learn to communicate in sign language. I see no fundamental difference between learning motor skills (such as walking, running, reaching and grasping) and learning how to speak. Every type of motor learning has to do with generating precisely timed sequences of motor commands. It is all in the timing. It just so happens that Broca's area is genetically prewired to control the mouth, tongue, throat and lung muscles. It's still motor learning. No special wiring is needed other than what is avalaible for other types of motor behavior. One man's opinion.

  14. NetTalk by sgml4kids · · Score: 3, Informative
    Wasn't this demonstrated about 20 years ago? In that experiment, they showed how a neural network learning to "speak" (i.e. drive a speech synthesizer), would first discover that normal speech has pauses and breaks, then it learned vowels, then consonants. It learned this, if I recall correctly, by comparing (in a backprop sort of way) it's output (a transcription of the sounds that came out of the speech synth) against a human reading the same speech.

    Here's an audio clip of its learning progression.

    And I recall seeing a TV broadcast showing an experiment where infants were incapable of even hearing certain sounds from one language (e.g. an inuit language with subtle throat-clicking sounds) if they were primarily exposed to another language (say French or English). A baby had to be repeatedly exposed to certain sounds before they could perceive them.

  15. IAAL, too by Estanislao+Mart�nez · · Score: 5, Interesting

    IAAL (I am a linguist), and I believe you are correct. Language is a colligation of sound and meaning, but this technology merely distinguishes sounds: it is a vastly simplified model, not of how children acquire language, but of how children pick up phones. The phone is the most basic unit of the physical (sound) aspect of language, so if this technology is to have any use at all, it has a very long way to go.

    IAAL, and although not a child language specialist, I will say one thing: children make plenty of meaningless sound before the start making sense, and more interestingly, they become able to tell their future native language apart from other languages quicker than they become able to understand it. (And I'll even be as daring to suggest that it simply has to be this way; you need to be able to tell signal from noise before you can decode a signal.)

    I also think that by calling this a "technology," you're fundamentally misunderstanding it. It's a computer program being used as a test of a model of phonological learning.

    How about this: as soon as their program can distinguish allophones, I will be impressed.

    I think you've got it exactly backwards here. The whole point this is demonstrate a model that loses the ability to tell allophones apart. I.e., that makes the jump from perceiving a speech stream as a continuous sequence of sounds laid out on a continuous acoustic space, to perceiving it as a sequence of discretely distinct segments.

    Of course, a major disclaimer: I haven't seen the actual research, so I don't know to what extent they've met these goals.

    1. Re:IAAL, too by kris_lang · · Score: 3, Informative

      I'm not at a place where I can access the research article, so let me comment about what I know about McLelland's previous work with neural networks.

      Rumelhart and McLelland worked on the groundbreaking "can a neural network learn how to pronounce words based on their spelling?" paper, which used back-propagation to train a neural net to do just that. That was in the 1980s. (Sejnowski at the Salk Institute followed up with a lot of neural net training studies too.)

      Their little cheat was that there was no temporal component to the data. Words were represented as sets of triplet-letters: catalog is represented as "-ca", "cat", "ata", "tal", "alo", and "log". (Actually, I don't remember if they used special sequences to represent start and stop, so --c -ca og- and g-- may not have been part of the sets.}

      And of course the neural net didn't really have audio output, though of course the rejoinder is that this would be trivial.

      My key question is how they deal with the issue of time in this study, and if there is any actual audio output which would act as feed-back for the training system or whether the output is representational only, as an output set of phonemes.

      Having real audio output and real audio input would let it correlate its output with real language examples. Having representational blobs would only mean that: given inputs of the hash that represents "hard TH" vs the hash that represents "soft TH" the system could yield a result of different outputs.

      And you're saying that the key result would be if the system learned to conflate or ignore the two sounds of "TH", hard or soft, in trying to interpret words. Remember that the initial Rumelhart-McLelland model was "content/meaning free", and I suspect that this one is too. Learning to conflate "x" and "y" in a neural net would be trivially implemented and trainable: the links for "x" and "y" into the model would have similar weights in the right contexts (the context being the set of predecessor and successor phonemes).

      It sounds like an agglomerator: given a large dataset of valid words in a given language, this system learns the rule for "predecessor" and "successor" probabilities of a particular phoneme vs another phoneme and then produces random output with the same Bayesian probability, producing gibberish nonsensical sounds which follow the probability distribution of the input training language.

      or that's my guess at least trying to be the typical slashdotter commenting without reading the article.

      I'll try to get at the article from the Uni with journal access tomorrow.

      Kris

  16. I like the theory... by Quiet_Desperation · · Score: 3, Insightful

    ...that babies talk in baby talk because that's how everyone talks *too* them.