SDOct 19, 2016

A model of infant speech perception and learning

arXiv:1610.06214v1
Originality Synthesis-oriented
AI Analysis

This work addresses infant speech development modeling, but it appears incremental as it combines existing techniques without major breakthroughs.

The paper tackled the problem of modeling infant speech perception and learning by using Echo State Networks for vowel recognition and Reinforcement Learning for imitation, proposing a method to address speaker normalization through caregiver interaction.

Infant speech perception and learning is modeled using Echo State Network classification and Reinforcement Learning. Ambient speech for the modeled infant learner is created using the speech synthesizer Vocaltractlab. An auditory system is trained to recognize vowel sounds from a series of speakers of different anatomies in Vocaltractlab. Having formed perceptual targets, the infant uses Reinforcement Learning to imitate his ambient speech. A possible way of bridging the problem of speaker normalisation is proposed, using direct imitation but also including a caregiver who listens to the infants sounds and imitates those that sound vowel-like.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes