CLSDNov 6, 2016

Domain Adaptation For Formant Estimation Using Deep Learning

arXiv:1611.01783v1
Originality Incremental advance
AI Analysis

This work addresses the challenge of making formant estimation robust across various speech domains, which is incremental as it builds on existing deep learning approaches with adaptation layers.

The paper tackles the problem of formant estimation across diverse speakers and speech styles by introducing a domain adaptation technique using a deep network, achieving performance that compares favorably with alternative methods on three datasets with different characteristics.

In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes