CVMay 19, 2016

Tongue contour extraction from ultrasound images based on deep neural network

arXiv:1605.05912v142 citations
Originality Incremental advance
AI Analysis

This work addresses the challenge of reducing human intervention in ultrasound image labeling for speech research, though it appears incremental as it builds on existing methods.

The authors tackled the problem of automatic tongue contour extraction from ultrasound images for speech analysis by developing a deep autoencoder method, achieving quality scores comparable to state-of-the-art results.

Studying tongue motion during speech using ultrasound is a standard procedure, but automatic ultrasound image labelling remains a challenge, as standard tongue shape extraction methods typically require human intervention. This article presents a method based on deep neural networks to automatically extract tongue contour from ultrasound images on a speech dataset. We use a deep autoencoder trained to learn the relationship between an image and its related contour, so that the model is able to automatically reconstruct contours from the ultrasound image alone. In this paper, we use an automatic labelling algorithm instead of time-consuming hand-labelling during the training process, and estimate the performances of both automatic labelling and contour extraction as compared to hand-labelling. Observed results show quality scores comparable to the state of the art.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes