SP, SD, AS, IV
Feb 16, 2021

A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

arXiv:2102.07896v1
71 citations
AI Analysis

This dataset addresses a data-access bottleneck for researchers in speech science, linguistics, bio-inspired speech technology, and clinical applications by providing the first public-domain raw RT-MRI data, which could enable new methods for image reconstruction and biomarker extraction.

The authors tackled the lack of accessible raw multi-coil real-time MRI data for speech production by creating a comprehensive dataset from 75 subjects, including 2D videos, synchronized audio, raw data, 3D volumetric images, and high-resolution anatomical scans.

Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is, however, limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. Imaging the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing linguistically motivated speech tasks, alongside the corresponding first-ever public-domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each subject.

Code Implementations: 2 repos