CLNov 13, 2018

Corpus Phonetics Tutorial

arXiv:1811.05553v110 citations
Originality Synthesis-oriented
AI Analysis

It serves as a practical guide for researchers in linguistics and speech technology, but is incremental as it compiles existing methods without new findings.

This tutorial introduces speech scientists and engineers to various automatic speech processing tools for corpus phonetics, including acoustic model creation and forced alignment using multiple toolkits, with step-by-step instructions and tips provided.

Corpus phonetics has become an increasingly popular method of research in linguistic analysis. With advances in speech technology and computational power, large scale processing of speech data has become a viable technique. This tutorial introduces the speech scientist and engineer to various automatic speech processing tools. These include acoustic model creation and forced alignment using the Kaldi Automatic Speech Recognition Toolkit (Povey et al., 2011), forced alignment using FAVE-align (Rosenfelder et al., 2014), the Montreal Forced Aligner (McAuliffe et al., 2017), and the Penn Phonetics Lab Forced Aligner (Yuan & Liberman, 2008), as well as stop consonant burst alignment using AutoVOT (Keshet et al., 2014). The tutorial provides a general overview of each program, step-by-step instructions for running the program, as well as several tips and tricks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes