SDASAug 2, 2018

Statistical Speech Model Description with VMF Mixture Model

arXiv:1808.00960v21 citations
Originality Incremental advance
AI Analysis

This is an incremental improvement in speech coding for compression applications.

The paper tackles the problem of modeling LSF parameters in speech coding by representing them as unit vectors and using a von Mises-Fisher mixture model, resulting in a VMM-based vector quantization method that outperforms DVQ and GVQ in experiments.

In this paper, we present the LSF parameters by a unit vector form, which has directional characteristics. The underlying distribution of this unit vector variable is modeled by a von Mises-Fisher mixture model (VMM). With the high rate theory, the optimal inter-component bit allocation strategy is proposed and the distortion-rate (D-R) relation is derived for the VMM based-VQ (VVQ). Experimental results show that the VVQ outperforms our recently introduced DVQ and the conventional GVQ.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes