SDSep 24, 2015

An Investigation of Universal Background Sparse Coding Based Speaker Verification on TIMIT

arXiv:1509.07298v4
Originality Synthesis-oriented
AI Analysis

This is an incremental improvement for speaker verification systems, offering an alternative to Gaussian mixture models without Gaussian assumptions.

The paper tackled speaker verification by proposing a universal background sparse coding (UBSC) method, which avoids local minima and Gaussian assumptions, and found it comparable to Gaussian mixture models on the TIMIT corpus.

In this paper, we propose a universal background model, named universal background sparse coding (UBSC), for speaker verification. The proposed method trains an ensemble of clusterings by data resampling, and produces sparse codes from the clusterings by one-nearest-neighbor optimization plus binarization. The main advantage of UBSC is that it does not suffer from local minima and does not make Gaussian assumptions on data distributions. We evaluated UBSC on a clean speech corpus---TIMIT. We used the cosine similarity and inner product similarity as the scoring methods of a trial. Experimental results show that UBSC is comparable to Gaussian mixture model.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes