SDLGASAug 25, 2018

Multiobjective Optimization Training of PLDA for Speaker Verification

arXiv:1808.08344v2
Originality Incremental advance
AI Analysis

This addresses speaker verification systems by improving backend classifier performance, though it appears incremental as it builds on existing PLDA methods.

The paper tackled the problem of PLDA backend classifiers in speaker verification ignoring speaker distinction by proposing a multi-objective optimization training method, resulting in over 10% relative improvement in EER and MinDCF on NIST SRE14 and about 20% in EER on MCE18.

Most current state-of-the-art text-independent speaker verification systems take probabilistic linear discriminant analysis (PLDA) as their backend classifiers. The parameters of PLDA are often estimated by maximizing the objective function, which focuses on increasing the value of log-likelihood function, but ignoring the distinction between speakers. In order to better distinguish speakers, we propose a multi-objective optimization training for PLDA. Experiment results show that the proposed method has more than 10% relative performance improvement in both EER and MinDCF on the NIST SRE14 i-vector challenge dataset, and about 20% relative performance improvement in EER on the MCE18 dataset.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes