SDCLMLNov 2, 2016

The Intelligent Voice 2016 Speaker Recognition System

arXiv:1611.00514v1
Originality Synthesis-oriented
AI Analysis

This work addresses speaker recognition for diverse languages with less data, but it is incremental as it builds on existing methods.

The paper tackled the challenge of developing a speaker recognition system robust to novel, heterogeneous languages with limited training data, using an i-vector/PLDA approach, and reported results on the NIST 2016 SRE protocol.

This paper presents the Intelligent Voice (IV) system submitted to the NIST 2016 Speaker Recognition Evaluation (SRE). The primary emphasis of SRE this year was on developing speaker recognition technology which is robust for novel languages that are much more heterogeneous than those used in the current state-of-the-art, using significantly less training data, that does not contain meta-data from those languages. The system is based on the state-of-the-art i-vector/PLDA which is developed on the fixed training condition, and the results are reported on the protocol defined on the development set of the challenge.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes