AS CL SDApr 16, 2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

Kong Aik Lee, Ville Hautamaki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li

arXiv:1904.07386v17.321 citations

Originality Synthesis-oriented

AI Analysis

This work provides a retrospective analysis for the speaker recognition community, but it is incremental as it focuses on summarizing past results and trends.

The paper summarizes the I4U consortium's submission to the NIST SRE 2018 speaker recognition evaluation, which was among the best-performing systems, and reflects on lessons learned from a decade of participation, highlighting paradigm shifts such as from supervector representation to deep speaker embedding.

The I4U consortium was established to facilitate a joint entry to NIST speaker recognition evaluations (SRE). The latest edition of such joint submission was in SRE 2018, in which the I4U submission was among the best-performing systems. SRE'18 also marks the 10-year anniversary of I4U consortium into NIST SRE series of evaluation. The primary objective of the current paper is to summarize the results and lessons learned based on the twelve sub-systems and their fusion submitted to SRE'18. It is also our intention to present a shared view on the advancements, progresses, and major paradigm shifts that we have witnessed as an SRE participant in the past decade from SRE'08 to SRE'18. In this regard, we have seen, among others, a paradigm shift from supervector representation to deep speaker embedding, and a switch of research challenge from channel compensation to domain adaptation.

View on arXiv PDF

Similar