Feb 20, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

arXiv:2302.10248v2, 31 citations, h-index: 188
AI Analysis

This challenge provides a benchmark for advancing speaker recognition technology in real-world scenarios, though it is incremental as part of an ongoing series.

The paper summarizes the VoxCeleb Speaker Recognition Challenge 2022, which evaluated state-of-the-art systems for speaker diarisation and recognition using 'in the wild' speech from YouTube, and reports results from four tracks including baselines and participant methods.

This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022. The goal of this challenge was to evaluate how well state-of-the-art speaker recognition systems can diarise and recognise speakers from speech obtained "in the wild". The challenge consisted of: (i) the provision of publicly available speaker recognition and diarisation data from YouTube videos together with ground truth annotation and standardised evaluation software; and (ii) a public challenge and hybrid workshop held at INTERSPEECH 2022. We describe the four tracks of our challenge along with the baselines, methods, and results. We conclude with a discussion on the new domain-transfer focus of VoxSRC-22, and on the progression of the challenge from the previous three editions.
