SD LG ASFeb 20, 2023

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman

arXiv:2302.10248v213.231 citationsh-index: 188Has Code

Originality Synthesis-oriented

AI Analysis

This challenge provides a benchmark for advancing speaker recognition technology in real-world scenarios, though it is incremental as part of an ongoing series.

The paper summarizes the VoxCeleb Speaker Recognition Challenge 2022, which evaluated state-of-the-art systems for speaker diarisation and recognition using 'in the wild' speech from YouTube, and reports results from four tracks including baselines and participant methods.

This paper summarises the findings from the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC-22), which was held in conjunction with INTERSPEECH 2022. The goal of this challenge was to evaluate how well state-of-the-art speaker recognition systems can diarise and recognise speakers from speech obtained "in the wild". The challenge consisted of: (i) the provision of publicly available speaker recognition and diarisation data from YouTube videos together with ground truth annotation and standardised evaluation software; and (ii) a public challenge and hybrid workshop held at INTERSPEECH 2022. We describe the four tracks of our challenge along with the baselines, methods, and results. We conclude with a discussion on the new domain-transfer focus of VoxSRC-22, and on the progression of the challenge from the previous three editions.

View on arXiv PDF Code

Similar