SD CL ASMar 10, 2022

Parameter-Free Attentive Scoring for Speaker Verification

Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio Lopez Moreno

arXiv:2203.05642v38.35 citationsh-index: 21

Originality Incremental advance

AI Analysis

This work addresses speaker verification for applications requiring flexible and efficient scoring, though it is incremental as it builds on existing attention mechanisms.

The paper tackles speaker verification by introducing a parameter-free attentive scoring method that eliminates the need for a parametric scoring model, achieving a 10% improvement in average equal error rate over the cosine similarity baseline.

This paper presents a novel study of parameter-free attentive scoring for speaker verification. Parameter-free scoring provides the flexibility of comparing speaker representations without the need of an accompanying parametric scoring model. Inspired by the attention component in Transformer neural networks, we propose a variant of the scaled dot product attention mechanism to compare enrollment and test segment representations. In addition, this work explores the effect on performance of (i) different types of normalization, (ii) independent versus tied query/key estimation, (iii) varying the number of key-value pairs and (iv) pooling multiple enrollment utterance statistics. Experimental results for a 4 task average show that a simple parameter-free attentive scoring mechanism can improve the average EER by 10% over the best cosine similarity baseline.

View on arXiv PDF

Similar