SD AIJan 27

Hyperbolic Additive Margin Softmax with Hierarchical Information for Speaker Verification

arXiv:2601.19709v11 citationsh-index: 4Has Code

Originality Incremental advance

AI Analysis

This work addresses speaker verification by improving hierarchical modeling, representing an incremental advancement over existing Euclidean-based methods.

The paper tackles the problem of modeling hierarchical information in speaker embeddings for speaker verification by proposing Hyperbolic Softmax (H-Softmax) and Hyperbolic Additive Margin Softmax (HAM-Softmax), which achieve average relative EER reductions of 27.84% and 14.23% compared to standard methods.

Speaker embedding learning based on Euclidean space has achieved significant progress, but it is still insufficient in modeling hierarchical information within speaker features. Hyperbolic space, with its negative curvature geometric properties, can efficiently represent hierarchical information within a finite volume, making it more suitable for the feature distribution of speaker embeddings. In this paper, we propose Hyperbolic Softmax (H-Softmax) and Hyperbolic Additive Margin Softmax (HAM-Softmax) based on hyperbolic space. H-Softmax incorporates hierarchical information into speaker embeddings by projecting embeddings and speaker centers into hyperbolic space and computing hyperbolic distances. HAM-Softmax further enhances inter-class separability by introducing margin constraint on this basis. Experimental results show that H-Softmax and HAM-Softmax achieve average relative EER reductions of 27.84% and 14.23% compared with standard Softmax and AM-Softmax, respectively, demonstrating that the proposed methods effectively improve speaker verification performance and at the same time preserve the capability of hierarchical structure modeling. The code will be released at https://github.com/PunkMale/HAM-Softmax.

View on arXiv PDF Code

Similar