Maitreya Kocharekar

h-index1
1paper

1 Paper

ASMay 7, 2025
Discrete Optimal Transport and Voice Conversion

Anton Selitskiy, Maitreya Kocharekar

In this work, we address the voice conversion (VC) task using a vector-based interface. To align audio embeddings between speakers, we employ discrete optimal transport mapping. Our evaluation results demonstrate the high quality and effectiveness of this method. Additionally, we show that applying discrete optimal transport as a post-processing step in audio generation can lead to the incorrect classification of synthetic audio as real.