ASLGSDJan 26, 2022

Invertible Voice Conversion

arXiv:2201.10687v1
Originality Incremental advance
AI Analysis

This work addresses security concerns in voice conversion by enabling traceability, which is incremental as it builds on existing invertible neural network methods.

The authors tackled the problem of making voice conversion systems traceable by developing an invertible deep learning framework called INVVC, which allows converted voices to be reversed back to their source inputs using the same parameters, with experimental results showing impressive performance in voice conversion tasks.

In this paper, we propose an invertible deep learning framework called INVVC for voice conversion. It is designed against the possible threats that inherently come along with voice conversion systems. Specifically, we develop an invertible framework that makes the source identity traceable. The framework is built on a series of invertible $1\times1$ convolutions and flows consisting of affine coupling layers. We apply the proposed framework to one-to-one voice conversion and many-to-one conversion using parallel training data. Experimental results show that this approach yields impressive performance on voice conversion and, moreover, the converted results can be reversed back to the source inputs utilizing the same parameters as in forwarding.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes