AS LG SDJan 26, 2022

Invertible Voice Conversion

arXiv:2201.10687v11.2

Originality Incremental advance

AI Analysis

This work addresses security concerns in voice conversion by enabling traceability, which is incremental as it builds on existing invertible neural network methods.

The authors tackled the problem of making voice conversion systems traceable by developing an invertible deep learning framework called INVVC, which allows converted voices to be reversed back to their source inputs using the same parameters, with experimental results showing impressive performance in voice conversion tasks.

In this paper, we propose an invertible deep learning framework called INVVC for voice conversion. It is designed against the possible threats that inherently come along with voice conversion systems. Specifically, we develop an invertible framework that makes the source identity traceable. The framework is built on a series of invertible $1\times1$ convolutions and flows consisting of affine coupling layers. We apply the proposed framework to one-to-one voice conversion and many-to-one conversion using parallel training data. Experimental results show that this approach yields impressive performance on voice conversion and, moreover, the converted results can be reversed back to the source inputs utilizing the same parameters as in forwarding.

View on arXiv PDF

Similar