SD LG ASDec 5, 2022

Audio Latent Space Cartography

arXiv:2212.02610v22.2h-index: 27

Originality Synthesis-oriented

AI Analysis

This work addresses interpretability challenges for audio latent spaces, primarily benefiting researchers in audio and machine learning, but it appears incremental as it applies existing methods to a new domain.

The paper tackled the problem of visualizing audio latent spaces to improve interpretability by developing an audio-to-image generation pipeline, demonstrating results on the NSynth dataset with a web demo available.

We explore the generation of visualisations of audio latent spaces using an audio-to-image generation pipeline. We believe this can help with the interpretability of audio latent spaces. We demonstrate a variety of results on the NSynth dataset. A web demo is available.

View on arXiv PDF

Similar