Audio Latent Space Cartography
This work addresses interpretability challenges for audio latent spaces, primarily benefiting researchers in audio and machine learning, but it appears incremental as it applies existing methods to a new domain.
The paper tackled the problem of visualizing audio latent spaces to improve interpretability by developing an audio-to-image generation pipeline, demonstrating results on the NSynth dataset with a web demo available.
We explore the generation of visualisations of audio latent spaces using an audio-to-image generation pipeline. We believe this can help with the interpretability of audio latent spaces. We demonstrate a variety of results on the NSynth dataset. A web demo is available.