SDLGASDec 5, 2022

Audio Latent Space Cartography

arXiv:2212.02610v2h-index: 27
Originality Synthesis-oriented
AI Analysis

This work addresses interpretability challenges for audio latent spaces, primarily benefiting researchers in audio and machine learning, but it appears incremental as it applies existing methods to a new domain.

The paper tackled the problem of visualizing audio latent spaces to improve interpretability by developing an audio-to-image generation pipeline, demonstrating results on the NSynth dataset with a web demo available.

We explore the generation of visualisations of audio latent spaces using an audio-to-image generation pipeline. We believe this can help with the interpretability of audio latent spaces. We demonstrate a variety of results on the NSynth dataset. A web demo is available.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes