SDCVLGROASApr 4, 2022

Learning Neural Acoustic Fields

MIT
arXiv:2204.00628v2135 citationsh-index: 140
Originality Highly original
AI Analysis

This addresses the gap in spatial auditory modeling for applications in audio-visual learning and scene understanding, representing a novel method for a known bottleneck.

The paper tackles the problem of learning spatial auditory representations by introducing Neural Acoustic Fields (NAFs), an implicit representation that captures sound propagation in physical scenes, enabling rendering of spatial acoustics at arbitrary listener locations and improving visual learning with sparse views.

Our environment is filled with rich and dynamic acoustic information. When we walk into a cathedral, the reverberations as much as appearance inform us of the sanctuary's wide open space. Similarly, as an object moves around us, we expect the sound emitted to also exhibit this movement. While recent advances in learned implicit functions have led to increasingly higher quality representations of the visual world, there have not been commensurate advances in learning spatial auditory representations. To address this gap, we introduce Neural Acoustic Fields (NAFs), an implicit representation that captures how sounds propagate in a physical scene. By modeling acoustic propagation in a scene as a linear time-invariant system, NAFs learn to continuously map all emitter and listener location pairs to a neural impulse response function that can then be applied to arbitrary sounds. We demonstrate that the continuous nature of NAFs enables us to render spatial acoustics for a listener at an arbitrary location, and can predict sound propagation at novel locations. We further show that the representation learned by NAFs can help improve visual learning with sparse views. Finally, we show that a representation informative of scene structure emerges during the learning of NAFs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes