MMSDASMay 3, 2021

Naturalistic audio-visual volumetric sequences dataset of sounding actions for six degree-of-freedom interaction

arXiv:2105.00641v1
Originality Synthesis-oriented
AI Analysis

This dataset addresses the need for test material for audio-visual systems in immersive and interactive applications, but it is incremental as it builds on existing volumetric data efforts.

The researchers tackled the lack of naturalistic audio-visual volumetric datasets for immersive systems by creating a dataset of forty short action sequences with integrated audio and video, covering diverse sound types and features.

As audio-visual systems increasingly bring immersive and interactive capabilities into our work and leisure activities, so the need for naturalistic test material grows. New volumetric datasets have captured high-quality 3D video, but accompanying audio is often neglected, making it hard to test an integrated bimodal experience. Designed to cover diverse sound types and features, the presented volumetric dataset was constructed from audio and video studio recordings of scenes to yield forty short action sequences. Potential uses in technical and scientific tests are discussed.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes