ASCVHCLGSDSPJun 9, 2023

HRTF upsampling with a generative adversarial network using a gnomonic equiangular projection

arXiv:2306.05812v240 citationsh-index: 22
Originality Incremental advance
AI Analysis

This work addresses the need for cost-effective HRTF measurement in VR/AR applications, though it is incremental as it builds on existing upsampling techniques with a novel data transformation approach.

The paper tackled the problem of efficiently creating high-resolution individualized head-related transfer functions (HRTFs) for VR/AR by proposing a GAN-based upsampling method, which outperformed three baselines in log-spectral distortion and localization performance when input HRTFs had fewer than 20 measured positions.

An individualised head-related transfer function (HRTF) is very important for creating realistic virtual reality (VR) and augmented reality (AR) environments. However, acoustically measuring high-quality HRTFs requires expensive equipment and an acoustic lab setting. To overcome these limitations and to make this measurement more efficient HRTF upsampling has been exploited in the past where a high-resolution HRTF is created from a low-resolution one. This paper demonstrates how generative adversarial networks (GANs) can be applied to HRTF upsampling. We propose a novel approach that transforms the HRTF data for direct use with a convolutional super-resolution generative adversarial network (SRGAN). This new approach is benchmarked against three baselines: barycentric upsampling, spherical harmonic (SH) upsampling and an HRTF selection approach. Experimental results show that the proposed method outperforms all three baselines in terms of log-spectral distortion (LSD) and localisation performance using perceptual models when the input HRTF is sparse (less than 20 measured positions).

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes