Learning to reconstruct from saturated data: audio declipping and high-dynamic range imaging

Victor Sechaud, Laurent Jacques, Patrice Abry, Julián Tachella

arXiv:2602.22279v12.6h-index: 11

Originality Highly original

AI Analysis

This addresses the challenge of deploying learning-based methods in real-world applications like audio declipping and high-dynamic range imaging where ground truth is unavailable, offering a practical solution.

The paper tackles the problem of reconstructing audio and images from clipped (saturated) measurements without ground truth data, by extending self-supervised learning to this non-linear inverse problem, and shows it is almost as effective as supervised methods.

Learning based methods are now ubiquitous for solving inverse problems, but their deployment in real-world applications is often hindered by the lack of ground truth references for training. Recent self-supervised learning strategies offer a promising alternative, avoiding the need for ground truth. However, most existing methods are limited to linear inverse problems. This work extends self-supervised learning to the non-linear problem of recovering audio and images from clipped measurements, by assuming that the signal distribution is approximately invariant to changes in amplitude. We provide sufficient conditions for learning to reconstruct from saturated signals alone and a self-supervised loss that can be used to train reconstruction networks. Experiments on both audio and image data show that the proposed approach is almost as effective as fully supervised approaches, despite relying solely on clipped measurements for training.

View on arXiv PDF

Similar