SDAIApr 7, 2025

L3AC: Towards a Lightweight and Lossless Audio Codec

arXiv:2504.04949v25 citationsh-index: 26Has Code
Originality Incremental advance
AI Analysis

This addresses the practicality issue for real-world audio compression applications, though it appears incremental in improving efficiency.

The paper tackles the problem of resource-intensive neural audio codecs by introducing L3AC, a lightweight codec that matches or exceeds the reconstruction quality of leading codecs while reducing computational overhead by an order of magnitude.

Neural audio codecs have recently gained traction for their ability to compress high-fidelity audio and provide discrete tokens for generative modeling. However, leading approaches often rely on resource-intensive models and complex multi-quantizer architectures, limiting their practicality in real-world applications. In this work, we introduce L3AC, a lightweight neural audio codec that addresses these challenges by leveraging a single quantizer and a highly efficient architecture. To enhance reconstruction fidelity while minimizing model complexity, L3AC explores streamlined convolutional networks and local Transformer modules, alongside TConv--a novel structure designed to capture acoustic variations across multiple temporal scales. Despite its compact design, extensive experiments across diverse datasets demonstrate that L3AC matches or exceeds the reconstruction quality of leading codecs while reducing computational overhead by an order of magnitude. The single-quantizer design further enhances its adaptability for downstream tasks. The source code is publicly available at https://github.com/zhai-lw/L3AC.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes