SDAILGASSPSep 13, 2024

LMAC-TD: Producing Time Domain Explanations for Audio Classifiers

arXiv:2409.08655v13 citationsh-index: 31
Originality Incremental advance
AI Analysis

This work addresses the need for more interpretable audio classifiers, offering a domain-specific solution that is incremental in nature.

The paper tackles the problem of explaining audio classifier decisions by introducing LMAC-TD, a post-hoc method that trains a decoder to generate explanations directly in the time domain, building on L-MAC and incorporating SepFormer. The result is a significant improvement in audio quality of explanations without sacrificing faithfulness, as demonstrated through a user study.

Neural networks are typically black-boxes that remain opaque with regards to their decision mechanisms. Several works in the literature have proposed post-hoc explanation methods to alleviate this issue. This paper proposes LMAC-TD, a post-hoc explanation method that trains a decoder to produce explanations directly in the time domain. This methodology builds upon the foundation of L-MAC, Listenable Maps for Audio Classifiers, a method that produces faithful and listenable explanations. We incorporate SepFormer, a popular transformer-based time-domain source separation architecture. We show through a user study that LMAC-TD significantly improves the audio quality of the produced explanations while not sacrificing from faithfulness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes