LGAIJul 27, 2024

CoLiDR: Concept Learning using Aggregated Disentangled Representations

arXiv:2407.19300v13 citationsh-index: 10
Originality Incremental advance
AI Analysis

This work addresses interpretability in AI by bridging generative factors and concepts, offering a flexible method for various data types, though it appears incremental as it builds on existing concept-based and disentanglement research.

The paper tackles the problem of explaining deep neural networks by unifying mathematically disentangled representations with human-understandable concepts, proposing CoLiDR to aggregate generative factors into concepts while maintaining parity with state-of-the-art concept-based approaches across four datasets.

Interpretability of Deep Neural Networks using concept-based models offers a promising way to explain model behavior through human-understandable concepts. A parallel line of research focuses on disentangling the data distribution into its underlying generative factors, in turn explaining the data generation process. While both directions have received extensive attention, little work has been done on explaining concepts in terms of generative factors to unify mathematically disentangled representations and human-understandable concepts as an explanation for downstream tasks. In this paper, we propose a novel method CoLiDR - which utilizes a disentangled representation learning setup for learning mutually independent generative factors and subsequently learns to aggregate the said representations into human-understandable concepts using a novel aggregation/decomposition module. Experiments are conducted on datasets with both known and unknown latent generative factors. Our method successfully aggregates disentangled generative factors into concepts while maintaining parity with state-of-the-art concept-based approaches. Quantitative and visual analysis of the learned aggregation procedure demonstrates the advantages of our work compared to commonly used concept-based models over four challenging datasets. Lastly, our work is generalizable to an arbitrary number of concepts and generative factors - making it flexible enough to be suitable for various types of data.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes