LGNov 18, 2023

Auxiliary Losses for Learning Generalizable Concept-based Models

arXiv:2311.11108v122.648 citationsh-index: 26

Originality Incremental advance

AI Analysis

This addresses the need for transparent neural networks in image classification, though it is incremental as it builds on existing CBM frameworks.

The paper tackles the performance trade-off in Concept Bottleneck Models (CBMs) by proposing coop-CBM with a concept orthogonal loss, achieving higher accuracy across distributional shift settings compared to black-box models.

The increasing use of neural networks in various applications has lead to increasing apprehensions, underscoring the necessity to understand their operations beyond mere final predictions. As a solution to enhance model transparency, Concept Bottleneck Models (CBMs) have gained popularity since their introduction. CBMs essentially limit the latent space of a model to human-understandable high-level concepts. While beneficial, CBMs have been reported to often learn irrelevant concept representations that consecutively damage model performance. To overcome the performance trade-off, we propose cooperative-Concept Bottleneck Model (coop-CBM). The concept representation of our model is particularly meaningful when fine-grained concept labels are absent. Furthermore, we introduce the concept orthogonal loss (COL) to encourage the separation between the concept representations and to reduce the intra-concept distance. This paper presents extensive experiments on real-world datasets for image classification tasks, namely CUB, AwA2, CelebA and TIL. We also study the performance of coop-CBM models under various distributional shift settings. We show that our proposed method achieves higher accuracy in all distributional shift settings even compared to the black-box models with the highest concept accuracy.

View on arXiv PDF

Similar