CVIRLGJun 21, 2022

Automatic Concept Extraction for Concept Bottleneck-based Video Classification

arXiv:2206.10129v121 citationsh-index: 68
Originality Incremental advance
AI Analysis

This work addresses the intractability of manually defining concepts for complex video classification tasks, enabling more interpretable deep learning models in this domain.

The authors tackled the problem of concept bottleneck models requiring predefined concepts for complex tasks like video classification by developing CoDEx, an automatic concept discovery and extraction module, which achieved competitive accuracy on new datasets with crowd-sourced explanations.

Recent efforts in interpretable deep learning models have shown that concept-based explanation methods achieve competitive accuracy with standard end-to-end models and enable reasoning and intervention about extracted high-level visual concepts from images, e.g., identifying the wing color and beak length for bird-species classification. However, these concept bottleneck models rely on a necessary and sufficient set of predefined concepts-which is intractable for complex tasks such as video classification. For complex tasks, the labels and the relationship between visual elements span many frames, e.g., identifying a bird flying or catching prey-necessitating concepts with various levels of abstraction. To this end, we present CoDEx, an automatic Concept Discovery and Extraction module that rigorously composes a necessary and sufficient set of concept abstractions for concept-based video classification. CoDEx identifies a rich set of complex concept abstractions from natural language explanations of videos-obviating the need to predefine the amorphous set of concepts. To demonstrate our method's viability, we construct two new public datasets that combine existing complex video classification datasets with short, crowd-sourced natural language explanations for their labels. Our method elicits inherent complex concept abstractions in natural language to generalize concept-bottleneck methods to complex tasks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes