CVAug 5, 2017

Interpreting CNN Knowledge via an Explanatory Graph

arXiv:1708.01785v3256 citations
Originality Incremental advance
AI Analysis

This provides a method for understanding CNN representations, which is useful for researchers in interpretable AI, though it is incremental as it builds on existing part-based analysis.

The paper tackles the problem of interpreting knowledge in pre-trained CNNs by learning an explanatory graph that disentangles object part patterns from filters, and shows that the graph nodes consistently represent the same parts across images, with significant outperformance in part localization tasks.

This paper learns a graphical model, namely an explanatory graph, which reveals the knowledge hierarchy hidden inside a pre-trained CNN. Considering that each filter in a conv-layer of a pre-trained CNN usually represents a mixture of object parts, we propose a simple yet efficient method to automatically disentangles different part patterns from each filter, and construct an explanatory graph. In the explanatory graph, each node represents a part pattern, and each edge encodes co-activation relationships and spatial relationships between patterns. More importantly, we learn the explanatory graph for a pre-trained CNN in an unsupervised manner, i.e., without a need of annotating object parts. Experiments show that each graph node consistently represents the same object part through different images. We transfer part patterns in the explanatory graph to the task of part localization, and our method significantly outperforms other approaches.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes