CVAug 5, 2017

Interpreting CNN Knowledge via an Explanatory Graph

Quanshi Zhang, Ruiming Cao, Feng Shi, Ying Nian Wu, Song-Chun Zhu

arXiv:1708.01785v326.4256 citations

Originality Incremental advance

AI Analysis

This provides a method for understanding CNN representations, which is useful for researchers in interpretable AI, though it is incremental as it builds on existing part-based analysis.

The paper tackles the problem of interpreting knowledge in pre-trained CNNs by learning an explanatory graph that disentangles object part patterns from filters, and shows that the graph nodes consistently represent the same parts across images, with significant outperformance in part localization tasks.

This paper learns a graphical model, namely an explanatory graph, which reveals the knowledge hierarchy hidden inside a pre-trained CNN. Considering that each filter in a conv-layer of a pre-trained CNN usually represents a mixture of object parts, we propose a simple yet efficient method to automatically disentangles different part patterns from each filter, and construct an explanatory graph. In the explanatory graph, each node represents a part pattern, and each edge encodes co-activation relationships and spatial relationships between patterns. More importantly, we learn the explanatory graph for a pre-trained CNN in an unsupervised manner, i.e., without a need of annotating object parts. Experiments show that each graph node consistently represents the same object part through different images. We transfer part patterns in the explanatory graph to the task of part localization, and our method significantly outperforms other approaches.

View on arXiv PDF

Similar