CVApr 11, 2017

Mining Object Parts from CNNs via Active Question-Answering

arXiv:1704.03173v111 citations
Originality Incremental advance
AI Analysis

This addresses the interpretability problem in deep learning for researchers and practitioners, though it is incremental as it builds on existing CNNs and active-learning techniques.

The paper tackles the problem of interpreting convolutional neural networks (CNNs) by mining object parts using active question-answering, achieving similar or better part-localization performance than fast-RCNN methods with only 1/6-1/3 of the part annotations.

Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent a four-layer semantic hierarchy of the part. As an interpretable model, the AOG associates different CNN units with different explicit object parts. We use an active human-computer communication to incrementally grow such an AOG on the pre-trained CNN as follows. We allow the computer to actively identify objects, whose neural patterns cannot be explained by the current AOG. Then, the computer asks human about the unexplained objects, and uses the answers to automatically discover certain CNN patterns corresponding to the missing knowledge. We incrementally grow the AOG to encode new knowledge discovered during the active-learning process. In experiments, our method exhibits high learning efficiency. Our method uses about 1/6-1/3 of the part annotations for training, but achieves similar or better part-localization performance than fast-RCNN methods.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes