AICLCVLGNEOct 18, 2023

From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks

arXiv:2310.11884v219 citationsh-index: 7
Originality Synthesis-oriented
AI Analysis

This is an incremental survey that reviews existing approaches without introducing new methods or results.

The paper surveys recent methods for explaining concepts in neural networks, aiming to bridge learning and reasoning by identifying and integrating concepts for neuro-symbolic AI.

In this paper, we review recent approaches for explaining concepts in neural networks. Concepts can act as a natural link between learning and reasoning: once the concepts are identified that a neural learning system uses, one can integrate those concepts with a reasoning system for inference or use a reasoning system to act upon them to improve or enhance the learning system. On the other hand, knowledge can not only be extracted from neural networks but concept knowledge can also be inserted into neural network architectures. Since integrating learning and reasoning is at the core of neuro-symbolic AI, the insights gained from this survey can serve as an important step towards realizing neuro-symbolic AI based on explainable concepts.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes