LGAIApr 8

ConceptTracer: Interactive Analysis of Concept Saliency and Selectivity in Neural Representations

arXiv:2604.0701953.3Has Code
AI Analysis

This provides a practical framework for researchers and practitioners to explore how neural networks encode concept-level information, though it is incremental as it builds on existing interpretability methods.

The paper tackles the opacity of neural network decision-making by introducing ConceptTracer, an interactive tool for analyzing concept saliency and selectivity in neural representations, specifically demonstrating its utility on TabPFN to discover interpretable neurons.

Neural networks deliver impressive predictive performance across a variety of tasks, but they are often opaque in their decision-making processes. Despite a growing interest in mechanistic interpretability, tools for systematically exploring the representations learned by neural networks in general, and tabular foundation models in particular, remain limited. In this work, we introduce ConceptTracer, an interactive application for analyzing neural representations through the lens of human-interpretable concepts. ConceptTracer integrates two information-theoretic measures that quantify concept saliency and selectivity, enabling researchers and practitioners to identify neurons that respond strongly to individual concepts. We demonstrate the utility of ConceptTracer on representations learned by TabPFN and show that our approach facilitates the discovery of interpretable neurons. Together, these capabilities provide a practical framework for investigating how neural networks like TabPFN encode concept-level information. ConceptTracer is available at https://github.com/ml-lab-htw/concept-tracer.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes