AIDec 11, 2025

Multi-Granular Node Pruning for Circuit Discovery

arXiv:2512.10903v1h-index: 4
Originality Incremental advance
AI Analysis

This addresses scalability and precision issues for researchers analyzing model interpretability, though it is incremental as it builds on prior pruning techniques.

The paper tackles the computational expense and coarse granularity of existing circuit discovery methods in large language models by proposing a node-level pruning framework that identifies smaller circuits with a 5-10x lower memory footprint while maintaining task performance.

Circuit discovery aims to identify minimal subnetworks that are responsible for specific behaviors in large language models (LLMs). Existing approaches primarily rely on iterative edge pruning, which is computationally expensive and limited to coarse-grained units such as attention heads or MLP blocks, overlooking finer structures like individual neurons. We propose a node-level pruning framework for circuit discovery that addresses both scalability and granularity limitations. Our method introduces learnable masks across multiple levels of granularity, from entire blocks to individual neurons, within a unified optimization objective. Granularity-specific sparsity penalties guide the pruning process, allowing a comprehensive compression in a single fine-tuning run. Empirically, our approach identifies circuits that are smaller in nodes than those discovered by prior methods; moreover, we demonstrate that many neurons deemed important by coarse methods are actually irrelevant, while still maintaining task performance. Furthermore, our method has a significantly lower memory footprint, 5-10x, as it does not require keeping intermediate activations in the memory to work.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes