LG CLMar 17

Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models

arXiv:2603.164409.6h-index: 1

Predicted impact top 82% in LG · last 90 daysOriginality Highly original

AI Analysis

This addresses a fundamental limitation in LLM compression that affects researchers and practitioners by providing the first pre-compression mechanism for component-level phase transition prediction.

The paper tackles the problem of capability-blind compression in large language models, where compression budgets are allocated without understanding what individual components functionally encode. They propose Capability-Guided Compression (CGC), which uses capability density maps to allocate differential compression budgets, and prove theoretically that components with higher capability density reach phase transition points at lower compression ratios.

Large language model compression has made substantial progress through pruning, quantization, and low-rank decomposition, yet a fundamental limitation persists across all existing methods: compression budgets are allocated without any representation of what individual model components functionally encode. We term this the capability-blind compression problem and argue it is a root cause of two well-documented failures -- the insensitivity of perplexity-based evaluation to reasoning capability loss, and the abrupt phase transitions in model performance recently characterized by Ma et al. (2026). We propose Capability-Guided Compression (CGC), a framework that addresses this by using Sparse Autoencoder (SAE)-derived capability density maps to allocate differential compression budgets across transformer components. Capability density is a formally defined scalar measure combining the feature breadth, activation entropy, and cross-input consistency of a component's SAE feature activation distribution. We prove theoretically that components with higher capability density exhibit lower structural redundancy and reach their individual phase transition points at lower compression ratios, providing the first pre-compression mechanism for component-level phase transition prediction. Experiments on GPT-2 Medium confirm that capability density is statistically independent of Wanda importance scores (Spearman rho = -0.054, n = 384 heads), establishing it as a genuinely novel compression signal orthogonal to all existing importance metrics. We report a negative result on PPL-based compression comparison and provide a principled diagnosis identifying GPT-2 Medium as an insufficient test bed for the full CGC hypothesis. The theoretical framework, density formalism, and orthogonality finding constitute a foundation for capability-aware compression research.

View on arXiv PDF

Similar