LGAIRTMLOct 4, 2025

Implicit Models: Expressive Power Scales with Test-Time Compute

arXiv:2510.03638v13 citationsh-index: 5
Originality Highly original
AI Analysis

This addresses a foundational gap in machine learning by explaining the mechanism behind implicit models' performance, which is incremental but has broad implications for memory-efficient AI.

The paper tackles the problem of understanding how implicit models achieve expressive power by scaling test-time compute, showing through nonparametric analysis that iterative computation allows these models to match richer function classes, validated across image reconstruction, scientific computing, and operations research with improved solution quality and stability as iterations increase.

Implicit models, an emerging model class, compute outputs by iterating a single parameter block to a fixed point. This architecture realizes an infinite-depth, weight-tied network that trains with constant memory, significantly reducing memory needs for the same level of performance compared to explicit models. While it is empirically known that these compact models can often match or even exceed larger explicit networks by allocating more test-time compute, the underlying mechanism remains poorly understood. We study this gap through a nonparametric analysis of expressive power. We provide a strict mathematical characterization, showing that a simple and regular implicit operator can, through iteration, progressively express more complex mappings. We prove that for a broad class of implicit models, this process lets the model's expressive power scale with test-time compute, ultimately matching a much richer function class. The theory is validated across three domains: image reconstruction, scientific computing, and operations research, demonstrating that as test-time iterations increase, the complexity of the learned mapping rises, while the solution quality simultaneously improves and stabilizes.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes