DSLGMar 30

Universal Approximation Constraints of Narrow ResNets: The Tunnel Effect

arXiv:2603.2859122.2h-index: 2
AI Analysis

This addresses theoretical limitations in neural network design for researchers, providing insights into architecture-target incompatibility, but it is incremental as it builds on existing ResNet and approximation theory.

The paper tackles the universal approximation constraints of narrow ResNets, showing that they cannot represent critical points of input-output maps, leading to a 'tunnel effect' where critical points shift to infinity in classification tasks, with approximation bounds depending on channel ratios and weight bounds.

We analyze the universal approximation constraints of narrow Residual Neural Networks (ResNets) both theoretically and numerically. For deep neural networks without input space augmentation, a central constraint is the inability to represent critical points of the input-output map. We prove that this has global consequences for target function approximations and show that the manifestation of this defect is typically a shift of the critical point to infinity, which we call the ``tunnel effect'' in the context of classification tasks. While ResNets offer greater expressivity than standard multilayer perceptrons (MLPs), their capability strongly depends on the signal ratio between the skip and residual channels. We establish quantitative approximation bounds for both the residual-dominant (close to MLP) and skip-dominant (close to neural ODE) regimes. These estimates depend explicitly on the channel ratio and uniform network weight bounds. Low-dimensional examples further provide a detailed analysis of the different ResNet regimes and how architecture-target incompatibility influences the approximation error.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes