LGAICVJul 22, 2025

Understanding Generalization, Robustness, and Interpretability in Low-Capacity Neural Networks

arXiv:2507.16278v1
Originality Incremental advance
AI Analysis

This work addresses fundamental trade-offs in neural network design for researchers in machine learning, though it is incremental as it builds on existing studies of capacity and sparsity.

The study tackled the relationship between model capacity, sparsity, and robustness in low-capacity neural networks using binary classification tasks from MNIST, finding that required capacity scales with task complexity, networks remain robust up to 95% sparsity, and over-parameterization enhances robustness to input corruption.

Although modern deep learning often relies on massive over-parameterized models, the fundamental interplay between capacity, sparsity, and robustness in low-capacity networks remains a vital area of study. We introduce a controlled framework to investigate these properties by creating a suite of binary classification tasks from the MNIST dataset with increasing visual difficulty (e.g., 0 and 1 vs. 4 and 9). Our experiments reveal three core findings. First, the minimum model capacity required for successful generalization scales directly with task complexity. Second, these trained networks are robust to extreme magnitude pruning (up to 95% sparsity), revealing the existence of sparse, high-performing subnetworks. Third, we show that over-parameterization provides a significant advantage in robustness against input corruption. Interpretability analysis via saliency maps further confirms that these identified sparse subnetworks preserve the core reasoning process of the original dense models. This work provides a clear, empirical demonstration of the foundational trade-offs governing simple neural networks.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes