LGOCMLSep 23, 2025

Diagonal Linear Networks and the Lasso Regularization Path

arXiv:2509.18766v11 citations
Originality Incremental advance
AI Analysis

This provides theoretical insights into neural network training dynamics for researchers in machine learning, though it is incremental as it builds on prior analysis of diagonal linear networks.

The paper tackles the problem of understanding the implicit regularization in diagonal linear networks by showing that their training trajectory closely approximates the lasso regularization path, with training time acting as an inverse regularization parameter, supported by rigorous results and simulations.

Diagonal linear networks are neural networks with linear activation and diagonal weight matrices. Their theoretical interest is that their implicit regularization can be rigorously analyzed: from a small initialization, the training of diagonal linear networks converges to the linear predictor with minimal 1-norm among minimizers of the training loss. In this paper, we deepen this analysis showing that the full training trajectory of diagonal linear networks is closely related to the lasso regularization path. In this connection, the training time plays the role of an inverse regularization parameter. Both rigorous results and simulations are provided to illustrate this conclusion. Under a monotonicity assumption on the lasso regularization path, the connection is exact while in the general case, we show an approximate connection.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes