LGNAJun 23, 2025

On the algorithmic construction of deep ReLU networks

arXiv:2506.19104v1h-index: 25Has Code
Originality Incremental advance
AI Analysis

This work provides insights into the expressivity of neural networks for researchers in theoretical machine learning, though it is incremental as it builds on existing mathematical understanding.

The authors tackled the problem of understanding what deep ReLU networks can represent by constructing them as algorithms rather than training from data, resulting in explicit networks that perform tasks like exact sorting with optimal computational complexity for large inputs and billions of parameters.

It is difficult to describe in mathematical terms what a neural network trained on data represents. On the other hand, there is a growing mathematical understanding of what neural networks are in principle capable of representing. Feedforward neural networks using the ReLU activation function represent continuous and piecewise linear functions and can approximate many others. The study of their expressivity addresses the question: which ones? Contributing to the available answers, we take the perspective of a neural network as an algorithm. In this analogy, a neural network is programmed constructively, rather than trained from data. An interesting example is a sorting algorithm: we explicitly construct a neural network that sorts its inputs exactly, not approximately, and that, in a sense, has optimal computational complexity if the input dimension is large. Such constructed networks may have several billion parameters. We construct and analyze several other examples, both existing and new. We find that, in these examples, neural networks as algorithms are typically recursive and parallel. Compared to conventional algorithms, ReLU networks are restricted by having to be continuous. Moreover, the depth of recursion is limited by the depth of the network, with deep networks having superior properties over shallow ones.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes