MLLGAPCAJun 28, 2019

Neural ODEs as the Deep Limit of ResNets with constant weights

arXiv:1906.12183v235 citations
Originality Incremental advance
AI Analysis

This provides a theoretical justification for viewing Neural ODEs as deep limits of ResNets, which is incremental but clarifies foundational relationships in deep learning.

The paper proves that stochastic gradient descent on a ResNet with constant weights converges to stochastic gradient descent for a Neural ODE in the deep limit, establishing a theoretical foundation for this connection.

In this paper we prove that, in the deep limit, the stochastic gradient descent on a ResNet type deep neural network, where each layer shares the same weight matrix, converges to the stochastic gradient descent for a Neural ODE and that the corresponding value/loss functions converge. Our result gives, in the context of minimization by stochastic gradient descent, a theoretical foundation for considering Neural ODEs as the deep limit of ResNets. Our proof is based on certain decay estimates for associated Fokker-Planck equations.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes