LGHEP-THMLApr 16, 2019

On the Mathematical Understanding of ResNet with Feynman Path Integral

arXiv:1904.07568v19 citations
Originality Incremental advance
AI Analysis

This provides a foundational mathematical framework for understanding ResNet, potentially aiding in the future design of convolutional neural network architectures, though it is incremental in applying existing physics concepts to neural networks.

The paper tackles the problem of mathematically understanding Residual Networks (ResNet) by establishing a connection to Feynman path integrals, proving that residual blocks are equivalent to partial differential equations and that ResNet transformations can be converted to path integrals, which helps explain ResNet's advantage in addressing gradient vanishing issues.

In this paper, we aim to understand Residual Network (ResNet) in a scientifically sound way by providing a bridge between ResNet and Feynman path integral. In particular, we prove that the effect of residual block is equivalent to partial differential equation, and the ResNet transforming process can be equivalently converted to Feynman path integral. These conclusions greatly help us mathematically understand the advantage of ResNet in addressing the gradient vanishing issue. More importantly, our analyses offer a path integral view of ResNet, and demonstrate that the output of certain network can be obtained by adding contributions of all paths. Moreover, the contribution of each path is proportional to e^{-S}, where S is the action given by time integral of Lagrangian L. This lays the solid foundation in the understanding of ResNet, and provides insights in the future design of convolutional neural network architecture. Based on these results, we have designed the network using partial differential operators, which further validates our theoritical analyses.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes