LGAIMar 10, 2021

Partial Differential Equations is All You Need for Generating Neural Architectures -- A Theory for Physical Artificial Intelligence Systems

arXiv:2103.08313v23 citations
Originality Highly original
AI Analysis

This work proposes a foundational theory for physical artificial intelligence systems, potentially enabling analog computing device design, but it appears incremental as it builds on existing physical equations.

The authors generalized physical equations into neural partial differential equations (NPDE) to generate neural network architectures like MLPs, CNNs, and RNNs, aiming to provide an interpretable physical image for deep learning.

In this work, we generalize the reaction-diffusion equation in statistical physics, Schrödinger equation in quantum mechanics, Helmholtz equation in paraxial optics into the neural partial differential equations (NPDE), which can be considered as the fundamental equations in the field of artificial intelligence research. We take finite difference method to discretize NPDE for finding numerical solution, and the basic building blocks of deep neural network architecture, including multi-layer perceptron, convolutional neural network and recurrent neural networks, are generated. The learning strategies, such as Adaptive moment estimation, L-BFGS, pseudoinverse learning algorithms and partial differential equation constrained optimization, are also presented. We believe it is of significance that presented clear physical image of interpretable deep neural networks, which makes it be possible for applying to analog computing device design, and pave the road to physical artificial intelligence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes