LGAINAPRMLOct 31, 2023

Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory

arXiv:2310.20360v327 citationsh-index: 50
Originality Synthesis-oriented
AI Analysis

It serves as an educational resource for students and scientists without prior background in deep learning, offering mathematical insights into established methods, but it is incremental as it reviews existing knowledge rather than presenting new research.

This book provides a comprehensive mathematical introduction to deep learning, covering various neural network architectures, optimization algorithms, and theoretical aspects such as approximation capacities and generalization errors, with the goal of offering a solid foundation for beginners and practitioners.

This book aims to provide an introduction to the topic of deep learning algorithms. We review essential components of deep learning algorithms in full mathematical detail including different artificial neural network (ANN) architectures (such as fully-connected feedforward ANNs, convolutional ANNs, recurrent ANNs, residual ANNs, and ANNs with batch normalization) and different optimization algorithms (such as the basic stochastic gradient descent (SGD) method, accelerated methods, and adaptive methods). We also cover several theoretical aspects of deep learning algorithms such as approximation capacities of ANNs (including a calculus for ANNs), optimization theory (including Kurdyka-Łojasiewicz inequalities), and generalization errors. In the last part of the book some deep learning approximation methods for PDEs are reviewed including physics-informed neural networks (PINNs) and deep Galerkin methods. We hope that this book will be useful for students and scientists who do not yet have any background in deep learning at all and would like to gain a solid foundation as well as for practitioners who would like to obtain a firmer mathematical understanding of the objects and methods considered in deep learning.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes