Construction of neural networks for realization of localized deep learning
This work addresses the need for theoretical foundations in deep learning, particularly for localized manifold learning, but it appears incremental as it builds on existing deep-net concepts with specific layer tasks.
The paper tackles the theoretical development of deep learning by introducing a deep neural network approach for localized manifold learning, achieving an approximation order of O(m^{-2s/(2s+d)}) for regression functions with regularity s, where d is the manifold dimension and m is the number of sample points.
The subject of deep learning has recently attracted users of machine learning from various disciplines, including: medical diagnosis and bioinformatics, financial market analysis and online advertisement, speech and handwriting recognition, computer vision and natural language processing, time series forecasting, and search engines. However, theoretical development of deep learning is still at its infancy. The objective of this paper is to introduce a deep neural network (also called deep-net) approach to localized manifold learning, with each hidden layer endowed with a specific learning task. For the purpose of illustrations, we only focus on deep-nets with three hidden layers, with the first layer for dimensionality reduction, the second layer for bias reduction, and the third layer for variance reduction. A feedback component also designed to eliminate outliers. The main theoretical result in this paper is the order $\mathcal O\left(m^{-2s/(2s+d)}\right)$ of approximation of the regression function with regularity $s$, in terms of the number $m$ of sample points, where the (unknown) manifold dimension $d$ replaces the dimension $D$ of the sampling (Euclidean) space for shallow nets.