ML LGOct 28, 2019

Deep learning is adaptive to intrinsic dimensionality of model smoothness in anisotropic Besov space

arXiv:1910.12799v221.078 citations

Originality Highly original

AI Analysis

This provides theoretical insight into deep learning's performance on high-dimensional data like images, addressing a foundational problem in machine learning theory.

The study demonstrates that deep learning's approximation and estimation errors depend only on the average smoothness across directions in anisotropic Besov spaces, avoiding the curse of dimensionality for highly anisotropic functions, with results showing better dependence on input dimensionality compared to kernel methods.

Deep learning has exhibited superior performance for various tasks, especially for high-dimensional datasets, such as images. To understand this property, we investigate the approximation and estimation ability of deep learning on anisotropic Besov spaces. The anisotropic Besov space is characterized by direction-dependent smoothness and includes several function classes that have been investigated thus far. We demonstrate that the approximation error and estimation error of deep learning only depend on the average value of the smoothness parameters in all directions. Consequently, the curse of dimensionality can be avoided if the smoothness of the target function is highly anisotropic. Unlike existing studies, our analysis does not require a low-dimensional structure of the input data. We also investigate the minimax optimality of deep learning and compare its performance with that of the kernel method (more generally, linear estimators). The results show that deep learning has better dependence on the input dimensionality if the target function possesses anisotropic smoothness, and it achieves an adaptive rate for functions with spatially inhomogeneous smoothness.

View on arXiv PDF

Similar