LGMLJul 3, 2020

On the Similarity between the Laplace and Neural Tangent Kernels

arXiv:2007.01580v2114 citations
AI Analysis

This work provides a theoretical link between neural networks and classical kernels, potentially simplifying analysis for researchers in machine learning theory.

The paper shows that the Neural Tangent Kernel (NTK) for fully connected networks is closely related to the Laplace kernel, with both having the same eigenfunctions and eigenvalue decay rates for normalized data on a hypersphere, implying similar smoothness properties in their function classes. Experiments confirm that NTK and Laplace kernels perform almost identically on real data, suggesting insights about neural networks can be derived from the simpler Laplace kernel.

Recent theoretical work has shown that massively overparameterized neural networks are equivalent to kernel regressors that use Neural Tangent Kernels(NTK). Experiments show that these kernel methods perform similarly to real neural networks. Here we show that NTK for fully connected networks is closely related to the standard Laplace kernel. We show theoretically that for normalized data on the hypersphere both kernels have the same eigenfunctions and their eigenvalues decay polynomially at the same rate, implying that their Reproducing Kernel Hilbert Spaces (RKHS) include the same sets of functions. This means that both kernels give rise to classes of functions with the same smoothness properties. The two kernels differ for data off the hypersphere, but experiments indicate that when data is properly normalized these differences are not significant. Finally, we provide experiments on real data comparing NTK and the Laplace kernel, along with a larger class ofγ-exponential kernels. We show that these perform almost identically. Our results suggest that much insight about neural networks can be obtained from analysis of the well-known Laplace kernel, which has a simple closed-form.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes