LGJun 6, 2021

Reverse Engineering the Neural Tangent Kernel

arXiv:2106.03186v414 citations
Originality Highly original
AI Analysis

This provides a method for translating high-performing kernels into efficient neural network designs, addressing a challenge in deep learning theory for researchers and practitioners.

The authors tackled the problem of principled neural architecture design by proving that any positive-semidefinite dot-product kernel can be realized as the NNGP or neural tangent kernel of a one-hidden-layer fully-connected network with an appropriate activation function, and they verified this numerically in experiments.

The development of methods to guide the design of neural networks is an important open challenge for deep learning theory. As a paradigm for principled neural architecture design, we propose the translation of high-performing kernels, which are better-understood and amenable to first-principles design, into equivalent network architectures, which have superior efficiency, flexibility, and feature learning. To this end, we constructively prove that, with just an appropriate choice of activation function, any positive-semidefinite dot-product kernel can be realized as either the NNGP or neural tangent kernel of a fully-connected neural network with only one hidden layer. We verify our construction numerically and demonstrate its utility as a design tool for finite fully-connected networks in several experiments.

Code Implementations2 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes