LGNASep 15, 2025

Nonlocal Neural Tangent Kernels via Parameter-Space Interactions

arXiv:2509.12467v1h-index: 1
Originality Incremental advance
AI Analysis

This work addresses a theoretical bottleneck for researchers studying neural network training dynamics, but it appears incremental as it builds directly on the NTK framework.

The authors tackled the limitation of the Neural Tangent Kernel (NTK) framework, which assumes differentiability, by proposing a Nonlocal Neural Tangent Kernel (NNTK) that uses nonlocal interactions in parameter space, enabling extension to nonsmooth functions and broader models.

The Neural Tangent Kernel (NTK) framework has provided deep insights into the training dynamics of neural networks under gradient flow. However, it relies on the assumption that the network is differentiable with respect to its parameters, an assumption that breaks down when considering non-smooth target functions or parameterized models exhibiting non-differentiable behavior. In this work, we propose a Nonlocal Neural Tangent Kernel (NNTK) that replaces the local gradient with a nonlocal interaction-based approximation in parameter space. Nonlocal gradients are known to exist for a wider class of functions than the standard gradient. This allows NTK theory to be extended to nonsmooth functions, stochastic estimators, and broader families of models. We explore both fixed-kernel and attention-based formulations of this nonlocal operator. We illustrate the new formulation with numerical studies.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes