MLLGSep 30, 2018

A Kernel Perspective for Regularizing Deep Neural Networks

arXiv:1810.00363v456 citations
Originality Incremental advance
AI Analysis

This work provides a theoretical framework for regularization in deep learning, potentially benefiting researchers and practitioners dealing with overfitting and robustness issues, though it appears incremental as it builds on existing principles.

The paper tackles the problem of regularizing deep neural networks by introducing a kernel perspective using reproducing kernel Hilbert space (RKHS) norms, which unifies existing methods and leads to new penalties, showing effectiveness in small datasets and adversarial robustness.

We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various practical strategies. Specifically, this perspective (i) provides a common umbrella for many existing regularization principles, including spectral norm and gradient penalties, or adversarial training, (ii) leads to new effective regularization penalties, and (iii) suggests hybrid strategies combining lower and upper bounds to get better approximations of the RKHS norm. We experimentally show this approach to be effective when learning on small datasets, or to obtain adversarially robust models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes