LGMLMay 23, 2024

Nuclear Norm Regularization for Deep Learning

arXiv:2405.14544v210 citationsh-index: 3NIPS
Originality Incremental advance
AI Analysis

This addresses a scalability bottleneck for regularization techniques in deep learning, offering a practical solution for researchers and practitioners.

The paper tackles the intractability of nuclear norm regularization for deep learning by proposing an efficient approximation method, enabling its application to high-dimensional problems and demonstrating its utility in denoising and representation learning.

Penalizing the nuclear norm of a function's Jacobian encourages it to locally behave like a low-rank linear map. Such functions vary locally along only a handful of directions, making the Jacobian nuclear norm a natural regularizer for machine learning problems. However, this regularizer is intractable for high-dimensional problems, as it requires computing a large Jacobian matrix and taking its singular value decomposition. We show how to efficiently penalize the Jacobian nuclear norm using techniques tailor-made for deep learning. We prove that for functions parametrized as compositions $f = g \circ h$, one may equivalently penalize the average squared Frobenius norm of $Jg$ and $Jh$. We then propose a denoising-style approximation that avoids the Jacobian computations altogether. Our method is simple, efficient, and accurate, enabling Jacobian nuclear norm regularization to scale to high-dimensional deep learning problems. We complement our theory with an empirical study of our regularizer's performance and investigate applications to denoising and representation learning.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes