MLLGNov 11, 2022

RFFNet: Large-Scale Interpretable Kernel Methods via Random Fourier Features

arXiv:2211.06410v21 citationsh-index: 25
Originality Incremental advance
AI Analysis

This work addresses the need for scalable and interpretable kernel methods in machine learning, offering a domain-specific improvement over existing approximations.

The authors tackled the problem of scaling kernel methods to large datasets while maintaining interpretability, by introducing RFFNet, a method that learns kernel relevances via stochastic optimization, achieving low prediction error and effective feature identification on simulated and real-world data.

Kernel methods provide a flexible and theoretically grounded approach to nonlinear and nonparametric learning. While memory and run-time requirements hinder their applicability to large datasets, many low-rank kernel approximations, such as random Fourier features, were recently developed to scale up such kernel methods. However, these scalable approaches are based on approximations of isotropic kernels, which cannot remove the influence of irrelevant features. In this work, we design random Fourier features for a family of automatic relevance determination (ARD) kernels, and introduce RFFNet, a new large-scale kernel method that learns the kernel relevances' on the fly via first-order stochastic optimization. We present an effective initialization scheme for the method's non-convex objective function, evaluate if hard-thresholding RFFNet's learned relevances yield a sensible rule for variable selection, and perform an extensive ablation study of RFFNet's components. Numerical validation on simulated and real-world data shows that our approach has a small memory footprint and run-time, achieves low prediction error, and effectively identifies relevant features, thus leading to more interpretable solutions. We supply users with an efficient, PyTorch-based library, that adheres to the scikit-learn standard API and code for fully reproducing our results.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes