LG MLJun 1, 2024

Stein Random Feature Regression

Houston Warren, Rafael Oliveira, Fabio Ramos

arXiv:2406.00438v22.6Has Code

Originality Incremental advance

AI Analysis

This work addresses computational scalability and flexibility issues in large-scale regression for machine learning practitioners, representing an incremental improvement over existing methods.

The paper tackled the problem of improving random Fourier features for Gaussian processes by introducing Stein random features, which enhanced kernel approximation and Bayesian kernel learning, leading to superior performance in empirical comparisons.

In large-scale regression problems, random Fourier features (RFFs) have significantly enhanced the computational scalability and flexibility of Gaussian processes (GPs) by defining kernels through their spectral density, from which a finite set of Monte Carlo samples can be used to form an approximate low-rank GP. However, the efficacy of RFFs in kernel approximation and Bayesian kernel learning depends on the ability to tractably sample the kernel spectral measure and the quality of the generated samples. We introduce Stein random features (SRF), leveraging Stein variational gradient descent, which can be used to both generate high-quality RFF samples of known spectral densities as well as flexibly and efficiently approximate traditionally non-analytical spectral measure posteriors. SRFs require only the evaluation of log-probability gradients to perform both kernel approximation and Bayesian kernel learning that results in superior performance over traditional approaches. We empirically validate the effectiveness of SRFs by comparing them to baselines on kernel approximation and well-known GP regression problems.

View on arXiv PDF Code

Similar