LGAIMLFeb 5, 2021

Active Slices for Sliced Stein Discrepancy

arXiv:2102.03159v38 citations
Originality Highly original
AI Analysis

This work provides a more efficient method for goodness-of-fit tests and model learning for researchers and practitioners using Sliced Stein Discrepancy, offering significant speed improvements.

This paper addresses the computational expense and sub-optimal results of gradient-based optimization for finding optimal slicing directions in Sliced Stein Discrepancy (SSD). They propose a fast algorithm based on active sub-space construction and spectral decomposition, achieving a 14-80x speed-up in goodness-of-fit tests compared to gradient-based alternatives.

Sliced Stein discrepancy (SSD) and its kernelized variants have demonstrated promising successes in goodness-of-fit tests and model learning in high dimensions. Despite their theoretical elegance, their empirical performance depends crucially on the search of optimal slicing directions to discriminate between two distributions. Unfortunately, previous gradient-based optimisation approaches for this task return sub-optimal results: they are computationally expensive, sensitive to initialization, and they lack theoretical guarantees for convergence. We address these issues in two steps. First, we provide theoretical results stating that the requirement of using optimal slicing directions in the kernelized version of SSD can be relaxed, validating the resulting discrepancy with finite random slicing directions. Second, given that good slicing directions are crucial for practical performance, we propose a fast algorithm for finding such slicing directions based on ideas of active sub-space construction and spectral decomposition. Experiments on goodness-of-fit tests and model learning show that our approach achieves both improved performance and faster convergence. Especially, we demonstrate a 14-80x speed-up in goodness-of-fit tests when comparing with gradient-based alternatives.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes