DATA-ANSTAT-MECHMLAug 11, 2020

Active Importance Sampling for Variational Objectives Dominated by Rare Events: Consequences for Optimization and Generalization

arXiv:2008.06334v213 citations
AI Analysis

This work addresses a problem in statistical physics and machine learning theory where gathering sufficient data for accurate representations is difficult, offering a method to improve optimization and generalization in such cases.

The authors tackled the challenge of optimizing neural networks when objective functions are dominated by rare events, by combining rare events sampling with neural network optimization. They demonstrated that importance sampling reduces asymptotic variance, aiding generalization, and successfully learned dynamical transition pathways in high-dimensional, rare-data scenarios.

Deep neural networks, when optimized with sufficient data, provide accurate representations of high-dimensional functions; in contrast, function approximation techniques that have predominated in scientific computing do not scale well with dimensionality. As a result, many high-dimensional sampling and approximation problems once thought intractable are being revisited through the lens of machine learning. While the promise of unparalleled accuracy may suggest a renaissance for applications that require parameterizing representations of complex systems, in many applications gathering sufficient data to develop such a representation remains a significant challenge. Here we introduce an approach that combines rare events sampling techniques with neural network optimization to optimize objective functions that are dominated by rare events. We show that importance sampling reduces the asymptotic variance of the solution to a learning problem, suggesting benefits for generalization. We study our algorithm in the context of learning dynamical transition pathways between two states of a system, a problem with applications in statistical physics and implications in machine learning theory. Our numerical experiments demonstrate that we can successfully learn even with the compounding difficulties of high-dimension and rare data.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes