MLLGFeb 18, 2014

Automatic Construction and Natural-Language Description of Nonparametric Regression Models

arXiv:1402.4304v3254 citations
Originality Incremental advance
AI Analysis

This work addresses the need for automated model discovery and explanation in regression tasks, which is incremental as it builds on existing Gaussian process methods.

The paper tackles the problem of automatically constructing and describing nonparametric regression models by exploring an open-ended space of statistical models using Gaussian processes, resulting in state-of-the-art extrapolation performance over 13 real time series data sets.

This paper presents the beginnings of an automatic statistician, focusing on regression problems. Our system explores an open-ended space of statistical models to discover a good explanation of a data set, and then produces a detailed report with figures and natural-language text. Our approach treats unknown regression functions nonparametrically using Gaussian processes, which has two important consequences. First, Gaussian processes can model functions in terms of high-level properties (e.g. smoothness, trends, periodicity, changepoints). Taken together with the compositional structure of our language of models this allows us to automatically describe functions in simple terms. Second, the use of flexible nonparametric models and a rich language for composing them in an open-ended manner also results in state-of-the-art extrapolation performance evaluated over 13 real time series data sets from various domains.

Code Implementations3 repos
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes