LGMLAug 9, 2014

Characterizing predictable classes of processes

arXiv:1408.2036v25 citations
Originality Incremental advance
AI Analysis

This addresses theoretical foundations for sequence prediction in machine learning and statistics, providing a characterization of predictable classes, but it is incremental as it builds on existing Bayesian and convergence frameworks.

The paper tackles the problem of sequence prediction for arbitrary classes of stochastic processes, showing that if a predictor exists whose conditional probabilities converge to the true ones, it can be obtained as a Bayesian predictor with a prior on a countable set, with results proven for both strong (total variation) and weak (expected average Kullback-Leibler divergence) performance measures.

The problem is sequence prediction in the following setting. A sequence x1,..., xn,... of discrete-valued observations is generated according to some unknown probabilistic law (measure) mu. After observing each outcome, it is required to give the conditional probabilities of the next observation. The measure mu belongs to an arbitrary class C of stochastic processes. We are interested in predictors ? whose conditional probabilities converge to the 'true' mu-conditional probabilities if any mu { C is chosen to generate the data. We show that if such a predictor exists, then a predictor can also be obtained as a convex combination of a countably many elements of C. In other words, it can be obtained as a Bayesian predictor whose prior is concentrated on a countable set. This result is established for two very different measures of performance of prediction, one of which is very strong, namely, total variation, and the other is very weak, namely, prediction in expected average Kullback-Leibler divergence.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes