PRLGMLJun 21, 2019

Testing randomness

arXiv:1906.09256v372 citations
Originality Synthesis-oriented
AI Analysis

This work addresses the fundamental problem of testing randomness and exchangeability in sequential data for statistical machine learning and nonparametric statistics, but it is incremental as it reviews and extends known methods.

The paper reviews online methods for testing randomness and exchangeability hypotheses, focusing on conformal martingales, and provides validity results that limit false alarm probabilities and efficiency results linking these concepts.

The hypothesis of randomness is fundamental in statistical machine learning and in many areas of nonparametric statistics; it says that the observations are assumed to be independent and coming from the same unknown probability distribution. This hypothesis is close, in certain respects, to the hypothesis of exchangeability, which postulates that the distribution of the observations is invariant with respect to their permutations. This paper reviews known methods of testing the two hypotheses concentrating on the online mode of testing, when the observations arrive sequentially. All known online methods for testing these hypotheses are based on conformal martingales, which are defined and studied in detail. The paper emphasizes conceptual and practical aspects and states two kinds of results. Validity results limit the probability of a false alarm or the frequency of false alarms for various procedures based on conformal martingales, including conformal versions of the CUSUM and Shiryaev-Roberts procedures. Efficiency results establish connections between randomness, exchangeability, and conformal martingales.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes