MELGSTMLDec 26, 2023

SymmPI: Predictive Inference for Data with Group Symmetries

arXiv:2312.16160v315 citationsh-index: 28Journal of the Royal Statistical Society Series B: Statistical Methodology
Originality Incremental advance
AI Analysis

This addresses predictive inference for data with general symmetries, such as in physics or networks, offering a novel approach but with incremental improvements over existing methods.

The paper tackles the problem of quantifying prediction uncertainty for data with group symmetries, proposing SymmPI, a method that achieves valid coverage under distributional invariance and performs favorably in simulations and empirical analysis.

Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often -- for instance, in standard conformal prediction -- relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive inference aim to predict unobserved outcomes in sequences of feature-outcome observations. Meanwhile, there is interest in predictive inference under more general observation models (e.g., for partially observed features) and for data satisfying more general distributional symmetries (e.g., rotationally invariant or coordinate-independent observations in physics). Here we propose SymmPI, a methodology for predictive inference when data distributions have general group symmetries in arbitrary observation models. Our methods leverage the novel notion of distributional equivariant transformations, which process the data while preserving their distributional invariances. We show that SymmPI has valid coverage under distributional invariance and characterize its performance under distribution shift, recovering recent results as special cases. We apply SymmPI to predict unobserved values associated to vertices in a network, where the distribution is unchanged under relabelings that keep the network structure unchanged. In several simulations in a two-layer hierarchical model, and in an empirical data analysis example, SymmPI performs favorably compared to existing methods.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes