MLLGMEMay 25, 2022

Conformal Prediction Intervals with Temporal Dependence

arXiv:2205.12940v315 citationsh-index: 26
Originality Incremental advance
AI Analysis

This addresses the need for reliable uncertainty quantification in time series applications like healthcare, offering a practical post-hoc method for improving coverage guarantees.

The paper tackles the problem of constructing valid prediction intervals in time series regression with cross-sectional data, showing that distribution-free longitudinal validity is impossible but proposing CPTD, which maintains strict cross-sectional validity while improving longitudinal coverage and often providing narrower intervals.

Cross-sectional prediction is common in many domains such as healthcare, including forecasting tasks using electronic health records, where different patients form a cross-section. We focus on the task of constructing valid prediction intervals (PIs) in time series regression with a cross-section. A prediction interval is considered valid if it covers the true response with (a pre-specified) high probability. We first distinguish between two notions of validity in such a setting: cross-sectional and longitudinal. Cross-sectional validity is concerned with validity across the cross-section of the time series data, while longitudinal validity accounts for the temporal dimension. Coverage guarantees along both these dimensions are ideally desirable; however, we show that distribution-free longitudinal validity is theoretically impossible. Despite this limitation, we propose Conformal Prediction with Temporal Dependence (CPTD), a procedure that is able to maintain strict cross-sectional validity while improving longitudinal coverage. CPTD is post-hoc and light-weight, and can easily be used in conjunction with any prediction model as long as a calibration set is available. We focus on neural networks due to their ability to model complicated data such as diagnosis codes for time series regression, and perform extensive experimental validation to verify the efficacy of our approach. We find that CPTD outperforms baselines on a variety of datasets by improving longitudinal coverage and often providing more efficient (narrower) PIs.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes