MLLGJul 21, 2023

Bounded P-values in Parametric Programming-based Selective Inference

arXiv:2307.11351v22 citationsh-index: 28
Originality Incremental advance
AI Analysis

This work addresses computational efficiency for statisticians and machine learning practitioners dealing with selective inference, though it is incremental as it builds on existing parametric programming-based methods.

The authors tackled the high computational cost of parametric programming-based selective inference by proposing a method to compute lower and upper bounds for p-values, along with efficient search strategies to improve these bounds, demonstrating effectiveness in feature selection for linear models and attention region identification in deep neural networks.

Selective inference (SI) has been actively studied as a promising framework for statistical hypothesis testing for data-driven hypotheses. The basic idea of SI is to make inferences conditional on an event that a hypothesis is selected. In order to perform SI, this event must be characterized in a traceable form. When selection event is too difficult to characterize, additional conditions are introduced for tractability. This additional conditions often causes the loss of power, and this issue is referred to as over-conditioning in [Fithian et al., 2014]. Parametric programming-based SI (PP-based SI) has been proposed as one way to address the over-conditioning issue. The main problem of PP-based SI is its high computational cost due to the need to exhaustively explore the data space. In this study, we introduce a procedure to reduce the computational cost while guaranteeing the desired precision, by proposing a method to compute the lower and upper bounds of p-values. We also proposed three types of search strategies that efficiently improve these bounds. We demonstrate the effectiveness of the proposed method in hypothesis testing problems for feature selection in linear models and attention region identification in deep neural networks.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes