MLLGOct 6, 2022

Probabilistic Model Incorporating Auxiliary Covariates to Control FDR

arXiv:2210.03178v1h-index: 77
Originality Incremental advance
AI Analysis

This addresses the challenge of improving statistical power while controlling FDR in large-scale data analysis, though it appears incremental by building on existing covariate-based methods.

The paper tackles the problem of controlling False Discovery Rate (FDR) in multiple hypothesis testing by incorporating auxiliary covariates, resulting in a method called NeurT-FDR that makes substantially more discoveries in three real datasets compared to baselines.

Controlling False Discovery Rate (FDR) while leveraging the side information of multiple hypothesis testing is an emerging research topic in modern data science. Existing methods rely on the test-level covariates while ignoring metrics about test-level covariates. This strategy may not be optimal for complex large-scale problems, where indirect relations often exist among test-level covariates and auxiliary metrics or covariates. We incorporate auxiliary covariates among test-level covariates in a deep Black-Box framework controlling FDR (named as NeurT-FDR) which boosts statistical power and controls FDR for multiple-hypothesis testing. Our method parametrizes the test-level covariates as a neural network and adjusts the auxiliary covariates through a regression framework, which enables flexible handling of high-dimensional features as well as efficient end-to-end optimization. We show that NeurT-FDR makes substantially more discoveries in three real datasets compared to competitive baselines.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes