LG MLMar 21, 2024

Thinking in Groups: Permutation Tests Reveal Near-Out-of-Distribution

Yasith Jayawardana, Dineth Jayakody, Sampath Jayarathna, Dushan N. Wadduwage

arXiv:2403.14058v24.61 citationsh-index: 26

Originality Incremental advance

AI Analysis

This addresses the challenge of unreliable predictions from biased or incomplete data in biomedical AI, offering a robust solution for near-OoD detection, though it is incremental as it builds on existing OoD detection concepts with a novel group-based approach.

The paper tackles the problem of detecting near-out-of-distribution (OoD) inputs in deep neural networks, especially in biomedical contexts with correlated data, by introducing the Homogeneous OoD (HOoD) framework, which uses permutation tests on groups of measurements to achieve reliable OoD detection and outperforms existing methods.

Deep neural networks (DNNs) have the potential to power many biomedical workflows, but training them on truly representative, IID datasets is often infeasible. Most models instead rely on biased or incomplete data, making them prone to out-of-distribution (OoD) inputs that closely resemble in-distribution samples. Such near-OoD cases are harder to detect than standard OOD benchmarks and can cause unreliable, even catastrophic, predictions. Biomedical assays, however, offer a unique opportunity: they often generate multiple correlated measurements per specimen through biological or technical replicates. Exploiting this insight, we introduce Homogeneous OoD (HOoD), a novel OoD detection framework for correlated data. HOoD projects groups of correlated measurements through a trained model and uses permutation-based hypothesis tests to compare them with known subpopulations. Each test yields an interpretable p-value, quantifying how well a group matches a subpopulation. By aggregating these p-values, HOoD reliably identifies OoD groups. In evaluations, HOoD consistently outperforms point-wise and ensemble-based OoD detectors, demonstrating its promise for robust real-world deployment.

View on arXiv PDF

Similar