ML · LG · AP · CO · Oct 14, 2024

fastHDMI: Fast Mutual Information Estimation for High-Dimensional Data

arXiv:2410.10082v1 · h-index: 61
Originality: Incremental advance
AI Analysis

This work provides improved tools for variable selection in neuroimaging analysis, though it is incremental as it adapts existing methods to a specific domain.

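The paper addresses efficient variable screening in high-dimensional neuroimaging data by introducing fastHDMI, a Python package that applies mutual information estimation methods. In simulations on the preprocessed ABIDE dataset, FFTKDE-based estimation performs best for continuous outcomes with nonlinear associations, binning-based estimation performs best for binary outcomes with nonlinear probability preimages, and Pearson correlation matches or beats both in the linear settings.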

In this paper, we introduce fastHDMI, a Python package designed for efficient variable screening in high-dimensional datasets, particularly neuroimaging data. This work pioneers the application of three mutual information estimation methods for neuroimaging variable selection, a novel approach implemented via fastHDMI. These advancements enhance our ability to analyze the complex structures of neuroimaging datasets, providing improved tools for variable selection in high-dimensional spaces. Using the preprocessed ABIDE dataset, we evaluate the performance of these methods through extensive simulations. The tests cover a range of conditions, including linear and nonlinear associations, as well as continuous and binary outcomes. Our results highlight the superiority of the FFTKDE-based mutual information estimation for feature screening in continuous nonlinear outcomes, while binning-based methods outperform others for binary outcomes with nonlinear probability preimages. For linear simulations, both Pearson correlation and FFTKDE-based methods show comparable performance for continuous outcomes, while Pearson excels in binary outcomes with linear probability preimages. A comprehensive case study using the ABIDE dataset further demonstrates fastHDMI's practical utility, showcasing the predictive power of models built from variables selected using our screening techniques. This research affirms the computational efficiency and methodological strength of fastHDMI, significantly enriching the toolkit available for neuroimaging analysis.
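To make the screening idea concrete, here is a minimal sketch of marginal feature screening with a binning-based (histogram plug-in) mutual information estimate, next to absolute Pearson correlation. It is not the fastHDMI API: the function names, the bin count, and the simulated data are assumptions chosen for illustration, and the FFTKDE-based estimator from the abstract is not shown. Only NumPy is used.

```python
import numpy as np


def binned_mutual_information(x, y, bins=16):
    """Plug-in MI estimate (in nats) from a 2-D histogram of x and y."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()                # joint probabilities
    px = pxy.sum(axis=1, keepdims=True)      # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)      # marginal of y, shape (1, bins)
    mask = pxy > 0                           # skip empty cells to avoid log(0)
    return float(np.sum(pxy[mask] * np.log(pxy[mask] / (px @ py)[mask])))


def screen_by_mi(X, y, top_k=5, bins=16):
    """Score each column of X against y and return the top_k column indices."""
    scores = np.array(
        [binned_mutual_information(X[:, j], y, bins) for j in range(X.shape[1])]
    )
    ranking = np.argsort(scores)[::-1][:top_k]
    return ranking, scores


def abs_pearson(X, y):
    """Absolute Pearson correlation of each column of X with y."""
    Xc = X - X.mean(axis=0)
    yc = y - y.mean()
    denom = np.sqrt((Xc ** 2).sum(axis=0)) * np.sqrt((yc ** 2).sum())
    return np.abs(Xc.T @ yc / denom)


# Hypothetical simulation: 50 features, only column 3 drives the outcome,
# and it does so through a symmetric nonlinear (quadratic) relationship.
rng = np.random.default_rng(0)
n, p = 2000, 50
X = rng.standard_normal((n, p))
y = X[:, 3] ** 2 + 0.1 * rng.standard_normal(n)

mi_rank, mi_scores = screen_by_mi(X, y, top_k=5)
pearson_scores = abs_pearson(X, y)

print("Top features by binned MI:", mi_rank)
print("MI score of column 3:", mi_scores[3])
print("|Pearson| of column 3:", pearson_scores[3])
```

Because the quadratic signal is symmetric around zero, its linear correlation with the outcome is close to zero, while the mutual information score still separates it from the noise features. This is the kind of nonlinear setting in which the abstract reports that mutual information screening outperforms Pearson correlation. The bin count trades bias against variance, and 16 is an arbitrary choice here.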
