MLLGNov 26, 2024

Validation-Free Sparse Learning: A Phase Transition Approach to Feature Selection

arXiv:2411.17180v41 citationsh-index: 1Has Code
Originality Highly original
AI Analysis

This addresses the need for more frugal and interpretable AI models by providing a flexible, validation-free approach to feature selection, though it appears incremental as it extends phase transition concepts from compressed sensing to broader sparse learners.

The paper tackles the problem of selecting the right amount of sparsity in models to reduce AI's environmental footprint and improve interpretability, proposing a validation-free method based on phase transitions that achieves a good balance between predictive accuracy and feature sparsity in real-world data.

The growing environmental footprint of artificial intelligence (AI), especially in terms of storage and computation, calls for more frugal and interpretable models. Sparse models (e.g., linear, neural networks) offer a promising solution by selecting only the most relevant features, reducing complexity, preventing over-fitting and enabling interpretation-marking a step towards truly intelligent AI. The concept of a right amount of sparsity (without too many false positive or too few true positive) is subjective. So we propose a new paradigm previously only observed and mathematically studied for compressed sensing (noiseless linear models): obtaining a phase transition in the probability of retrieving the relevant features. We show in practice how to obtain this phase transition for a class of sparse learners. Our approach is flexible and applicable to complex models ranging from linear to shallow and deep artificial neural networks while supporting various loss functions and sparsity-promoting penalties. It does not rely on cross-validation or on a validation set to select its single regularization parameter. For real-world data, it provides a good balance between predictive accuracy and feature sparsity. A Python package is available at https://github.com/VcMaxouuu/HarderLASSO containing all the simulations and ready-to-use models.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes