LG CVMar 16

Sampling-guided exploration of active feature selection policies

Gabriel Bernardino, Anders Jonsson, Patrick Clarysse, Nicolas Duchateau

arXiv:2603.151106.6h-index: 2

AI Analysis

This addresses the challenge of efficient feature acquisition for predictive modeling, particularly for instances where global feature choices are suboptimal, though it is incremental as it builds on prior work with heuristic and regularization improvements.

The paper tackles the problem of active feature selection for machine learning models by proposing a reinforcement learning approach that sequentially recommends features to acquire, balancing performance and cost. It achieves better accuracy and policy complexity than state-of-the-art methods on datasets with up to 56 features and 4500 samples.

Determining the most appropriate features for machine learning predictive models is challenging regarding performance and feature acquisition costs. In particular, global feature choice is limited given that some features will only benefit a subset of instances. In previous work, we proposed a reinforcement learning approach to sequentially recommend which modality to acquire next to reach the best information/cost ratio, based on the instance-specific information already acquired. We formulated the problem as a Markov Decision Process where the state's dimensionality changes during the episode, avoiding data imputation, contrary to existing works. However, this only allowed processing a small number of features, as all possible combinations of features were considered. Here, we address these limitations with two contributions: 1) we expand our framework to larger datasets with a heuristic-based strategy that focuses on the most promising feature combinations, and 2) we introduce a post-fit regularisation strategy that reduces the number of different feature combinations, leading to compact sequences of decisions. We tested our method on four binary classification datasets (one involving high-dimensional variables), the largest of which had 56 features and 4500 samples. We obtained better performance than state-of-the-art methods, both in terms of accuracy and policy complexity.

View on arXiv PDF

Similar