LG AINov 28, 2022

Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data

Andrei Margeloiu, Nikola Simidjievski, Pietro Lio, Mateja Jamnik

Cambridge

arXiv:2211.15616v110.420 citationsh-index: 26Has Code

Originality Incremental advance

AI Analysis

This addresses the problem of overfitting for researchers working with small biomedical datasets, though it is incremental as it builds on existing neural network approaches.

The paper tackles overfitting in high-dimensional, small-sample tabular biomedical data by proposing WPFS, a method that reduces learnable parameters and performs feature selection, and shows it outperforms other methods on nine real-world datasets.

Tabular biomedical data is often high-dimensional but with a very small number of samples. Although recent work showed that well-regularised simple neural networks could outperform more sophisticated architectures on tabular data, they are still prone to overfitting on tiny datasets with many potentially irrelevant features. To combat these issues, we propose Weight Predictor Network with Feature Selection (WPFS) for learning neural networks from high-dimensional and small sample data by reducing the number of learnable parameters and simultaneously performing feature selection. In addition to the classification network, WPFS uses two small auxiliary networks that together output the weights of the first layer of the classification model. We evaluate on nine real-world biomedical datasets and demonstrate that WPFS outperforms other standard as well as more recent methods typically applied to tabular data. Furthermore, we investigate the proposed feature selection mechanism and show that it improves performance while providing useful insights into the learning task.

View on arXiv PDF Code

Similar