LGAINov 28, 2022

Weight Predictor Network with Feature Selection for Small Sample Tabular Biomedical Data

Cambridge
arXiv:2211.15616v120 citationsh-index: 26Has Code
Originality Incremental advance
AI Analysis

This addresses the problem of overfitting for researchers working with small biomedical datasets, though it is incremental as it builds on existing neural network approaches.

The paper tackles overfitting in high-dimensional, small-sample tabular biomedical data by proposing WPFS, a method that reduces learnable parameters and performs feature selection, and shows it outperforms other methods on nine real-world datasets.

Tabular biomedical data is often high-dimensional but with a very small number of samples. Although recent work showed that well-regularised simple neural networks could outperform more sophisticated architectures on tabular data, they are still prone to overfitting on tiny datasets with many potentially irrelevant features. To combat these issues, we propose Weight Predictor Network with Feature Selection (WPFS) for learning neural networks from high-dimensional and small sample data by reducing the number of learnable parameters and simultaneously performing feature selection. In addition to the classification network, WPFS uses two small auxiliary networks that together output the weights of the first layer of the classification model. We evaluate on nine real-world biomedical datasets and demonstrate that WPFS outperforms other standard as well as more recent methods typically applied to tabular data. Furthermore, we investigate the proposed feature selection mechanism and show that it improves performance while providing useful insights into the learning task.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes