LG AI ML · May 28, 2019

Efficient Wrapper Feature Selection using Autoencoder and Model Based Elimination

arXiv:1905.11592v22.77 citations · Has Code

Originality: Incremental advance

AI Analysis

This is an incremental improvement for researchers and practitioners needing efficient feature selection in machine learning applications.

The authors address the computational cost of wrapper feature selection with AMBER, which combines a single ranker model with autoencoders to perform greedy backward elimination of features. They report higher classification accuracies than other computationally efficient state-of-the-art feature selection methods on four datasets.

We propose a computationally efficient wrapper feature selection method, called Autoencoder and Model Based Elimination of features using Relevance and Redundancy scores (AMBER), that uses a single ranker model along with autoencoders to perform greedy backward elimination of features. The ranker model is used to prioritize the removal of features that are not critical to the classification task, while the autoencoders are used to prioritize the elimination of correlated features. We demonstrate the superior feature selection ability of AMBER on four well-known datasets from different application domains by comparing its classification accuracies with those of other computationally efficient state-of-the-art feature selection techniques. Interestingly, we find that the ranker model used for feature selection does not necessarily have to be the same as the final classifier that is trained on the selected features. Finally, we note that a smaller number of features can lead to higher accuracies on some datasets, and hypothesize that overfitting the ranker model on the training set facilitates the selection of more salient features.
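
To make the idea in the abstract concrete, the following is a minimal sketch of greedy backward elimination driven by a relevance score and a redundancy score. It is not the authors' implementation, and several parts are assumptions made for illustration: the ranker is a small logistic regression trained once with gradient descent, relevance is measured as the ranker's loss after a feature is masked with its mean, the autoencoder is replaced by a plain linear least-squares reconstruction of each feature from the remaining ones, and the two scores are combined by an unweighted sum after scaling. All function names (`train_ranker`, `backward_eliminate`, and so on) are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_ranker(X, y, steps=500, lr=0.5):
    """Train one logistic-regression ranker (assumed stand-in for the ranker model)."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        grad = sigmoid(X @ w + b) - y
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b

def ranker_loss(X, y, w, b):
    """Binary cross-entropy of the fixed ranker on (possibly masked) inputs."""
    p = np.clip(sigmoid(X @ w + b), 1e-9, 1 - 1e-9)
    return float(-np.mean(y * np.log(p) + (1 - y) * np.log(1 - p)))

def reconstruction_error(X, j, others):
    """Assumed stand-in for the autoencoder: linear reconstruction of
    feature j from the other active features. A low error suggests that
    feature j is redundant given the rest."""
    A = X[:, others]
    coef, *_ = np.linalg.lstsq(A, X[:, j], rcond=None)
    return float(np.mean((A @ coef - X[:, j]) ** 2))

def backward_eliminate(X, y, n_keep):
    """Greedily drop features until n_keep (>= 1) remain."""
    # Standardize so that "masking with the mean" means setting a column to 0.
    X = (X - X.mean(axis=0)) / (X.std(axis=0) + 1e-12)
    w, b = train_ranker(X, y)  # the ranker is trained once, never retrained
    active = list(range(X.shape[1]))
    while len(active) > n_keep:
        rel, red = [], []
        for j in active:
            keep = [k for k in active if k != j]
            masked = np.zeros_like(X)      # removed features stay masked
            masked[:, keep] = X[:, keep]   # candidate j is masked as well
            rel.append(ranker_loss(masked, y, w, b))      # low = not critical
            red.append(reconstruction_error(X, j, keep))  # low = redundant
        rel, red = np.array(rel), np.array(red)
        # Assumed combination rule: unweighted sum of max-scaled scores.
        score = rel / (rel.max() + 1e-12) + red / (red.max() + 1e-12)
        active.pop(int(np.argmin(score)))  # drop the least useful feature
    return active

# Synthetic example: x2 nearly duplicates x0, x3 is pure noise.
rng = np.random.default_rng(0)
n = 400
x0, x1, x3 = rng.normal(size=n), rng.normal(size=n), rng.normal(size=n)
x2 = x0 + 0.05 * rng.normal(size=n)
X = np.column_stack([x0, x1, x2, x3])
y = (x0 + x1 > 0).astype(float)
selected = backward_eliminate(X, y, n_keep=2)
print(selected)
```

Because the ranker is trained only once and each candidate is scored by masking rather than retraining, the cost per elimination step stays at one forward pass and one small reconstruction fit per remaining feature. The classifier trained afterwards on `selected` can be a different model from the ranker, in line with the observation in the abstract.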
