LGCVJan 30, 2022

Sparse Centroid-Encoder: A Nonlinear Model for Feature Selection

arXiv:2201.12910v2
AI Analysis

This provides a feature selection method for domains like biology and image processing, but it appears incremental as it builds on existing Centroid-Encoder architecture.

The authors tackled the problem of feature selection for multi-class data by proposing Sparse Centroid-Encoder (SCE), a nonlinear model that uses sparsity to identify minimal discriminatory feature sets, and empirically showed it often achieves better classification accuracy than state-of-the-art methods on test sets.

Autoencoders have been widely used as a nonlinear tool for data dimensionality reduction. While autoencoders don't utilize the label information, Centroid-Encoders (CE)\cite{ghosh2022supervised} use the class label in their learning process. In this study, we propose a sparse optimization using the Centroid-Encoder architecture to determine a minimal set of features that discriminate between two or more classes. The resulting algorithm, Sparse Centroid-Encoder (SCE), extracts discriminatory features in groups using a sparsity inducing $\ell_1$-norm while mapping a point to its class centroid. One key attribute of SCE is that it can extract informative features from a multi-modal data set, i.e., data sets whose classes appear to have multiple clusters. The algorithm is applied to a wide variety of real world data sets, including single-cell data, high dimensional biological data, image data, speech data, and accelerometer sensor data. We compared our method to various state-of-the-art feature selection techniques, including supervised Concrete Autoencoders (SCAE), Feature Selection Network (FsNet), deep feature selection (DFS), Stochastic Gate (STG), and LassoNet. We empirically showed that SCE features often produced better classification accuracy than other methods on sequester test set.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes