LGJun 6, 2013

Performance analysis of unsupervised feature selection methods

arXiv:1306.1326v129 citations
Originality Synthesis-oriented
AI Analysis

This work addresses feature selection for classification tasks, but it is incremental as it applies existing methods without introducing new techniques.

The paper tackled the problem of selecting informative features to improve classification accuracy and reduce computational overhead by applying and evaluating four unsupervised feature selection methods: PCA, Rough PCA, USQR, and EDR, with results assessed using standard classification metrics.

Feature selection (FS) is a process which attempts to select more informative features. In some cases, too many redundant or irrelevant features may overpower main features for classification. Feature selection can remedy this problem and therefore improve the prediction accuracy and reduce the computational overhead of classification algorithms. The main aim of feature selection is to determine a minimal feature subset from a problem domain while retaining a suitably high accuracy in representing the original features. In this paper, Principal Component Analysis (PCA), Rough PCA, Unsupervised Quick Reduct (USQR) algorithm and Empirical Distribution Ranking (EDR) approaches are applied to discover discriminative features that will be the most adequate ones for classification. Efficiency of the approaches is evaluated using standard classification metrics.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes