LG AI MLNov 19, 2024

K-means Derived Unsupervised Feature Selection using Improved ADMM

arXiv:2411.15197v12.61 citationsh-index: 3

Originality Incremental advance

AI Analysis

This addresses feature selection in unsupervised learning for data analysis, but it is incremental as it adapts existing methods like K-means and ADMM.

The paper tackles unsupervised feature selection for high-dimensional data by proposing K-means Derived Unsupervised Feature Selection (K-means UFS), which uses the K-means objective instead of spectral analysis and solves the NP-hard optimization with an improved ADMM, showing it is more effective than baselines in selecting features for clustering.

Feature selection is important for high-dimensional data analysis and is non-trivial in unsupervised learning problems such as dimensionality reduction and clustering. The goal of unsupervised feature selection is finding a subset of features such that the data points from different clusters are well separated. This paper presents a novel method called K-means Derived Unsupervised Feature Selection (K-means UFS). Unlike most existing spectral analysis based unsupervised feature selection methods, we select features using the objective of K-means. We develop an alternating direction method of multipliers (ADMM) to solve the NP-hard optimization problem of our K-means UFS model. Extensive experiments on real datasets show that our K-means UFS is more effective than the baselines in selecting features for clustering.

View on arXiv PDF

Similar