LGMLMar 2, 2019

One-Pass Incomplete Multi-view Clustering

arXiv:1903.00637v1160 citations
Originality Incremental advance
AI Analysis

This work addresses incomplete multi-view clustering for real-world applications with large-scale data, offering an incremental improvement over existing methods by enabling online processing.

The paper tackles the problem of incomplete multi-view clustering, where some views have missing instances, by proposing a one-pass framework (OPIMC) that reduces computational and memory costs compared to offline methods, achieving efficient and effective results as demonstrated on four real datasets.

Real data are often with multiple modalities or from multiple heterogeneous sources, thus forming so-called multi-view data, which receives more and more attentions in machine learning. Multi-view clustering (MVC) becomes its important paradigm. In real-world applications, some views often suffer from instances missing. Clustering on such multi-view datasets is called incomplete multi-view clustering (IMC) and quite challenging. To date, though many approaches have been developed, most of them are offline and have high computational and memory costs especially for large scale datasets. To address this problem, in this paper, we propose an One-Pass Incomplete Multi-view Clustering framework (OPIMC). With the help of regularized matrix factorization and weighted matrix factorization, OPIMC can relatively easily deal with such problem. Different from the existing and sole online IMC method, OPIMC can directly get clustering results and effectively determine the termination of iteration process by introducing two global statistics. Finally, extensive experiments conducted on four real datasets demonstrate the efficiency and effectiveness of the proposed OPIMC method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes