LGAIDBSep 27, 2019

Clustering Uncertain Data via Representative Possible Worlds with Consistency Learning

arXiv:1909.12514v11 citations
Originality Incremental advance
AI Analysis

This work addresses clustering uncertain data for applications like the internet of things, but it appears incremental as it builds on existing possible world approaches with specific improvements.

The paper tackles the problem of clustering uncertain data by addressing issues in existing possible world based algorithms, such as negative effects from marginal possible worlds and lack of consistency utilization, and proposes a representative possible world based consistent clustering algorithm that outperforms state-of-the-art methods.

Clustering uncertain data is an essential task in data mining for the internet of things. Possible world based algorithms seem promising for clustering uncertain data. However, there are two issues in existing possible world based algorithms: (1) They rely on all the possible worlds and treat them equally, but some marginal possible worlds may cause negative effects. (2) They do not well utilize the consistency among possible worlds, since they conduct clustering or construct the affinity matrix on each possible world independently. In this paper, we propose a representative possible world based consistent clustering (RPC) algorithm for uncertain data. First, by introducing representative loss and using Jensen-Shannon divergence as the distribution measure, we design a heuristic strategy for the selection of representative possible worlds, thus avoiding the negative effects caused by marginal possible worlds. Second, we integrate a consistency learning procedure into spectral clustering to deal with the representative possible worlds synergistically, thus utilizing the consistency to achieve better performance. Experimental results show that our proposed algorithm performs better than the state-of-the-art algorithms.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes