OC LG ML · Aug 28, 2018

Weighted total variation based convex clustering

arXiv:1808.09144v12.4

Originality: Incremental advance

AI Analysis

This work addresses the fundamental problem of data clustering for researchers and practitioners, offering an incremental improvement over existing convex relaxation methods.

The paper tackles the problem of data clustering by proposing a weighted convex clustering model that improves upon existing total variation methods, establishing sharper exact clustering properties applicable to general data and demonstrating better empirical performance compared to standard methods.

Data clustering is a fundamental problem with a wide range of applications. Standard methods, e.g., the $k$-means method, usually require solving a non-convex optimization problem. Recently, total variation based convex relaxation of the $k$-means model has emerged as an attractive alternative for data clustering. However, the existing results on its exact clustering property, i.e., the condition imposed on the data under which the method provably identifies all cluster memberships correctly, are applicable only to very specific data and are also much more restrictive than those of some other methods. This paper revisits total variation based convex clustering by proposing a convex model based on a weighted sum of $\ell_1$-norms. Its exact clustering property, established in this paper in both deterministic and probabilistic contexts, is applicable to general data and is much sharper than the existing results. These results provide good insights to advance the research on convex clustering. Moreover, the experiments demonstrate that the proposed convex model has better empirical performance than standard clustering methods, which suggests its potential in practice.
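To make the phrase "weighted sum-of-$\ell_1$-norm" concrete, the sketch below evaluates a generic objective of that type: a squared-error fidelity term plus a weighted $\ell_1$ penalty on pairwise differences between per-point centroids. The exact objective and weight choice used in the paper are not given in the abstract, so the form here, the function names `weighted_tv_objective` and `clusters_from_centroids`, and the parameters `W`, `alpha`, and `tol` are all illustrative assumptions, not the authors' method.

```python
import numpy as np

def weighted_tv_objective(X, U, W, alpha):
    """Assumed generic form (not taken from the paper):
    0.5 * ||X - U||_F^2 + alpha * sum_{i<j} W[i, j] * ||U[i] - U[j]||_1
    X: (n, d) data, U: (n, d) per-point centroids, W: (n, n) pairwise weights.
    """
    fidelity = 0.5 * np.sum((X - U) ** 2)
    i, j = np.triu_indices(X.shape[0], k=1)            # all pairs with i < j
    pair_l1 = np.abs(U[i] - U[j]).sum(axis=1)          # l1 distance per pair
    return fidelity + alpha * np.sum(W[i, j] * pair_l1)

def clusters_from_centroids(U, tol=1e-8):
    """Group points whose centroids coincide (up to tol in l1 distance)."""
    labels = -np.ones(len(U), dtype=int)
    next_label = 0
    for a in range(len(U)):
        if labels[a] < 0:
            same = np.abs(U - U[a]).sum(axis=1) <= tol
            labels[same & (labels < 0)] = next_label
            next_label += 1
    return labels
```

In a model of this kind, a minimizer `U` would be obtained with a convex solver (not shown), and points whose rows of `U` have fused to the same vector are read off as one cluster, which is what `clusters_from_centroids` does.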
