MEAPCOMLMar 17, 2018

Provable Convex Co-clustering of Tensors

arXiv:1803.06518v249 citations
Originality Highly original
AI Analysis

This addresses the gap between statistical guarantees and computational efficiency for tensor clustering, which is important for scientific and business applications dealing with general-order tensor data.

The authors developed a convex formulation for tensor co-clustering that provides computational efficiency with polynomial costs and statistical guarantees including a non-asymptotic error bound, demonstrating a 'blessing of dimensionality' phenomenon not seen in vector/matrix clustering.

Cluster analysis is a fundamental tool for pattern discovery of complex heterogeneous data. Prevalent clustering methods mainly focus on vector or matrix-variate data and are not applicable to general-order tensors, which arise frequently in modern scientific and business applications. Moreover, there is a gap between statistical guarantees and computational efficiency for existing tensor clustering solutions due to the nature of their non-convex formulations. In this work, we bridge this gap by developing a provable convex formulation of tensor co-clustering. Our convex co-clustering (CoCo) estimator enjoys stability guarantees and its computational and storage costs are polynomial in the size of the data. We further establish a non-asymptotic error bound for the CoCo estimator, which reveals a surprising "blessing of dimensionality" phenomenon that does not exist in vector or matrix-variate cluster analysis. Our theoretical findings are supported by extensive simulated studies. Finally, we apply the CoCo estimator to the cluster analysis of advertisement click tensor data from a major online company. Our clustering results provide meaningful business insights to improve advertising effectiveness.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes