LGMLAug 11, 2024

Scalable and Adaptive Spectral Embedding for Attributed Graph Clustering

arXiv:2408.05765v13 citationsh-index: 4
Originality Incremental advance
AI Analysis

This addresses scalability issues in attributed graph clustering for large datasets, offering an incremental improvement over existing methods.

The paper tackles the problem of attributed graph clustering on large graphs by introducing SASE, a method that achieves a 6.9% improvement in ACC and a 5.87x speedup on the ArXiv dataset compared to the runner-up.

Attributed graph clustering, which aims to group the nodes of an attributed graph into disjoint clusters, has made promising advancements in recent years. However, most existing methods face challenges when applied to large graphs due to the expensive computational cost and high memory usage. In this paper, we introduce Scalable and Adaptive Spectral Embedding (SASE), a simple attributed graph clustering method devoid of parameter learning. SASE comprises three main components: node features smoothing via $k$-order simple graph convolution, scalable spectral clustering using random Fourier features, and adaptive order selection. With these designs, SASE not only effectively captures global cluster structures but also exhibits linear time and space complexity relative to the graph size. Empirical results demonstrate the superiority of SASE. For example, on the ArXiv dataset with 169K nodes and 1.17M edges, SASE achieves a 6.9\% improvement in ACC and a $5.87\times$ speedup compared to the runner-up, S3GC.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes