LGNov 22, 2024

HyReaL: Clustering Attributed Graph via Hyper-Complex Space Representation Learning

arXiv:2411.14727v2h-index: 15
Originality Incremental advance
AI Analysis

This work addresses attributed graph clustering for researchers and practitioners in machine learning, offering a novel method to improve representation learning, though it appears incremental as it builds on existing quaternion and graph clustering techniques.

The paper tackles the problem of clustering attributed graphs by addressing the Over-Smoothing effect in Graph Convolutional Networks, which homogenizes node representations, and introduces a hyper-complex space model using quaternion feature transformation to enhance attribute learning and alleviate this issue, resulting in more discriminative node representations suitable for various cluster numbers.

Clustering complex data in the form of attributed graphs has attracted increasing attention, where powerful graph representation is a critical prerequisite. However, the well-known Over-Smoothing (OS) effect makes Graph Convolutional Networks tend to homogenize the representation of graph nodes, while the existing OS solutions focus on alleviating the homogeneity of nodes' embeddings from the aspect of graph topology information, which is inconsistent with the attributed graph clustering objective. Therefore, we introduce hyper-complex space with powerful quaternion feature transformation to enhance the representation learning of the attributes. A generalized \textbf{Hy}per-complex space \textbf{Re}present\textbf{a}tion \textbf{L}earning (\textbf{HyReaL}) model is designed to: 1) bridge arbitrary dimensional attributes to the well-developed quaternion algebra with four parts, and 2) connect the learned representations to more generalized clustering objective without being restricted to a given number of clusters $k$. The novel introduction of quaternion benefits attributed graph clustering from two aspects: 1) enhanced attribute coupling learning capability allows complex attribute information to be sufficiently exploited in clustering, and 2) stronger learning capability makes it unnecessary to stack too many graph convolution layers, naturally alleviating the OS problem. It turns out that the node representations learned by HyReaL are more discriminative and widely suit downstream clustering with different $k$s. Extensive experiments including significance tests, ablation studies, qualitative results, etc., show the superiority of HyReaL.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes