AICLIRSep 1, 2024

Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph

arXiv:2409.00727v12 citationsh-index: 6
Originality Incremental advance
AI Analysis

This work addresses the lack of supervision signals in graph-based learning for applications like academia and social networks, offering a novel but incremental enhancement to existing methods.

The paper tackles the challenge of few- and zero-shot node classification on text-attributed graphs by introducing Hound, which uses node perturbation, text matching, and semantics negation to generate additional supervision signals, resulting in accuracy improvements of over 5% compared to state-of-the-art baselines across 5 datasets.

Text-attributed graph (TAG) is an important type of graph structured data with text descriptions for each node. Few- and zero-shot node classification on TAGs have many applications in fields such as academia and social networks. However, the two tasks are challenging due to the lack of supervision signals, and existing methods only use the contrastive loss to align graph-based node embedding and language-based text embedding. In this paper, we propose Hound to improve accuracy by introducing more supervision signals, and the core idea is to go beyond the node-text pairs that come with data. Specifically, we design three augmentation techniques, i.e., node perturbation, text matching, and semantics negation to provide more reference nodes for each text and vice versa. Node perturbation adds/drops edges to produce diversified node embeddings that can be matched with a text. Text matching retrieves texts with similar embeddings to match with a node. Semantics negation uses a negative prompt to construct a negative text with the opposite semantics, which is contrasted with the original node and text. We evaluate Hound on 5 datasets and compare with 13 state-of-the-art baselines. The results show that Hound consistently outperforms all baselines, and its accuracy improvements over the best-performing baseline are usually over 5%.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes