CL · May 11

PHAGE: Patent Heterogeneous Attention-Guided Graph Encoder for Representation Learning

arXiv:2605.10073 · 78.4
Predicted impact: top 74% in CL · last 90 days · Originality: Incremental advance
AI Analysis

For patent analysis and legal informatics, PHAGE provides a novel method to encode claim dependencies, improving representation learning for downstream tasks.

PHAGE introduces a heterogeneous graph encoder that models the directed dependency structure of patent claims, outperforming all baselines on classification, retrieval, and clustering tasks by leveraging intra-document claim topology as a stronger inductive bias than inter-document structure.

Patent claims form a directed dependency structure in which dependent claims inherit and refine the scope of earlier claims; however, existing patent encoders linearize claims as text and discard this hierarchy. Directly encoding this structure into self-attention poses two challenges: claim dependencies mix relation types that differ in semantics and extraction reliability, and the dependency graph is defined over claims while Transformers attend over tokens. PHAGE addresses the first challenge through a deterministic graph construction pipeline that separates near-deterministic legal citations from noisier rule-based technical relations, preserving type distinctions as heterogeneous edges. It addresses the second through a connectivity mask and learnable relation-aware biases that lift claim-level topology into token-level attention, allowing the encoder to differentially weight each relation type. A dual-granularity contrastive objective then aligns representations with both inter-patent taxonomy and intra-patent topology. PHAGE outperforms all baselines on classification, retrieval, and clustering, showing that intra-document claim topology is a stronger inductive bias than inter-document structure and that this bias persists in the encoder weights after training.
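The lifting step described in the abstract, from a claim-level graph to token-level attention, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the relation-type codes, the function names, the choice that a dependent claim attends to the claim it refers to, and the fixed bias values (standing in for learned parameters) are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical relation codes. The abstract only distinguishes legal citations
# from rule-based technical relations; SELF and NO_EDGE are added here.
NO_EDGE, SELF, LEGAL, TECHNICAL = 0, 1, 2, 3


def claim_relation_matrix(num_claims, edges):
    """Build a claim-by-claim matrix of relation codes from (src, dst, type) edges."""
    rel = np.full((num_claims, num_claims), NO_EDGE, dtype=np.int64)
    np.fill_diagonal(rel, SELF)  # tokens can always attend within their own claim
    for src, dst, rel_type in edges:
        rel[src, dst] = rel_type  # assumption: src claim attends to dst claim
    return rel


def lift_to_tokens(rel, token_claim_ids):
    """Expand claim-level relations so entry (i, j) is the relation between
    the claim owning token i and the claim owning token j."""
    ids = np.asarray(token_claim_ids)
    return rel[ids[:, None], ids[None, :]]


def masked_biased_attention(q, k, v, token_rel, rel_bias):
    """Single-head attention with a connectivity mask and per-relation bias."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = scores + rel_bias[token_rel]                      # relation-aware bias
    scores = np.where(token_rel != NO_EDGE, scores, -np.inf)   # connectivity mask
    scores = scores - scores.max(axis=-1, keepdims=True)       # stable softmax
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights


# Toy patent: claim 1 depends on claim 0, claim 2 depends on claim 1,
# and claim 2 has a (noisier) technical relation to claim 0.
edges = [(1, 0, LEGAL), (2, 1, LEGAL), (2, 0, TECHNICAL)]
token_claim_ids = [0, 0, 1, 1, 1, 2]  # which claim each of the 6 tokens belongs to

rel = claim_relation_matrix(3, edges)
token_rel = lift_to_tokens(rel, token_claim_ids)

# Stand-in for learned scalars, one per relation code (NO_EDGE entry is unused).
rel_bias = np.array([0.0, 0.0, 1.0, 0.25])

rng = np.random.default_rng(0)
q, k, v = rng.normal(size=(3, 6, 8))
out, weights = masked_biased_attention(q, k, v, token_rel, rel_bias)
print(weights.round(2))
```

In this sketch the mask decides which token pairs may interact at all, while the bias lets each relation type shift attention by a different amount. That is one simple way an encoder could weight near-deterministic legal citations differently from noisier technical relations.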
