CLOct 15, 2024

GT2Vec: Large Language Models as Multi-Modal Encoders for Text and Graph-Structured Data

arXiv:2410.11235v22 citationsh-index: 40
AI Analysis

This addresses the challenge of exploiting heterogeneous graph and text data for applications such as retrieval and classification, representing an incremental advance over existing methods.

The paper tackles the problem of integrating graph and text data by proposing GT2Vec, a framework that uses Large Language Models with an MLP adapter and contrastive learning, achieving significant improvements across six datasets in tasks like question answering and retrieval.

Graph-structured information offers rich contextual information that can enhance language models by providing structured relationships and hierarchies, leading to more expressive embeddings for various applications such as retrieval, question answering, and classification. However, existing methods for integrating graph and text embeddings, often based on Multi-layer Perceptrons (MLPs) or shallow transformers, are limited in their ability to fully exploit the heterogeneous nature of these modalities. To overcome this, we propose GT2Vec, a simple yet effective framework that leverages Large Language Models (LLMs) to jointly encode text and graph data. Specifically, GT2Vec employs an MLP adapter to project graph embeddings into the same space as text embeddings, allowing the LLM to process both modalities jointly. Unlike prior work, we also introduce contrastive learning to align the graph and text spaces more effectively, thereby improving the quality of learned joint embeddings. Empirical results across six datasets spanning three tasks, knowledge graph-contextualized question answering, graph-text pair classification, and retrieval, demonstrate that GT2Vec consistently outperforms existing baselines, achieving significant improvements across multiple datasets. These results highlight GT2Vec's effectiveness in integrating graph and text data. Ablation studies further validate the effectiveness of our method.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes