CLFeb 15, 2025

Back Attention: Understanding and Enhancing Multi-Hop Reasoning in Large Language Models

arXiv:2502.10835v122 citationsh-index: 55EMNLP
Originality Incremental advance
AI Analysis

This addresses the challenge of enhancing multi-hop reasoning in large language models for tasks like knowledge-based question answering, though it is incremental as it builds on existing interpretability methods.

The paper tackled the problem of how large language models perform latent multi-hop reasoning, identifying that failures often occur in the relation attribute extraction stage, and proposed back attention, a novel mechanism that improved accuracy on five reasoning datasets, with a 1-layer transformer achieving the performance of a 2-layer transformer.

We investigate how large language models perform latent multi-hop reasoning in prompts like "Wolfgang Amadeus Mozart's mother's spouse is". To analyze this process, we introduce logit flow, an interpretability method that traces how logits propagate across layers and positions toward the final prediction. Using logit flow, we identify four distinct stages in single-hop knowledge prediction: (A) entity subject enrichment, (B) entity attribute extraction, (C) relation subject enrichment, and (D) relation attribute extraction. Extending this analysis to multi-hop reasoning, we find that failures often stem from the relation attribute extraction stage, where conflicting logits reduce prediction accuracy. To address this, we propose back attention, a novel mechanism that enables lower layers to leverage higher-layer hidden states from different positions during attention computation. With back attention, a 1-layer transformer achieves the performance of a 2-layer transformer. Applied to four LLMs, back attention improves accuracy on five reasoning datasets, demonstrating its effectiveness in enhancing latent multi-hop reasoning ability.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes