DLApr 13

CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

arXiv:2510.17853100.05 citationsh-index: 13Has Code
AI Analysis

For researchers using LLMs for scientific writing, CiteGuard provides a more reliable method to verify citation faithfulness, addressing the unreliability of LLM-as-a-Judge.

CiteGuard reframes citation evaluation as citation attribution alignment and proposes a retrieval-aware agent that improves citation validation accuracy by 10 percentage points over baselines, achieving 68.1% on CiteME (near human 69.2%).

Large Language Models (LLMs) have emerged as powerful assistants for scientific writing. However, concerns remain about the quality and reliability of the generated text, including citation accuracy and faithfulness. While most recent work relies on methods such as LLM-as-a-Judge, the reliability of LLM-as-a-Judge alone is also in doubt. In this work, we reframe citation evaluation as a problem of citation attribution alignment, which assesses whether LLM-generated citations match those a human author would include for the same text. We propose CiteGuard, a retrieval-aware agent framework designed to provide more faithful grounding for citation validation. CiteGuard improves over the prior baseline by 10 percentage points and achieves up to 68.1% accuracy on the CiteME benchmark, approaching human performance (69.2%). It also identifies alternative valid citations and demonstrates generalization ability for cross-domain citation attribution. Our code is available at https://github.com/KathCYM/CiteGuard.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes