Reducing Hallucinations in LLM-based Scientific Literature Analysis Using Peer Context Outlier Detection
This work addresses the issue of inaccurate data extraction from scientific literature for researchers, though it is incremental as it builds on existing methods by adding peer context.
The paper tackled the problem of hallucinations in LLM-based scientific literature analysis by introducing Peer Context Outlier Detection (P-COD), which uses relationships between documents to improve extraction accuracy, achieving up to 98% precision in outlier detection across six science domains.
Reducing hallucinations in Large Language Models (LLMs) is essential for improving the accuracy of data extraction from large text corpora. Current methods, like prompt engineering and chain-of-thought prompting, focus on individual documents but fail to consider relationships across a corpus. This paper introduces Peer Context Outlier Detection (P-COD), a novel approach that uses the relationships between documents to improve extraction accuracy. Our application domain is in scientific literature summarization, where papers with similar experiment settings should draw similar conclusions. By comparing extracted data to validated peer information within the corpus, we adjust confidence scores and flag low-confidence results for expert review. High-confidence results, supported by peer validation, are considered reliable. Our experiments demonstrate up to 98% precision in outlier detection across 6 domains of science, demonstrating that our design reduces hallucinations, enhances trust in automated systems, and allows researchers to focus on ambiguous cases, streamlining the data extraction workflows.