CL AI LGMay 21, 2025

Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions

David Thulke, Jakob Kemmler, Christian Dugast, Hermann Ney

arXiv:2505.15633v14.91 citationsh-index: 112025 IEEE 25th International Symposium on Cluster, Cloud and Internet Computing Workshops (CCGridW)

Originality Incremental advance

AI Analysis

This addresses the issue of factual hallucinations in making climate documents accessible for researchers, policymakers, and the public, but it is incremental as it builds on existing models and methods.

The paper tackled the problem of ensuring faithfulness in retrieval-augmented generation for climate questions by developing ClimateGPT Faithful+, which improved faithfulness from 30% to 57% in supported atomic claims.

Large language models that use retrieval augmented generation have the potential to unlock valuable knowledge for researchers, policymakers, and the public by making long and technical climate-related documents more accessible. While this approach can help alleviate factual hallucinations by relying on retrieved passages as additional context, its effectiveness depends on whether the model's output remains faithful to these passages. To address this, we explore the automatic assessment of faithfulness of different models in this setting. We then focus on ClimateGPT, a large language model specialised in climate science, to examine which factors in its instruction fine-tuning impact the model's faithfulness. By excluding unfaithful subsets of the model's training data, we develop ClimateGPT Faithful+, which achieves an improvement in faithfulness from 30% to 57% in supported atomic claims according to our automatic metric.

View on arXiv PDF

Similar