CVAIFeb 8, 2025

XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion

arXiv:2502.05615v11 citationsh-index: 11Has Code
Originality Incremental advance
AI Analysis

This work addresses the problem of limited public understanding and participation in nuclear fusion research, which is significant for the scientific community and the general public interested in sustainable energy solutions, and is an incremental step in applying AI to science communication.

The authors tackled the problem of science communication in nuclear fusion by developing XiHeFusion, a large language model that can provide accurate and logical answers, with the model performing well on a test questionnaire of 180+ questions. The result is a model that can effectively communicate nuclear fusion knowledge to a broader audience.

Nuclear fusion is one of the most promising ways for humans to obtain infinite energy. Currently, with the rapid development of artificial intelligence, the mission of nuclear fusion has also entered a critical period of its development. How to let more people to understand nuclear fusion and join in its research is one of the effective means to accelerate the implementation of fusion. This paper proposes the first large model in the field of nuclear fusion, XiHeFusion, which is obtained through supervised fine-tuning based on the open-source large model Qwen2.5-14B. We have collected multi-source knowledge about nuclear fusion tasks to support the training of this model, including the common crawl, eBooks, arXiv, dissertation, etc. After the model has mastered the knowledge of the nuclear fusion field, we further used the chain of thought to enhance its logical reasoning ability, making XiHeFusion able to provide more accurate and logical answers. In addition, we propose a test questionnaire containing 180+ questions to assess the conversational ability of this science popularization large model. Extensive experimental results show that our nuclear fusion dialogue model, XiHeFusion, can perform well in answering science popularization knowledge. The pre-trained XiHeFusion model is released on https://github.com/Event-AHU/XiHeFusion.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes