AI | Feb 12

Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision

Xiaohan He, Shiyang Feng, Songtao Huang, Lei Bai, Bin Wang, Bo Zhang

arXiv:2602.12164v1 | 2.4 | h-index: 3 | Has Code

Originality: Incremental advance

AI Analysis

This work addresses the fragility of LLMs in scientific reasoning and is aimed at researchers and practitioners. It is rated an incremental improvement, achieved through a novel hybrid method.

The paper targets two weaknesses of large language models on scientific reasoning tasks: unreliable solution evaluation and limited diversity in verification strategies. It proposes Sci-CoE, a two-stage co-evolving framework that, according to the authors, enhances complex reasoning capabilities and scales well on general scientific benchmarks.

Large language models (LLMs) have demonstrated exceptional reasoning capabilities, and co-evolving paradigms have shown promising results in domains such as code and math. However, in scientific reasoning tasks, these models remain fragile due to unreliable solution evaluation and limited diversity in verification strategies. In this work, we propose Sci-CoE, a two-stage scientific co-evolving framework that enables models to self-evolve as both solver and verifier through a transition from sparse supervision to unsupervised learning. In the first stage, the model uses a small set of annotated data to establish fundamental correctness judgment anchors for the Verifier. In the second stage, we introduce a geometric reward mechanism that jointly considers consensus, reliability, and diversity, driving large-scale self-iteration on unlabeled data. Experiments on several general scientific benchmarks demonstrate that Sci-CoE enhances complex reasoning capabilities and exhibits strong scalability, facilitating the construction of more robust and diverse evaluation systems. Code is available at https://github.com/InternScience/Sci-CoE.
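
The abstract does not give the formula for the reward, so the following is only an illustrative sketch of how three signals such as consensus, reliability, and diversity could be combined into one scalar. It assumes each signal is a score in [0, 1] and combines them with a geometric mean, which is one possible reading of "geometric" and may not match the authors' mechanism. The function names `consensus`, `diversity`, and `combined_reward` are hypothetical and do not come from the Sci-CoE code.

```python
import math
from collections import Counter


def consensus(answers):
    """Hypothetical consensus score: share of answers matching the majority answer."""
    counts = Counter(answers)
    return counts.most_common(1)[0][1] / len(answers)


def diversity(strategies):
    """Hypothetical diversity score: share of distinct verification strategies."""
    return len(set(strategies)) / len(strategies)


def combined_reward(consensus_score, reliability_score, diversity_score, eps=1e-8):
    """Geometric mean of three scores in [0, 1].

    Assumption: this is an illustrative combination rule, not the paper's.
    A geometric mean stays low whenever any single score is low, so one
    strong signal cannot compensate for a missing one.
    """
    scores = [consensus_score, reliability_score, diversity_score]
    for s in scores:
        if not 0.0 <= s <= 1.0:
            raise ValueError("scores must lie in [0, 1]")
    # Clamp to eps so that log(0) is never evaluated.
    log_sum = sum(math.log(max(s, eps)) for s in scores)
    return math.exp(log_sum / len(scores))


if __name__ == "__main__":
    c = consensus(["42", "42", "41", "42"])            # 3 of 4 agree
    d = diversity(["units", "units", "limit", "dim"])  # 3 distinct of 4
    r = 0.8  # placeholder reliability score, e.g. from a verifier
    print(combined_reward(c, r, d))
```

Under this reading, a solution that most samples agree on but that was checked with a single repeated strategy would still earn a modest reward, which matches the abstract's stated aim of rewarding the three properties jointly.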
