CLAIFeb 27, 2025

Order Doesn't Matter, But Reasoning Does: Training LLMs with Order-Centric Augmentation

arXiv:2502.19907v21 citationsh-index: 22Has CodeEMNLP
Originality Incremental advance
AI Analysis

This addresses a key limitation in LLMs' logical reasoning for applications requiring robust inference, though it is incremental as it builds on existing augmentation techniques.

The paper tackles the problem of large language models struggling with reasoning order variations by introducing an order-centric data augmentation framework based on commutativity in logical reasoning, resulting in significant enhancements in reasoning performance and adaptability across multiple benchmarks.

Logical reasoning is essential for large language models (LLMs) to ensure accurate and coherent inference. However, LLMs struggle with reasoning order variations and fail to generalize across logically equivalent transformations. LLMs often rely on fixed sequential patterns rather than true logical understanding. To address this issue, we introduce an order-centric data augmentation framework based on commutativity in logical reasoning. We first randomly shuffle independent premises to introduce condition order augmentation. For reasoning steps, we construct a directed acyclic graph (DAG) to model dependencies between steps, which allows us to identify valid reorderings of steps while preserving logical correctness. By leveraging order-centric augmentations, models can develop a more flexible and generalized reasoning process. Finally, we conduct extensive experiments across multiple logical reasoning benchmarks, demonstrating that our method significantly enhances LLMs' reasoning performance and adaptability to diverse logical structures. We release our codes and augmented data in https://github.com/qianxiHe147/Order-Centric-Data-Augmentation.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes