PowerFlow: Unlocking the Dual Nature of LLMs via Principled Distribution Matching

Ruishuo Chen, Yu Chen, Zhuoran Li, Longbo Huang

arXiv:2603.1836333.0h-index: 4

AI Analysis

This work addresses the challenge of eliciting latent capabilities in LLMs without external supervision, offering a principled solution that improves performance and diversity in creative tasks, though it is incremental as it builds on existing RLIF paradigms.

The paper tackles the problem of unsupervised fine-tuning of Large Language Models (LLMs) by introducing PowerFlow, a framework that reformulates it as a distribution matching problem to mitigate biases and enable directional control, resulting in consistent outperformance over existing methods and matching or exceeding supervised approaches.

Unsupervised Reinforcement Learning from Internal Feedback (RLIF) has emerged as a promising paradigm for eliciting the latent capabilities of Large Language Models (LLMs) without external supervision. However, current methods rely on heuristic intrinsic rewards, which often lack a well-defined theoretical optimization target and are prone to degenerative biases. In this work, we introduce PowerFlow, a principled framework that reformulates unsupervised fine-tuning as a distribution matching problem. By casting GFlowNet as an amortized variational sampler for unnormalized densities, we propose a length-aware Trajectory-Balance objective that explicitly neutralizes the structural length biases inherent in autoregressive generation. By targeting $Î±$-power distributions, PowerFlow enables the directional elicitation of the dual nature of LLMs: sharpening the distribution ($Î±> 1$) to intensify logical reasoning, or flattening it ($Î±< 1$) to unlock expressive creativity. Extensive experiments demonstrate that PowerFlow consistently outperforms existing RLIF methods, matching or even exceeding supervised GRPO. Furthermore, by mitigating over-sharpening in aligned models, our approach achieves simultaneous gains in diversity and quality, shifting the Pareto frontier in creative tasks.

View on arXiv PDF

Similar