CLJan 30

ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

arXiv:2601.23184v12 citationsh-index: 4Has Code
Originality Highly original
AI Analysis

This addresses the problem of computational inefficiency in reasoning for LLM users, offering an incremental improvement over prior latent methods.

The paper tackles the computational redundancy of explicit reasoning chains in LLMs by proposing ReGuLaR, a variational latent reasoning method guided by rendered CoT images, which outperforms existing latent methods in efficiency and effectiveness and even surpasses CoT through multi-modal reasoning.

While Chain-of-Thought (CoT) significantly enhances the performance of Large Language Models (LLMs), explicit reasoning chains introduce substantial computational redundancy. Recent latent reasoning methods attempt to mitigate this by compressing reasoning processes into latent space, but often suffer from severe performance degradation due to the lack of appropriate compression guidance. In this study, we propose Rendered CoT-Guided variational Latent Reasoning (ReGuLaR), a simple yet novel latent learning paradigm resolving this issue. Fundamentally, we formulate latent reasoning within the Variational Auto-Encoding (VAE) framework, sampling the current latent reasoning state from the posterior distribution conditioned on previous ones. Specifically, when learning this variational latent reasoning model, we render explicit reasoning chains as images, from which we extract dense visual-semantic representations to regularize the posterior distribution, thereby achieving efficient compression with minimal information loss. Extensive experiments demonstrate that ReGuLaR significantly outperforms existing latent reasoning methods across both computational efficiency and reasoning effectiveness, and even surpasses CoT through multi-modal reasoning, providing a new and insightful solution to latent reasoning. Code: https://github.com/FanmengWang/ReGuLaR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes