LGOct 7, 2025

On Powerful Ways to Generate: Autoregression, Diffusion, and Beyond

arXiv:2510.06190v25 citationsh-index: 3
Originality Incremental advance
AI Analysis

This work addresses the fundamental limitations of current language models for researchers and practitioners, offering a novel approach to extend LLMs beyond natural language to domains like coding and science, though it is incremental in building on existing diffusion and autoregressive methods.

The paper tackled the problem of understanding the computational power and limitations of non-autoregressive diffusion models compared to autoregressive models, showing that while vanilla masked diffusion models do not expand what autoregressive models can solve, a new any-process generation method enables scalability to significantly harder reasoning problems that are otherwise intractable.

Diffusion language models have recently emerged as a competitive alternative to autoregressive language models. Beyond next-token generation, they are more efficient and flexible by enabling parallel and any-order token generation. However, despite empirical successes, their computational power and fundamental limitations remain poorly understood. In this paper, we formally study whether non-autoregressive generation in Masked Diffusion Models (MDM) enables solving problems beyond the reach of Auto-Regressive Models (ARM). Our results show that MDM with sufficiently large context length is computationally universal with decoding steps matching the optimal parallel time complexity in PRAM. However, when controlling for other factors, MDM's flexibility to generate in any-order does not expand what ARM can already solve. To address this, we propose a new form of generation called any-process generation, which extends MDM with capabilities to remask, insert and delete tokens, allowing self-correction, length-variable editing, and adaptive parallelism. Theoretically and empirically, we demonstrate these capabilities enable scalability to significantly harder reasoning problems that are otherwise intractable for ARM and vanilla MDM. Additionally, they prove essential for generation tasks where objects naturally evolve through non-sequential processes, crucial for extending current LLMs beyond natural language to domains such as coding and science.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes