CVAIOct 27, 2025

Nested AutoRegressive Models

arXiv:2510.23028v12 citationsh-index: 5
Originality Incremental advance
AI Analysis

This addresses computational bottlenecks in autoregressive image generation for researchers and practitioners, though it appears incremental as an architectural improvement.

The authors tackled the computational intensity and limited diversity of autoregressive image generation models by proposing a nested autoregressive architecture that reduces complexity from O(n) to O(log n) while increasing image diversity, achieving competitive performance with significantly lower computational cost.

AutoRegressive (AR) models have demonstrated competitive performance in image generation, achieving results comparable to those of diffusion models. However, their token-by-token image generation mechanism remains computationally intensive and existing solutions such as VAR often lead to limited sample diversity. In this work, we propose a Nested AutoRegressive~(NestAR) model, which proposes nested AutoRegressive architectures in generating images. NestAR designs multi-scale modules in a hierarchical order. These different scaled modules are constructed in an AR architecture, where one larger-scale module is conditioned on outputs from its previous smaller-scale module. Within each module, NestAR uses another AR structure to generate ``patches'' of tokens. The proposed nested AR architecture reduces the overall complexity from $\mathcal{O}(n)$ to $\mathcal{O}(\log n)$ in generating $n$ image tokens, as well as increases image diversities. NestAR further incorporates flow matching loss to use continuous tokens, and develops objectives to coordinate these multi-scale modules in model training. NestAR achieves competitive image generation performance while significantly lowering computational cost.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes