CVNov 21, 2025

Diversity Has Always Been There in Your Visual Autoregressive Models

arXiv:2511.17074v1Has Code
Originality Incremental advance
AI Analysis

This addresses a specific bottleneck in VAR models for image generation, offering an incremental improvement to enhance output variability.

The paper tackles the diversity collapse problem in Visual Autoregressive (VAR) models, where output variability is reduced, and introduces DiverseVAR, a method that restores generative diversity without additional training, achieving substantial enhancement with negligible performance impact.

Visual Autoregressive (VAR) models have recently garnered significant attention for their innovative next-scale prediction paradigm, offering notable advantages in both inference efficiency and image quality compared to traditional multi-step autoregressive (AR) and diffusion models. However, despite their efficiency, VAR models often suffer from the diversity collapse i.e., a reduction in output variability, analogous to that observed in few-step distilled diffusion models. In this paper, we introduce DiverseVAR, a simple yet effective approach that restores the generative diversity of VAR models without requiring any additional training. Our analysis reveals the pivotal component of the feature map as a key factor governing diversity formation at early scales. By suppressing the pivotal component in the model input and amplifying it in the model output, DiverseVAR effectively unlocks the inherent generative potential of VAR models while preserving high-fidelity synthesis. Empirical results demonstrate that our approach substantially enhances generative diversity with only neglectable performance influences. Our code will be publicly released at https://github.com/wangtong627/DiverseVAR.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes