CV AI | Dec 10, 2025

Color encoding in Latent Space of Stable Diffusion Models

Guillem Arias, Ariadna Solà, Martí Armengod, Maria Vanrell

arXiv:2512.09477v18.42 citations | h-index: 20 | Color and Imaging Conference

Originality: Synthesis-oriented

AI Analysis

The work offers insights for researchers in model understanding and editing applications, but it is incremental: it analyzes an existing model without introducing new methods.

The paper investigates how color is encoded in the latent space of Stable Diffusion models. It finds that color information is encoded along circular, opponent axes in specific latent channels (c_3 and c_4), while intensity and shape are represented in others (c_1 and c_2).

Recent advances in diffusion-based generative models have achieved remarkable visual fidelity, yet a detailed understanding of how specific perceptual attributes, such as color and shape, are internally represented remains limited. This work explores how color is encoded in a generative model through a systematic analysis of the latent representations in Stable Diffusion. Through controlled synthetic datasets, principal component analysis (PCA), and similarity metrics, we reveal that color information is encoded along circular, opponent axes predominantly captured in latent channels c_3 and c_4, whereas intensity and shape are primarily represented in channels c_1 and c_2. Our findings indicate that the latent space of Stable Diffusion exhibits an interpretable structure aligned with an efficient coding representation. These insights provide a foundation for future work in model understanding, editing applications, and the design of more disentangled generative frameworks.
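To make the kind of analysis described in the abstract concrete, here is a minimal sketch on toy data. It is not the authors' pipeline and does not load Stable Diffusion. It assumes a hypothetical 4-channel latent in which hue is placed on a circle in channels c_3 and c_4, while c_1 and c_2 carry unrelated signals, and then checks which channels a circular hue model explains. The function names (`make_toy_latents`, `hue_r2`, `pca_project`) and all numeric settings are illustrative assumptions.

```python
import numpy as np


def make_toy_latents(n=360, seed=0):
    """Build hypothetical 4-channel latents (NOT real Stable Diffusion output).

    Assumption: hue lies on a circle in channels c_3 and c_4;
    c_1 and c_2 hold signals unrelated to hue.
    """
    rng = np.random.default_rng(seed)
    hue = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    latents = np.stack(
        [
            rng.uniform(-1.0, 1.0, n),       # c_1: stand-in for intensity
            0.5 * rng.standard_normal(n),    # c_2: stand-in for shape
            np.cos(hue),                     # c_3: first opponent axis
            np.sin(hue),                     # c_4: second opponent axis
        ],
        axis=1,
    )
    latents += 0.01 * rng.standard_normal(latents.shape)  # small noise
    return hue, latents


def hue_r2(channel, hue):
    """R^2 of a least-squares fit of one channel on [cos(hue), sin(hue), 1]."""
    design = np.column_stack([np.cos(hue), np.sin(hue), np.ones_like(hue)])
    coef, *_ = np.linalg.lstsq(design, channel, rcond=None)
    residual = channel - design @ coef
    total = channel - channel.mean()
    return 1.0 - (residual @ residual) / (total @ total)


def pca_project(x, k):
    """Project rows of x onto the top-k principal components (via SVD)."""
    centered = x - x.mean(axis=0)
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    explained = s**2 / np.sum(s**2)
    return centered @ vt[:k].T, explained[:k]


hue, latents = make_toy_latents()

# How much of each channel is explained by a circular hue model?
for i in range(latents.shape[1]):
    print(f"c_{i + 1}: R^2 vs. hue = {hue_r2(latents[:, i], hue):.3f}")

# PCA on the two assumed color channels: points should sit near a unit circle.
proj, explained = pca_project(latents[:, 2:], 2)
radius = np.linalg.norm(proj, axis=1)
print("radius range:", radius.min(), radius.max())
```

On real data, the toy generator would be replaced by latents obtained from the model's encoder on controlled color stimuli. The per-channel fit and the PCA step would stay the same in spirit, though the paper's exact procedure and metrics may differ.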
