CVLGFeb 7, 2024

Towards Aligned Layout Generation via Diffusion Model with Aesthetic Constraints

arXiv:2402.04754v233 citationsh-index: 13ICLR
AI Analysis

This work addresses layout misalignment for graphic design applications, representing an incremental improvement over existing diffusion-based methods.

The paper tackles the problem of misalignment in controllable layout generation by proposing LACE, a continuous diffusion model that incorporates aesthetic constraints, achieving state-of-the-art performance in generating high-quality layouts.

Controllable layout generation refers to the process of creating a plausible visual arrangement of elements within a graphic design (e.g., document and web designs) with constraints representing design intentions. Although recent diffusion-based models have achieved state-of-the-art FID scores, they tend to exhibit more pronounced misalignment compared to earlier transformer-based models. In this work, we propose the $\textbf{LA}$yout $\textbf{C}$onstraint diffusion mod$\textbf{E}$l (LACE), a unified model to handle a broad range of layout generation tasks, such as arranging elements with specified attributes and refining or completing a coarse layout design. The model is based on continuous diffusion models. Compared with existing methods that use discrete diffusion models, continuous state-space design can enable the incorporation of differentiable aesthetic constraint functions in training. For conditional generation, we introduce conditions via masked input. Extensive experiment results show that LACE produces high-quality layouts and outperforms existing state-of-the-art baselines.

Code Implementations1 repo
Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes