CVJun 3, 2025

LayoutRAG: Retrieval-Augmented Model for Content-agnostic Conditional Layout Generation

arXiv:2506.02697v13 citationsh-index: 19
Originality Incremental advance
AI Analysis

This addresses the challenge of controllable layout generation for graphic design, offering an incremental improvement over previous methods.

The paper tackles the problem of generating optimal visual layouts under given constraints by proposing LayoutRAG, which retrieves layout templates as references to guide generation, and it outperforms existing state-of-the-art models in producing high-quality layouts.

Controllable layout generation aims to create plausible visual arrangements of element bounding boxes within a graphic design according to certain optional constraints, such as the type or position of a specific component. While recent diffusion or flow-matching models have achieved considerable advances in multifarious conditional generation tasks, there remains considerable room for generating optimal arrangements under given conditions. In this work, we propose to carry out layout generation through retrieving by conditions and reference-guided generation. Specifically, we retrieve appropriate layout templates according to given conditions as references. The references are then utilized to guide the denoising or flow-based transport process. By retrieving layouts compatible with the given conditions, we can uncover the potential information not explicitly provided in the given condition. Such an approach offers more effective guidance to the model during the generation process, in contrast to previous models that feed the condition to the model and let the model infer the unprovided layout attributes directly. Meanwhile, we design a condition-modulated attention that selectively absorbs retrieval knowledge, adapting to the difference between retrieved templates and given conditions. Extensive experiment results show that our method successfully produces high-quality layouts that meet the given conditions and outperforms existing state-of-the-art models. Code will be released upon acceptance.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes